Publication Date
| In 2026 | 12 |
| Since 2025 | 958 |
| Since 2022 (last 5 years) | 4567 |
| Since 2017 (last 10 years) | 10500 |
| Since 2007 (last 20 years) | 21963 |
Descriptor
| Test Validity | 21786 |
| Validity | 13791 |
| Test Reliability | 10864 |
| Foreign Countries | 9887 |
| Test Construction | 6897 |
| Factor Analysis | 5761 |
| Measures (Individuals) | 5633 |
| Predictive Validity | 5022 |
| Psychometrics | 4820 |
| Reliability | 4635 |
| Correlation | 4376 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1397 |
| Australia | 705 |
| Canada | 626 |
| China | 528 |
| United States | 439 |
| Indonesia | 389 |
| United Kingdom | 363 |
| Germany | 340 |
| California | 338 |
| Netherlands | 336 |
| Spain | 311 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Bilker, Warren B.; Hansen, John A.; Brensinger, Colleen M.; Richard, Jan; Gur, Raquel E.; Gur, Ruben C. – Assessment, 2012
The Raven's Standard Progressive Matrices (RSPM) is a 60-item test for measuring abstract reasoning, considered a nonverbal estimate of fluid intelligence, and often included in clinical assessment batteries and research on patients with cognitive deficits. The goal was to develop and apply a predictive model approach to reduce the number of items…
Descriptors: Intelligence Tests, Abstract Reasoning, Test Items, Test Construction
Raykov, Tenko; Marcoulides, George A. – Structural Equation Modeling: A Multidisciplinary Journal, 2012
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
Descriptors: Predictive Validity, Reliability, Structural Equation Models, Measures (Individuals)
Looney, Marilyn A.; Gilbert, Jennie – Measurement in Physical Education and Exercise Science, 2012
The purpose of the study was to determine if currently used FITNESSGRAM[R] cut-off scores for the Back Saver Sit and Reach Test had the best criterion-referenced validity evidence for 6-12 year old children. Secondary analyses of an existing data set focused on the passive straight leg raise and Back Saver Sit and Reach Test flexibility scores of…
Descriptors: Cutting Scores, Physical Fitness, Tests, Test Validity
Butler, Heather A.; Dwyer, Christopher P.; Hogan, Michael J.; Franco, Amanda; Rivas, Silvia F.; Saiz, Carlos; Almeida, Leandro S. – Thinking Skills and Creativity, 2012
The Halpern Critical Thinking Assessment (HCTA) is a reliable measure of critical thinking that has been validated with numerous qualitatively different samples and measures of academic success (Halpern, 2010a). This paper presents several cross-national applications of the assessment, and recent work to expand the validation of the HCTA with…
Descriptors: Critical Thinking, Measures (Individuals), Behavior, Predictive Validity
Spagnoli, Paola; Caetano, Antonio; Silva, Ana – Social Indicators Research, 2012
The Subjective Happiness Scale (SHS) constitutes an instrument for assessing subjective happiness. This study aims to present the validation of the SHS in a Portuguese adult population. A large representative sample (1,017 participants), from five different age groups was considered. Configurational invariance of the unidimensional structure of…
Descriptors: Life Satisfaction, Validity, Measures (Individuals), Psychometrics
Bloom, Howard S. – Journal of Research on Educational Effectiveness, 2012
This article provides a detailed discussion of the theory and practice of modern regression discontinuity (RD) analysis for estimating the effects of interventions or treatments. Part 1 briefly chronicles the history of RD analysis and summarizes its past applications. Part 2 explains how in theory an RD analysis can identify an average effect of…
Descriptors: Regression (Statistics), Research Design, Cutting Scores, Computation
Smith, Christopher J.; Tefera, Akalu; Zeleke, Aklilu – International Journal of Mathematical Education in Science and Technology, 2012
The use of computer algebra systems such as Maple and Mathematica is becoming increasingly important and widespread in mathematics learning, teaching and research. In this article, we present computerized proof techniques of Gosper, Wilf-Zeilberger and Zeilberger that can be used for enhancing the teaching and learning of topics in discrete…
Descriptors: Algebra, Mathematics Education, Undergraduate Students, Higher Education
Ertmer, David J.; Jung, Jongmin – American Journal of Speech-Language Pathology, 2012
Purpose: To determine the concurrent validity of the Conditioned Assessment of Speech Production (CASP; Ertmer & Stoel-Gammon, 2008) and data obtained from speech samples recorded at the same intervals. Method: Nineteen children who are deaf who received cochlear implants before their 3rd birthdays participated in the study. Speech samples and…
Descriptors: Assistive Technology, Validity, Speech, Intervals
Shaw, Emily J.; McKenzie, Elizabeth – College Board, 2010
[Slides] presented at the annual conference of the Southern Association for College Admission Counseling, April 2010. This presentation summarizes recent research from the national SAT Validity Study and includes information on the Admitted Class Evaluation Service (ACES) system and how ACES can help institutions conduct their own validity…
Descriptors: College Entrance Examinations, Test Validity, Educational Research, Predictive Validity
Lee, Yi-Hsuan; Jia, Yue – Large-scale Assessments in Education, 2014
Background: Large-scale survey assessments have been used for decades to monitor what students know and can do. Such assessments aim at providing group-level scores for various populations, with little or no consequence to individual students for their test performance. Students' test-taking behaviors in survey assessments, particularly the level…
Descriptors: Measurement, Test Wiseness, Student Surveys, Response Style (Tests)
Moore, Brooke A.; Klingner, Janette K. – Journal of Learning Disabilities, 2014
This article synthesizes reading intervention research studies intended for use with struggling or at-risk students to determine which studies adequately address population validity, particularly in regard to the diverse reading needs of English language learners. An extensive search of the professional literature between 2001 and 2010 yielded a…
Descriptors: English Language Learners, Student Needs, Reading Instruction, Literature Reviews
Koç, Hakan; Demir, Selçuk Besir – Review of International Geographical Education Online, 2014
The purpose of the present study is to develop a valid and reliable map literacy scale that is able to determine map literacy of individuals, especially that of high school and university students. The study sample was composed of 518 students studying at various faculties at Cumhuriyet University and high schools in Sivas and its counties. With…
Descriptors: Map Skills, Test Construction, High School Students, College Students
Klieger, David M.; Cline, Frederick A.; Holtzman, Steven L.; Minsky, Jennifer L.; Lorenz, Florian – ETS Research Report Series, 2014
Given the serious consequences of making ill-fated admissions and funding decisions for applicants to graduate and professional school, it is important to rely on sound evidence to optimize such judgments. Previous meta-analytic research has demonstrated the generalizable validity of the "GRE"® General Test for predicting academic…
Descriptors: College Entrance Examinations, Graduate Study, Prediction, Predictive Validity
Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Barnhisel, Greg; Rapchak, Marcia – Communications in Information Literacy, 2014
Students in a senior English class examined the question of whether the "wisdom of experts" or "the wisdom of crowds" is more reliable and useful in a writing course by engaging in a parallel Wikipedia project. Each student either created a new entry or made significant changes to an existing Wikipedia entry, tracked changes to…
Descriptors: College Seniors, College English, Encyclopedias, Collaborative Writing

Peer reviewed
Direct link
