Camille Landesvatter; Paul C. Bauer – Sociological Methods & Research, 2025
Trust is a foundational concept of contemporary sociological theory. Still, empirical research on trust relies on a relatively small set of measures. These are increasingly debated, potentially undermining large swathes of empirical evidence. Drawing on a combination of open-ended probing data, supervised machine learning, and a U.S.…
Descriptors: Trust (Psychology), Surveys, Measures (Individuals), Test Validity
Tiffany Wu; Christina Weiland; Meghan McCormick; JoAnn Hsueh; Catherine Snow; Jason Sachs – Grantee Submission, 2024
The Hearts and Flowers (H&F) task is a computerized executive functioning (EF) assessment that has been used to measure EF from early childhood to adulthood. It provides data on accuracy and reaction time (RT) across three different task blocks (hearts, flowers, and mixed). However, there is a lack of consensus in the field on how to score the…
Descriptors: Scoring, Executive Function, Kindergarten, Young Children
Zafer Ozen; Nielsen Pereira; Tugce Karatas; Hernán Castillo-Hermosilla; Yukiko Maeda – Gifted Child Quarterly, 2025
The Cognitive Abilities Test (CogAT) is one of the most frequently used gifted identification tools. In this meta-analytic study, we investigated empirical evidence of the validity of the CogAT in relation to different types of instruments. After reviewing 1,480 studies, a total of 24 with 33 effect sizes were included in the meta-analysis. According to…
Descriptors: Test Validity, Cognitive Tests, Disability Identification, Scores
Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Chet Robie; Sabah Rasheed; Stephen D. Risavy; Piers Steel – International Journal of Testing, 2024
This meta-analysis examined the validity of an alternative to traditional assessments called the Wonderlic, which is a brief measure of general mental ability. Our results showed significant, positive correlations between Wonderlic scores and academic performance in general ([r-bar] = 0.26), between Wonderlic scores and undergraduate GPA in…
Descriptors: Meta Analysis, Test Validity, Alternative Assessment, Scores
Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025
Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…
Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition
Meyer, J. Patrick; Hu, Ann; Li, Sylvia – NWEA, 2023
The Content Proximity Project was designed to improve the content validity of the MAP® Growth™ assessments while retaining the ability for the test to adapt off-grade and meet students wherever they are in their learning. Two main features of the project were the development of an enhanced item selection algorithm, and a spring pilot study…
Descriptors: Achievement Tests, Mathematics Achievement, Content Validity, Mathematics Tests
Johayra Bouza; Rebecca J. Bulotsky-Shearer; Krystal M. Bichay-Awadala; Jhonelle Bailey; Patricia Gaona; Lisa White; Veronica A. Fernandez – Psychology in the Schools, 2024
The purpose of this study was to validate the Spanish version of the Family Involvement Questionnaire-Short Form (FIQ-SF) for use with Spanish-speaking families of children enrolled in early childhood education programs. This study examined the factor structure of the FIQ-SF and established criterion validity for the resulting FIQ-SF dimension…
Descriptors: Family Involvement, Questionnaires, Early Childhood Education, Spanish
Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025
The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…
Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing
Huei-Wen Tsai; Ching-Ling Cheng – Journal of Psychoeducational Assessment, 2025
This study aimed to evaluate the psychometric properties and gather evidence supporting the validity of scores from a traditional Chinese version of the Claremont Purpose Scale (TC-CPS) among Taiwanese adolescents. The TC-CPS, measuring meaningfulness, goal directedness, and beyond-the-self orientation, was administered to 233 high school and 445…
Descriptors: Foreign Countries, Adolescents, Measures (Individuals), High School Students
Ebru Balta; Arzu Uçar – International Journal of Assessment Tools in Education, 2025
Unproctored Computerized Adaptive Testing (CAT) is gaining traction due to its convenience, flexibility, and scalability, particularly in high-stakes assessments. However, the lack of a proctor can give rise to aberrant testing behaviors, which can impair the validity of test scores. This paper explores the use of a verification test to…
Descriptors: Adaptive Testing, Computer Assisted Testing, Paper and Pencil Tests, Test Validity
Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023
Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…
Descriptors: Test Interpretation, Scores, Test Use, Test Validity
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: The confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Allison R. Lombardi; Graham G. Rifenbark; H. Jane Rogers; Hariharan Swaminathan; Ashley Taconet; Valerie L. Mazzotti; Mary E. Morningstar; Rongxiu Wu; Shannon Langdon – Career Development and Transition for Exceptional Individuals, 2023
The purpose of this study was to establish construct validity of a college and career readiness measure using a sample of youth with (n = 356) and without (n = 1,599) disabilities from five high schools across three U.S. states. We established content validity through expert item review, structural validity through initial field-testing, and…
Descriptors: Test Validity, Construct Validity, Adolescents, College Readiness
Rodrigo Moreta-Herrera; Jacqueline Regatto-Bonifaz; Víctor Viteri-Miranda; María Gorety Rodríguez-Vieira; Giancarlo Magro-Lazo; Jose A. Rodas; Sergio Dominguez-Lara – Journal of Psychoeducational Assessment, 2025
Objective: Analyze the evidence of validity of scores of the Academic Procrastination Scale (APS), its measurement equivalence based on nationality, the reliability of its scores, and its validity in relation to other variables in university students from Ecuador, Venezuela, and Peru. Method: This paper involves a quantitative, descriptive,…
Descriptors: Measures (Individuals), Time Management, College Students, Foreign Countries

