Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025
Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…
Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
Huei-Wen Tsai; Ching-Ling Cheng – Journal of Psychoeducational Assessment, 2025
This study aimed to evaluate the psychometric properties and gather evidence supporting the validity of scores from a traditional Chinese version of the Claremont Purpose Scale (TC-CPS) among Taiwanese adolescents. The TC-CPS, measuring meaningfulness, goal directedness, and beyond-the-self orientation, was administered to 233 high school and 445…
Descriptors: Foreign Countries, Adolescents, Measures (Individuals), High School Students
José Antonio López-López; Rubén López-Nicolás; Alejandro Sandoval-Lentisco; Julio Sánchez-Meca; Alejandro Veas – Journal of Psychoeducational Assessment, 2025
The School Attitude Assessment Survey-Revised (SAAS-R) is a popular scale for assessing attitudinal and motivational aspects of students' academic achievement. However, evidence on key psychometric properties of the SAAS-R such as reliability remains limited. We conducted a reliability generalization study of the SAAS-R using meta-analytic…
Descriptors: Attitude Measures, Student Attitudes, School Attitudes, Psychometrics
Orhan, Ali – Journal of Psychoeducational Assessment, 2022
The aims of this reliability generalization study were to provide the overall alpha values of the California critical thinking disposition inventory (CCTDI) total score and subscales scores and investigate the characteristics of the studies that may be associated with the variability in the reliability values of the CCTDI total score and subscales…
Descriptors: Critical Thinking, Measures (Individuals), Test Reliability, Generalization
Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
Sojeong Nam; Byeolbee Um; Jeongwoon Jeong; Monique Rodriguez; David Lardier – Measurement and Evaluation in Counseling and Development, 2024
This study aimed to provide meta-analytic reliability information of the Columbia-Suicide Severity Rating Scale (C-SSRS). We implemented systematic search procedures to 35 eligible studies (N = 23,247; mean age = 26.74 years) that reported reliability estimates. The synthesized average values of Cronbach's alpha were 0.88 (95% CI [0.85, 0.92]) for the…
Descriptors: Scores, Test Reliability, Rating Scales, Suicide
M. Arda Atakaya; Ugur Sak; M. Bahadir Ayas – Creativity Research Journal, 2024
Scoring in creativity research has been a central problem since creativity became an important issue in psychology and education in the 1950s. The current study examined the psychometric properties of 27 creativity indices derived from summed and averaged scores using 15 scoring methods. Participants included 2802 middle-school students. Data…
Descriptors: Psychometrics, Creativity, Creativity Tests, Scoring
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Rodrigo Moreta-Herrera; Jacqueline Regatto-Bonifaz; Víctor Viteri-Miranda; María Gorety Rodríguez-Vieira; Giancarlo Magro-Lazo; Jose A. Rodas; Sergio Dominguez-Lara – Journal of Psychoeducational Assessment, 2025
Objective: Analyze the evidence of validity of scores of the Academic Procrastination Scale (APS), its measurement equivalence based on nationality, its reliability of the scores, and its validity in relation to other variables in university students from Ecuador, Venezuela, and Peru. Method: This paper involves a quantitative, descriptive,…
Descriptors: Measures (Individuals), Time Management, College Students, Foreign Countries
Yuriko K. Sosa Paredes; Björn Andersson – Educational Assessment, Evaluation and Accountability, 2025
In international large-scale assessments, student performance comparisons across educational systems are frequently done to assess the state and development in different domains. These results often have a large impact on educational policy and on the perceptions of an educational system's performance. Early assessments, such as the First and…
Descriptors: Test Interpretation, International Assessment, Science Tests, Scores
Aberdine R. Dwight; Amy M. Briesch; Jessica A. Hoffman; Christopher Rutt – Child & Youth Care Forum, 2024
Background: Although the Depression Anxiety Stress Scales, Short Form (DASS-21) was developed for adults, its authors noted no compelling reasons to not use the measure with youth as young as 12 years. Despite increasingly widespread use with youth, psychometric evidence in support of its use with this population needs to be investigated to fully…
Descriptors: Depression (Psychology), Measures (Individuals), Anxiety, Stress Variables
Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025
Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…
Descriptors: Value Added Models, Tests, Testing, Scoring
Ryan, Joseph J.; Gontkovsky, Samuel T. – Journal of Psychoeducational Assessment, 2021
We analyzed data from the WASI-II manual to determine discrepancy score reliabilities of the Verbal Comprehension (VCI) and Perceptual Reasoning (PRI) indexes and the four subtests in the child and adult standardization samples. Reliabilities of the VCI-PRI discrepancy scores range from 0.78 to 0.86 for children and 0.82 to 0.89 for adults and…
Descriptors: Intelligence Tests, Test Reliability, Scores, Children
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

