Publication Date
| In 2026 | 1 |
| Since 2025 | 37 |
| Since 2022 (last 5 years) | 132 |
| Since 2017 (last 10 years) | 402 |
| Since 2007 (last 20 years) | 707 |
Descriptor
| Scores | 1119 |
| Test Reliability | 1119 |
| Test Validity | 595 |
| Foreign Countries | 272 |
| Psychometrics | 238 |
| Test Construction | 227 |
| Correlation | 197 |
| Factor Analysis | 182 |
| Test Items | 171 |
| Statistical Analysis | 147 |
| Measures (Individuals) | 134 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 22 |
| Practitioners | 19 |
| Teachers | 4 |
| Administrators | 3 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
| Policymakers | 1 |
Location
| Turkey | 48 |
| Canada | 16 |
| China | 16 |
| Germany | 13 |
| United Kingdom | 13 |
| Australia | 12 |
| Netherlands | 12 |
| United Kingdom (England) | 12 |
| Spain | 11 |
| Texas | 11 |
| United States | 11 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 4 |
| No Child Left Behind Act 2001 | 2 |
| Race to the Top | 2 |
| Elementary and Secondary… | 1 |
| Elementary and Secondary… | 1 |
| Every Student Succeeds Act… | 1 |
| Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Abdulkadir Haktanir; M. Furkan Kurnaz; Zeynep Simsir Gökalp – Measurement and Evaluation in Counseling and Development, 2024
Objective: Brief Self-Control Scale (BSCS) is the most widely used instrument to assess self-control. The purpose of this reliability generalization meta-analysis was to examine the degree to which consistency reliability coefficients for scores on the BSCS generalize across age groups and languages. Method: We included studies using the BSCS and…
Descriptors: Self Control, Measures (Individuals), Meta Analysis, Test Reliability
Muhammed Tayyib Kadak; Nihal Serdengeçti; Meryem Seçen Yazici; Tuncay Sandikçi; Aybike Aydin; Zehra Koyuncu; Yavuz Meral; Abas Hasimoglu; Yasin Çaliskan; Gizem Bayraktar; Elif Can Öztürk; Mehmet Enes Gökler; Roula Choueiri; Mahmut Cem Tarakçioglu – Autism: The International Journal of Research and Practice, 2024
This study aims to investigate the validation of the Rapid Interactive Screening Test for Autism in Toddlers (RITA-T) in Turkish toddlers between 18 and 36 months of age. Children aged 18-36 months were referred to the department of child psychiatry for concerns of autism spectrum disorder, language disorder, developmental delay, and typically…
Descriptors: Foreign Countries, Turkish, Screening Tests, Autism Spectrum Disorders
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Enrico Gandolfi; Richard E. Ferdig – Educational Technology Research and Development, 2025
Augmented Reality (AR) is increasingly being adopted in education to foster engagement and interest in a variety of subjects and content areas. However, there is a scarcity of instruments to measure the instructional impact of this innovation. This article addresses this gap in two unique ways. First, it presents validation results of the…
Descriptors: Simulated Environment, Measures (Individuals), Rating Scales, Item Response Theory
Hanif Akhtar; Retno Firdiyanti – Journal of Psychoeducational Assessment, 2025
Psychometric Properties of the Scale of Positive and Negative Experience (SPANE) have been extensively evaluated in numerous countries, but not in Indonesia. This study investigated factor structure, reliability, measurement invariance, and validity of SPANE scores among a sample of Indonesian university students (N = 405). Multiple measurement…
Descriptors: Foreign Countries, Affective Measures, Psychometrics, Factor Structure
Karrie A. Shogren; Daria Gerasimova; Yves Lachapelle; Dany Lussier-Desrochers; Mayumi Hagiwara; Geneviève Petitpierre; Barbara Fontana-Lana; Filippo Piazza; Yannick Courbois; Agnès Desbiens; Marie-Claire Haelewyck; Hélène Geurts; Jesse R. Pace; Tyler Hicks – Intellectual and Developmental Disabilities, 2024
There is a strong and growing focus on self-determination in French-speaking countries, and this pilot study reports the technical adequacy of the Self-Determination Inventory: Student Report (SDI:SR) French Translation. Data were collected with 471 French-speaking youth with and without disabilities in Canada (Quebec), Switzerland, France, and…
Descriptors: Measures (Individuals), Self Determination, Test Reliability, Test Validity
Cátia Marques; Íris M. Oliveira; Jaisso Vautero; Ana Daniela Silva – International Journal for Educational and Vocational Guidance, 2024
This study examined the psychometric properties of the Career Adapt-Abilities Scale in a Lebanese sample. The study includes 236 Lebanese citizens (54.2% women; M[subscript age] = 30.14). Confirmatory factor analyses indicated that a hierarchical model yielded a good fit, with the CAAS measuring four distinct dimensions that can be combined in a…
Descriptors: Psychometrics, Career Development, Factor Analysis, Goodness of Fit
D. Betsy McCoach; Scott Peters; Anthony J. Gambino; Daniel Long; Del Siegle – Grantee Submission, 2024
Teacher rating scales (TRS) often play a part in service eligibility decisions for gifted services. Although schools regularly use TRS to identify gifted students either as part of an informal nomination process or through behavioral rating scales, there is little research documenting the between-teacher variance in teacher ratings and the…
Descriptors: Gifted Education, Rating Scales, Academically Gifted, Academic Achievement
D. Betsy McCoach; Scott Peters; Anthony J. Gambino; Daniel Long; Del Siegle – Exceptional Children, 2024
Teacher rating scales (TRS) often play a part in service eligibility decisions for gifted services. Although schools regularly use TRS to identify gifted students either as part of an informal nomination process or through behavioral rating scales, there is little research documenting the between-teacher variance in teacher ratings and the…
Descriptors: Gifted Education, Rating Scales, Academically Gifted, Academic Achievement
Ugur Orhan; Eda Demirhan – Research in Science Education, 2025
Throughout the world scientific reasoning (SR) is a valuable and desirable ability to gain deeper understanding of science in all grade level. In the current study, we first adapted the SPR-I (7) which consists of seven items with three sub-dimensions as the experimentation, the understanding the nature of science (NOS) and the data…
Descriptors: Foreign Countries, Elementary School Students, Science Process Skills, Thinking Skills
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025
In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…
Descriptors: Automation, Grading, Computer Assisted Testing, Scoring
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024
International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Kelly Francis – ProQuest LLC, 2024
The current study involved the development, scaling, and validation of a new, brief, strength-based measure of children's ecological support, as rated by their parents. The scaling and validation of the Child and Youth Ecological Assets Scale--Parent Form (C-YEAS-P) took place through two studies. The first study involved 500 parents who were…
Descriptors: Measures (Individuals), Test Construction, Ability, Children

Peer reviewed
Direct link
