Publication Date
| In 2026 | 6 |
| Since 2025 | 2195 |
| Since 2022 (last 5 years) | 12710 |
| Since 2017 (last 10 years) | 33835 |
| Since 2007 (last 20 years) | 68326 |
Descriptor
| Foreign Countries | 30532 |
| Test Validity | 21728 |
| Scores | 18248 |
| Academic Achievement | 16912 |
| Test Construction | 16738 |
| Test Reliability | 15015 |
| Achievement Tests | 14839 |
| Standardized Tests | 14712 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13038 |
| Language Tests | 12549 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3391 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2815 |
| Australia | 2426 |
| Canada | 2269 |
| California | 1853 |
| United States | 1725 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1121 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Kathryn R. Glodowski; Yusuke Hayashi – Journal of Applied Behavior Analysis, 2025
The testing effect is a well-established phenomenon in cognitive psychology that refers to enhanced long-term retention of information due to active recalling through testing. Following a cross-disciplinary translation of the testing effect into behavioral principles, we systematically replicated the previous findings in a behavior-analytic…
Descriptors: Testing, Replication (Evaluation), Tests, Test Length
Neset Demirci – Turkish Online Journal of Educational Technology - TOJET, 2025
In this study, the performance of artificial intelligence chatbots--OpenAI's ChatGPT, Google Gemini, and Microsoft's Copilot--was evaluated and compared based on their responses to questions from the Turkish Higher Education Entrance Physics Examination over the past three years. Analysis of the chatbots' responses to TYT Physics questions showed…
Descriptors: Artificial Intelligence, College Entrance Examinations, Physics, Science Tests
Changiz Mohiyeddini – Anatomical Sciences Education, 2025
This article presents a step-by-step guide to using R and SPSS to bootstrap exam questions. Bootstrapping, a versatile nonparametric analytical technique, can help to improve the psychometric qualities of exam questions in the process of quality assurance. Bootstrapping is particularly useful in disciplines such as medical education, where student…
Descriptors: Test Items, Sampling, Statistical Inference, Nonparametric Statistics
Zoe Stephenson; Amy Jackson – Assessment & Evaluation in Higher Education, 2025
Despite the positive experiences of many candidates and the existence of many skilled examiners, the closed-door viva has been widely scrutinised on the basis of issues surrounding its reliability and quality. This study aimed to explore the views of examiners regarding this assessment method; specifically, in relation to how the issues with…
Descriptors: Verbal Tests, Doctoral Students, Examiners, Attitudes
Burcu Büge; Iasmina Tsvetkova – International Journal of Psychology and Educational Studies, 2025
The current paper was targeted to validate the Self-Hate Scale (SHS; Turnell et al., 2019) for application within the Russian-speaking community, as it examined its validity and reliability within a sample of 302 participants. Subsequent to the translation procedures, a strong positive relationship between the English and the Russian versions was…
Descriptors: Self Concept Measures, Test Reliability, Test Validity, Russian
Matthew C. Lambert; Michael H. Epstein; Douglas Cullinan – Journal of Psychoeducational Assessment, 2025
Research and policy reports estimate that 10%-40% of U.S. children and adolescents currently have or very recently have had at least one significant mental health condition. Students who exhibit substantial behavior and emotional problems in school often show less severe problems when younger. Screening for less severe problems at younger ages can…
Descriptors: Elementary School Students, Screening Tests, Emotional Disturbances, Test Validity
Vahe Permzadian; Kit W. Cho – Assessment & Evaluation in Higher Education, 2025
Since the COVID-19 pandemic, the use of open-book examinations (OBE) in higher education has increased compared to closed-book examinations (CBE), raising questions about the relative efficacy of these two major examination formats. We review the effects of CBE and OBE on learning and examine potential moderating variables related to study,…
Descriptors: Test Format, Tests, College Students, Student Evaluation
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Fuat Ozcan; Ali Meydan – Journal of Education in Science, Environment and Health, 2024
The goal of this study is to create the Zero Waste Attitude Scale, which will be used to determine the zero-waste attitude of social studies teacher candidates and to conduct validity and reliability studies. The data for the study were collected with a 5-point Likert-type form from pre-service teachers studying in the social studies teaching…
Descriptors: Test Construction, Preservice Teachers, Social Studies, Test Validity
Cameron Downing; Markéta Caravolas – Reading and Writing: An Interdisciplinary Journal, 2024
Spelling and handwriting are related skills which are critical for writing but are typically assessed separately. Doing so makes it more difficult to understand their respective development. We describe the creation and evaluation of a tool for their concurrent assessment: the Spelling and Handwriting Legibility Test (SaHLT). We examined whether…
Descriptors: Spelling, Handwriting, Writing Skills, Test Construction
Denise Swanson; Gerald Tindal – Behavioral Research and Teaching, 2024
This technical report provides an authoritative bibliographic resource of all the studies conducted on "easyCBM"® and published on the main website for Behavioral Research and Teaching under Publications (https://brtprojects.org). The "easyCBM"© software is a direct descendent of "Curriculum-based Measurement" (CBM)…
Descriptors: Bibliographies, Computer Software, Test Construction, Test Reliability
Ozan Evrim Tunca; Evrim Genc Kumtepe; Sukru Torun; Yusuf Zafer Can Ugurhan – International Journal of Music Education, 2024
In Turkey, children are accepted to conservatory music departments after fourth grade and fine arts high school music departments after eighth grade by taking a musical talent test. For students with high musical aural skills to know about their potential and be directed to the related education institutions there needs to be a valid test. This…
Descriptors: Foreign Countries, Test Construction, Music Theory, Test Validity
Darmawan Muttaqin – Journal of Psychoeducational Assessment, 2024
The Vocational Identity Status Assessment (VISA) is one of the instruments that can be used to assess vocational identity. Conceptually, VISA consists of six sub-dimensions and has been validated using factor analysis. This study provides a factor structure test of the Indonesian version of VISA using the exploratory structural equation modeling…
Descriptors: Foreign Countries, Structural Equation Models, Vocational Interests, Occupational Tests
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024
This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…
Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory

Peer reviewed
Direct link
