Publication Date
| In 2026 | 18 |
| Since 2025 | 2375 |
| Since 2022 (last 5 years) | 12890 |
| Since 2017 (last 10 years) | 34015 |
| Since 2007 (last 20 years) | 68506 |
Descriptor
| Foreign Countries | 30599 |
| Test Validity | 21771 |
| Scores | 18272 |
| Academic Achievement | 16940 |
| Test Construction | 16772 |
| Test Reliability | 15043 |
| Achievement Tests | 14867 |
| Standardized Tests | 14727 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13048 |
| Language Tests | 12555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2827 |
| Australia | 2432 |
| Canada | 2271 |
| California | 1857 |
| United States | 1728 |
| Texas | 1616 |
| China | 1580 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1205 |
| Germany | 1123 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Klender, Sara; Ferriby, Andrew; Notebaert, Andrew – HAPS Educator, 2019
Multiple-choice questions (MCQ) are commonly used on histology examinations. There are many guidelines for how to properly write MCQ and many of them recommend avoiding negatively worded stems. The current study aims to investigate differences between positively and negatively worded stems in a medical histology course by comparing the item…
Descriptors: Multiple Choice Tests, Science Tests, Biology, Test Construction
Çankaya, Elif M.; Cevik, Emel – Journal of Psychoeducational Assessment, 2019
The Youth Anxiety Measure for the "DSM"-5 (YAM-5) is a self-report and/or parent-report measure that was created to assess the full spectrum of anxiety disorder symptoms in children and adolescents aged 8 through 18. The scale consists of two related sections. The first section (YAM-5-I) evaluates the major anxiety disorders; the second…
Descriptors: Test Anxiety, Anxiety Disorders, Children, Adolescents
Andujar, Alberto; Cruz-Martínez, Maria Soledad – Language Learning in Higher Education, 2020
The present investigation explores Cognitive Test Anxiety (CTA) in a high-stake oral examination, taking into consideration how face-to-face and computer-based examination formats affect test-takers' anxiety and consequently language performance. Two speaking tests -- face-to-face and computer-based -- were developed for a Spanish university's…
Descriptors: Test Anxiety, High Stakes Tests, Verbal Tests, Computer Assisted Testing
Exploring the Validity Evidence of a High-Stake, Second Language Reading Test: An Eye-Tracking Study
Lim, Hyojung – Language Testing in Asia, 2020
The current study aims to explore the cognitive validity of the iBT TOEFL reading test by investigating test takers' eye movements on individual items. It is assumed that successful test takers would adopt the intended reading processes, the same types and levels of cognitive processes that they would use for real-world reading tasks. Forty-seven…
Descriptors: Test Validity, High Stakes Tests, Second Language Learning, Language Tests
Seeratan, Kavita L.; McElhaney, Kevin W.; Mislevy, Jessica; McGhee, Raymond, Jr.; Conger, Dylan; Long, Mark C. – Educational Assessment, 2020
We describe the conceptualization, design, development, validation, and testing of a summative instrument that measures high school students' ability to analyze and evaluate data, construct scientific explanations, and formulate scientific arguments in biology and chemistry disciplinary contexts. Data from 1,405 students were analyzed to evaluate…
Descriptors: High School Students, Science Process Skills, Student Evaluation, Science Tests
Tabatabaee-Yazdi, Mona – SAGE Open, 2020
The Hierarchical Diagnostic Classification Model (HDCM) reflects on the sequences of the presentation of the essential materials and attributes to answer the items of a test correctly. In this study, a foreign language reading comprehension test was analyzed employing HDCM and the generalized deterministic-input, noisy and gate (G-DINA) model to…
Descriptors: Diagnostic Tests, Classification, Models, Reading Comprehension
Karim Sadeghi; Neda Bakhshi – International Journal of Language Testing, 2025
Assessing language skills in an integrative form has drawn the attention of assessment experts in recent years. While some research data exists on integrative listening/reading-to-write assessment, there is comparatively little research literature on listening-to-speak integrated assessment. Also, little attention has been devoted to the role of…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Computer Assisted Testing
Mary Richardson; Jennie Golding; Tina Isaacs; Iain Barnes; David Wilkinson; Christina Swensson; Robbie Maris – UK Department for Education, 2025
Volume 1 of the "TIMSS 2023 National Report for England" analysed the TIMSS 2023 performance of a sample of year 5 and year 9 pupils in mathematics and science in England. This second volume of the "TIMSS 2023 National Report for England" focuses on performance in England analysed by pupil characteristics and participating…
Descriptors: Foreign Countries, Elementary Secondary Education, International Assessment, Achievement Tests
Yildirim, Hüseyin H. – Educational Assessment, Evaluation and Accountability, 2021
From a sociocognitive perspective, item parameters in a test represent regularities in examinees' item responses. These regularities are originated from shared experiences among individuals in interacting with their environment. Theories explaining the relationship between culture and cognition also acknowledge these shared experiences as the…
Descriptors: Educational Assessment, Test Items, Item Response Theory, Psychometrics
Buttiler, Maria Belen – TESL-EJ, 2021
In this study, I investigate whether three A2 Key for Schools practice tests from Cambridge Assessment English for 6th-graders (11-12 years old) in Argentina measure growth and produce scores that are meaningful. Drawing data from three consecutive years, I analyze the scores of 80 children over a school year and consider whether the tests are…
Descriptors: Elementary School Students, Language Tests, Second Language Learning, Second Language Instruction
Tallberg, Christian; Axelsson, Maria – International Association for the Evaluation of Educational Achievement, 2021
International large-scale assessments (ILSAs) have become an important part of the Swedish evaluation system. It is therefore of crucial importance to validate national measures of Swedish students' achievement with their ILSA test scores. Here, we offer results from such a validation study based on Swedish students' test scores in IEA's Trends in…
Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, Mathematics Tests
Wang, Lin – ETS Research Report Series, 2019
Rearranging response options in different versions of a test of multiple-choice items can be an effective strategy against cheating on the test. This study investigated if rearranging response options would affect item performance and test score comparability. A study test was assembled as the base version from which 3 variant versions were…
Descriptors: Multiple Choice Tests, Test Items, Test Format, Scores
Ip, Edward H.; Strachan, Tyler; Fu, Yanyan; Lay, Alexandra; Willse, John T.; Chen, Shyh-Huei; Rutkowski, Leslie; Ackerman, Terry – Journal of Educational Measurement, 2019
Test items must often be broad in scope to be ecologically valid. It is therefore almost inevitable that secondary dimensions are introduced into a test during test development. A cognitive test may require one or more abilities besides the primary ability to correctly respond to an item, in which case a unidimensional test score overestimates the…
Descriptors: Test Items, Test Bias, Test Construction, Scores
Aho, Carson; Werfel, Krystal L. – Language, Speech, and Hearing Services in Schools, 2021
Purpose: The purpose of this study was to determine if group differences exist in spelling accuracy or spelling errors between kindergarten children with hearing loss and children with normal hearing loss. Method: Participants included 23 kindergarten children with hearing loss and 21 children with normal hearing. All children used spoken English…
Descriptors: Spelling, Kindergarten, Hearing Impairments, Error Analysis (Language)

Peer reviewed
Direct link
