Publication Date
| In 2026 | 18 |
| Since 2025 | 2375 |
| Since 2022 (last 5 years) | 12890 |
| Since 2017 (last 10 years) | 34015 |
| Since 2007 (last 20 years) | 68506 |
Descriptor
| Foreign Countries | 30599 |
| Test Validity | 21771 |
| Scores | 18272 |
| Academic Achievement | 16940 |
| Test Construction | 16772 |
| Test Reliability | 15043 |
| Achievement Tests | 14867 |
| Standardized Tests | 14727 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13048 |
| Language Tests | 12555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2827 |
| Australia | 2432 |
| Canada | 2271 |
| California | 1857 |
| United States | 1728 |
| Texas | 1616 |
| China | 1580 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1205 |
| Germany | 1123 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Denis Dumas; Selcuk Acar; Kelly Berthiaume; Peter Organisciak; David Eby; Katalin Grajzel; Theadora Vlaamster; Michele Newman; Melanie Carrera – Grantee Submission, 2023
Open-ended verbal creativity assessments are commonly administered in psychological research and in educational practice to elementary-aged children. Children's responses are then typically rated by teams of judges who are trained to identify original ideas, hopefully with a degree of inter-rater agreement. Even in cases where the judges are…
Descriptors: Elementary School Students, Grade 3, Grade 4, Grade 5
Jessica B. Koslouski; Kristabel Stark; Sandra M. Chafouleas; T. Chris Riley-Tillman – Grantee Submission, 2023
Social, emotional, and behavioral (SEB) instruments are currently used in schools to screen, refer, and progress monitor students. Although many of these instruments have demonstrated strong technical adequacy, there has been far less examination of their consequential validity--that is, positive or negative intended and unintended consequences of…
Descriptors: Behavior Rating Scales, Screening Tests, Test Validity, Scores
Xin Wei; Susu Zhang; Jihong Zhang; Jennifer Yu – Autism: The International Journal of Research and Practice, 2023
For autistic students receiving special education services, little is known about their relative strengths, weaknesses, and enjoyment across different math content areas; their overall math interest and persistence are also not well-studied. Using the 2017 eighth-grade National Assessment of Education Progress data, this study finds, relative to…
Descriptors: Mathematics Achievement, Reaction Time, Autism Spectrum Disorders, Grade 8
Emery-Wetherell, Meaghan; Wang, Ruoyao – Assessment & Evaluation in Higher Education, 2023
Over four semesters of a large introductory statistics course the authors found students were engaging in contract cheating on Chegg.com during multiple choice examinations. In this paper we describe our methodology for identifying, addressing and eventually eliminating cheating. We successfully identified 23 out of 25 students using a combination…
Descriptors: Computer Assisted Testing, Multiple Choice Tests, Cheating, Identification
Renuka Swami – ProQuest LLC, 2023
This study investigated the relationship between reading skills and math achievement among high school students in the Central Delta Areas of Mississippi. The data sources included Reading skills measured by English II Criterion Reference Test (CRT) scores and Algebra I CRT scores from the Mississippi Academic Assessment Program (MAAP) for the…
Descriptors: Reading Skills, Scores, Criterion Referenced Tests, Mathematics Achievement
Chang Ren – ProQuest LLC, 2023
The research includes three studies at the intersection of communications disorders and computational linguistics. We begin with the case study of APTgt, a system created to improve reinforcement for Phonetics students and improve Linguistic tools for their instructions. A portion of this system utilizes machine learning techniques (i.e.,…
Descriptors: Communication Disorders, Computational Linguistics, Technology Uses in Education, Phonetics
A Feasible Guidance for Ordered Multiple-Choice Items in Students' Hierarchical Understanding Levels
Su, King-Dow – Journal of Baltic Science Education, 2019
This research focuses on students' 5 hierarchical levels of Ordered Multiple-Choice (OMC) items for their extensive conceptualized understanding in the particulate nature of matter (PNM) chemistry. The basic framework for OMC items is to link students' conceptual understanding levels with possible cognitive responses. Developed as the substantial…
Descriptors: Multiple Choice Tests, Science Tests, STEM Education, Test Items
Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019
This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics
Wind, Stefanie A. – Journal of Educational Measurement, 2019
Numerous researchers have proposed methods for evaluating the quality of rater-mediated assessments using nonparametric methods (e.g., kappa coefficients) and parametric methods (e.g., the many-facet Rasch model). Generally speaking, popular nonparametric methods for evaluating rating quality are not based on a particular measurement theory. On…
Descriptors: Nonparametric Statistics, Test Validity, Test Reliability, Item Response Theory
Reed, Jessica J.; Raker, Jeffrey R.; Murphy, Kristen L. – Journal of Chemical Education, 2019
The ability to assess students' content knowledge and make meaningful comparisons of student performance is an important component of instruction. ACS exams have long served as tools for standardized assessment of students' chemistry knowledge. Because these exams are designed by committees of practitioners to cover a breadth of topics in the…
Descriptors: Science Tests, Standardized Tests, Chemistry, Student Evaluation
Slepkov, Aaron D.; Godfrey, Alan T. K. – Applied Measurement in Education, 2019
The answer-until-correct (AUC) method of multiple-choice (MC) testing involves test respondents making selections until the keyed answer is identified. Despite attendant benefits that include improved learning, broad student adoption, and facile administration of partial credit, the use of AUC methods for classroom testing has been extremely…
Descriptors: Multiple Choice Tests, Test Items, Test Reliability, Scores
Wise, Steven L. – Education Inquiry, 2019
A decision of whether to move from paper-and-pencil to computer-based tests is based largely on a careful weighing of the potential benefits of a change against its costs, disadvantages, and challenges. This paper briefly discusses the trade-offs involved in making such a transition, and then focuses on a relatively unexplored benefit of…
Descriptors: Computer Assisted Testing, Cheating, Test Wiseness, Scores
Yeo, Jun-Hui; Yang, Hsi-Hsun; Cho, I-Hsuan – Journal of Baltic Science Education, 2022
This research is conducted to identify the scientific conceptual cognition of ecosystem and the corresponding alternative conceptions by lower-secondary school students in Taiwan. Concept mapping, interviewing, and two-tier diagnostic test cannot make explicit reasoning pathways that students may use. Therefore, its purpose is to develop,…
Descriptors: Scientific Concepts, Concept Formation, Secondary School Students, Science Instruction
Erdener, Kevser; Perkmen, Serkan; Shelley, Mack; Ali Kandemir, Mehmet – Computers in the Schools, 2022
The main purpose of the current study was to develop and validate a scale of perceived attributes of the interactive whiteboard (IW) for the mathematics class. Rogers' Diffusion of Innovations Theory served as the theoretical framework. Two groups of participants in Turkey were employed in this study. The first group consisted of 350 middle school…
Descriptors: Test Construction, Mathematics Instruction, Foreign Countries, Educational Technology
Taboada Barber, Ana; Klauda, Susan Lutz; Wang, Weimeng; Cartwright, Kelly B.; Cutting, Laurie E. – Journal of Learning Disabilities, 2022
This study centered on emergent bilingual (EB) students with specific reading comprehension deficits (S-RCD), that is, with poor reading comprehension despite solid word identification skills. The participants were 209 students in Grades 2 to 4, including both EBs and English monolinguals (EMs) with and without S-RCD. Mean comparisons indicated…
Descriptors: Bilingualism, Reading Comprehension, Reading Difficulties, Comparative Analysis

Peer reviewed
Direct link
