NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20256
Since 202423
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 23 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025
It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…
Descriptors: Responses, Test Items, Test Format, Grade 8
Peer reviewed Peer reviewed
Direct linkDirect link
Ting Sun; Stella Yun Kim – Educational and Psychological Measurement, 2024
Equating is a statistical procedure used to adjust for the difference in form difficulty such that scores on those forms can be used and interpreted comparably. In practice, however, equating methods are often implemented without considering the extent to which two forms differ in difficulty. The study aims to examine the effect of the magnitude…
Descriptors: Difficulty Level, Data Interpretation, Equated Scores, High School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Angela Gardner; Karen Lichtman – Foreign Language Annals, 2024
This study investigated how questioning strategies impact language learner performance. Specifically, it explored how questioning strategies influence (i) verb production and subject-verb agreement in the target language, and (ii) learner confidence in completing tasks without translation software. Sixty-eight novice language learners enrolled in…
Descriptors: Questioning Techniques, Second Language Learning, Spanish, High School Students
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Robert N. Prince – Numeracy, 2025
One of the effects of the COVID-19 pandemic was the rapid shift to replacing traditional, paper-based tests with their computer-based counterparts. In many cases, these new modes of delivering tests will remain in place for the foreseeable future. In South Africa, the National Benchmark Quantitative Literacy (QL) test was impelled to make this…
Descriptors: Benchmarking, Numeracy, Multiple Literacies, Paper and Pencil Tests
Joanna Williamson – Research Matters, 2025
Teachers, examiners and assessment experts know from experience that some candidates annotate exam questions. "Annotation" includes anything the candidate writes or draws outside of the designated response space, such as underlining, jotting, circling, sketching and calculating. Annotations are of interest because they may evidence…
Descriptors: Mathematics, Tests, Documentation, Secondary Education
Peer reviewed Peer reviewed
Direct linkDirect link
Filip Moons; Paola Iannone; Ellen Vandervieren – ZDM: Mathematics Education, 2024
Handwritten tasks are better suited than digital ones to assess higher-order mathematics skills, as students can express themselves more freely. However, maintaining reliability and providing feedback can be challenging when assessing high-stakes, handwritten mathematics exams involving multiple assessors. This paper discusses a new semi-automated…
Descriptors: Grading, Mathematics Tests, Handwriting, Test Format
Peer reviewed Peer reviewed
Direct linkDirect link
Morgan McCracken; Jonathan D. Bostic; Timothy D. Folger – TechTrends: Linking Research and Practice to Improve Learning, 2024
Assessment is central to teaching and learning, and recently there has been a substantive shift from paper-and-pencil assessments towards technology delivered assessments such as computer-adaptive tests. Fairness is an important aspect of the assessment process, including design, administration, test-score interpretation, and data utility. The…
Descriptors: Middle School Students, Student Attitudes, Culture Fair Tests, Mathematics Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Kevin Woods; Tee McCaldin; Kerry Brown; Rob Buck; Nicola Fairhall; Emma Forshaw; David Soares – Assessment in Education: Principles, Policy & Practice, 2024
In England, Wales and Northern Ireland, the General Certificate of Secondary Education (GCSE) has been for the last 35 years the most common qualification by which students' attainment at age 16 has been measured. The range and balance of processes by which the GCSEs' programmes of study have been assessed have varied over the decades, to include…
Descriptors: Foreign Countries, Secondary School Students, Grade 11, Educational Certificates
Peer reviewed Peer reviewed
Direct linkDirect link
Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025
This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyzes were performed using the Dichotomous Rasch Model in the WINSTEPS…
Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Okan Bulut; Guher Gorgun; Hacer Karamese – Journal of Educational Measurement, 2025
The use of multistage adaptive testing (MST) has gradually increased in large-scale testing programs as MST achieves a balanced compromise between linear test design and item-level adaptive testing. MST works on the premise that each examinee gives their best effort when attempting the items, and their responses truly reflect what they know or can…
Descriptors: Response Style (Tests), Testing Problems, Testing Accommodations, Measurement
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Blair Lehman; Jesse R. Sparks; Jonathan Steinberg – ETS Research Report Series, 2024
Over the last 20 years, many methods have been proposed to use process data (e.g., response time) to detect changes in engagement during the test-taking process. However, many of these methods were developed and evaluated in highly similar testing contexts: 30 or more single-select multiple-choice items presented in a linear, fixed sequence in…
Descriptors: National Competency Tests, Secondary School Mathematics, Secondary School Students, Mathematics Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Kaveh Jalilzadeh; Mojgan Rashtchi; Fatemeh Mirzapour – Language Testing in Asia, 2024
A challenging aspect of online education is assessment since academic integrity could be violated due to students' cheating behaviors. The current qualitative research investigated English teachers' perceptions of why students cheat in online assessments. Besides, it attempted to find strategies to reduce cheating in online assessments. Twelve…
Descriptors: Cheating, Computer Assisted Testing, Coping, English (Second Language)
Jennifer Darling-Aduana; Carolyn J. Heinrich; Jeremy Noonan; Jialing Wu; Kathryn Enriquez – Annenberg Institute for School Reform at Brown University, 2024
Online credit recovery (OCR) courses are the most common means through which students retake courses required for high school graduation. Yet a growing body of research has raised concerns regarding student learning in these courses, with low quality assessments posited as one contributing factor. To address this concern, we reviewed every…
Descriptors: Online Courses, Required Courses, Repetition, Credits
Emma Walland – Research Matters, 2024
GCSE examinations (taken by students aged 16 years in England) are not intended to be speeded (i.e. to be partly a test of how quickly students can answer questions). However, there has been little research exploring this. The aim of this research was to explore the speededness of past GCSE written examinations, using only the data from scored…
Descriptors: Educational Change, Test Items, Item Analysis, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Yi-Hsuan Lee; Yue Jia – Applied Measurement in Education, 2024
Test-taking experience is a consequence of the interaction between students and assessment properties. We define a new notion, rapid-pacing behavior, to reflect two types of test-taking experience -- disengagement and speededness. To identify rapid-pacing behavior, we extend existing methods to develop response-time thresholds for individual items…
Descriptors: Adaptive Testing, Reaction Time, Item Response Theory, Test Format
Previous Page | Next Page »
Pages: 1  |  2