Showing 1 to 15 of 62 results
Peer reviewed
PDF on ERIC Download full text
Tom Benton – Practical Assessment, Research & Evaluation, 2025
This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…
Descriptors: Equated Scores, Test Format, Test Items, Computation
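As background for the baseline this paper extends, here is a minimal sketch of conventional linear equating, which maps Form X scores onto the Form Y scale so the transformed scores match Form Y's mean and standard deviation. The function name and sample data are illustrative, not from the paper.

```python
import statistics

def linear_equate(x_scores, y_scores, x):
    """Map a raw score x on Form X to the Form Y scale.

    Linear equating sets the transformed Form X distribution's mean and
    standard deviation equal to Form Y's:
        y(x) = mu_Y + (sd_Y / sd_X) * (x - mu_X)
    """
    mu_x, sd_x = statistics.mean(x_scores), statistics.pstdev(x_scores)
    mu_y, sd_y = statistics.mean(y_scores), statistics.pstdev(y_scores)
    return mu_y + (sd_y / sd_x) * (x - mu_x)
```

For example, if Form X scores have mean 10 and SD 2 while Form Y scores have mean 12 and SD 4, a Form X score of 12 (one SD above the mean) equates to 16 on the Form Y scale.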
Peer reviewed
Direct link
Harpreet Auby; Namrata Shivagunde; Vijeta Deshpande; Anna Rumshisky; Milo D. Koretsky – Journal of Engineering Education, 2025
Background: Analyzing student short-answer written justifications to conceptually challenging questions has proven helpful to understand student thinking and improve conceptual understanding. However, qualitative analyses are limited by the burden of analyzing large amounts of text. Purpose: We apply dense and sparse Large Language Models (LLMs)…
Descriptors: Student Evaluation, Thinking Skills, Test Format, Cognitive Processes
Peer reviewed
Direct link
Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025
In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…
Descriptors: Scoring, Computer Assisted Testing, Models, Correlation
Peer reviewed
Direct link
Ting Sun; Stella Yun Kim – Educational and Psychological Measurement, 2024
Equating is a statistical procedure used to adjust for the difference in form difficulty such that scores on those forms can be used and interpreted comparably. In practice, however, equating methods are often implemented without considering the extent to which two forms differ in difficulty. The study aims to examine the effect of the magnitude…
Descriptors: Difficulty Level, Data Interpretation, Equated Scores, High School Students
Peer reviewed
PDF on ERIC Download full text
Hasibe Yahsi Sari; Hulya Kelecioglu – International Journal of Assessment Tools in Education, 2025
The aim of the study is to examine the effect of the ratio of polytomous items on ability estimation under different conditions in multistage tests (MST) using mixed-format tests. The study is simulation based. In the PISA 2018 application, the ability parameters of the individuals and the item pool were created by using the item parameters estimated from…
Descriptors: Test Items, Test Format, Accuracy, Test Length
Peer reviewed
Direct link
Anna Filighera; Sebastian Ochs; Tim Steuer; Thomas Tregel – International Journal of Artificial Intelligence in Education, 2024
Automatic grading models are valued for the time and effort saved during the instruction of large student bodies. Especially with the increasing digitization of education and interest in large-scale standardized testing, the popularity of automatic grading has risen to the point where commercial solutions are widely available and used. However,…
Descriptors: Cheating, Grading, Form Classes (Languages), Computer Software
Peer reviewed
Direct link
Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Peer reviewed
Direct link
Dambha, Tasneem; Swanepoel, De Wet; Mahomed-Asmail, Faheema; De Sousa, Karina C.; Graham, Marien A.; Smits, Cas – Journal of Speech, Language, and Hearing Research, 2022
Purpose: This study compared the test characteristics, test-retest reliability, and test efficiency of three novel digits-in-noise (DIN) test procedures to a conventional antiphasic 23-trial adaptive DIN (D23). Method: One hundred twenty participants with an average age of 42 years (SD = 19) were included. Participants were tested and retested…
Descriptors: Auditory Tests, Screening Tests, Efficiency, Test Format
Peer reviewed
Direct link
Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024
To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…
Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement
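This study extends information-weighted characteristic curve linking; as background, here is a minimal sketch of the simpler mean-sigma linking method for 2PL item parameters, which estimates the linear scale transformation from the difficulty parameters alone. The function names and example values are illustrative, not from the study.

```python
import statistics

def mean_sigma_link(b_new, b_ref):
    """Estimate the scale transformation theta* = A*theta + B that places
    new-form parameters on the reference scale (mean-sigma method).

    A is the ratio of the difficulty SDs; B aligns the difficulty means.
    """
    A = statistics.pstdev(b_ref) / statistics.pstdev(b_new)
    B = statistics.mean(b_ref) - A * statistics.mean(b_new)
    return A, B

def transform_item(a, b, A, B):
    """Rescale one 2PL item: discrimination by 1/A, difficulty by A*b + B."""
    return a / A, A * b + B
```

The information-weighted characteristic curve methods the study builds on replace these simple moment matches with a criterion that down-weights imprecisely estimated parameters.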
Peer reviewed
Direct link
Cerullo, Enzo; Jones, Hayley E.; Carter, Olivia; Quinn, Terry J.; Cooper, Nicola J.; Sutton, Alex J. – Research Synthesis Methods, 2022
Standard methods for the meta-analysis of medical tests, without assuming a gold standard, are limited to dichotomous data. Multivariate probit models are used to analyse correlated dichotomous data, and can be extended to model ordinal data. Within the context of an imperfect gold standard, they have previously been used for the analysis of…
Descriptors: Meta Analysis, Test Format, Medicine, Standards
Peer reviewed
PDF on ERIC Download full text
McGuire, Michael J. – International Journal for the Scholarship of Teaching and Learning, 2023
College students in a lower-division psychology course made metacognitive judgments by predicting and postdicting performance for true-false, multiple-choice, and fill-in-the-blank question sets on each of three exams. This study investigated which question format would result in the most accurate metacognitive judgments. Extending Koriat's (1997)…
Descriptors: Metacognition, Multiple Choice Tests, Accuracy, Test Format
Peer reviewed
PDF on ERIC Download full text
Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021
This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…
Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items
Peer reviewed
Direct link
Alamri, Aeshah; Higham, Philip A. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2022
Corrective feedback is often touted as a critical benefit to learning, boosting testing effects when retrieval is poor and reducing negative testing effects. Here, we explore the dark side of corrective feedback. In three experiments, we found that corrective feedback on multiple-choice (MC) practice questions is later endorsed as the answer to…
Descriptors: Feedback (Response), Multiple Choice Tests, Cues, Recall (Psychology)
Peer reviewed
Direct link
Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022
The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…
Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring
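As background on the first model in the study's list, here is a minimal sketch of category probabilities under the generalized partial credit model (GPCM), where an examinee with ability theta responds in one of K+1 ordered categories given K step difficulties. The function name is illustrative; this is not code from the study.

```python
import math

def gpcm_probs(theta, a, b):
    """Category probabilities under the GPCM.

    b lists the step difficulties b_1..b_K; categories are 0..K.
    P(X = k) is proportional to exp(sum_{j<=k} a*(theta - b_j)),
    with the empty sum for k = 0 defined as 0.
    """
    sums = [0.0]
    for bj in b:
        sums.append(sums[-1] + a * (theta - bj))
    exps = [math.exp(s) for s in sums]
    total = sum(exps)
    return [e / total for e in exps]
```

For instance, with theta = 0, a = 1, and steps b = [-1, 1], the middle category is most probable, as expected for an examinee whose ability sits between the two step difficulties.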
Peer reviewed
PDF on ERIC Download full text
Vida, Leonardo J.; Bolsinova, Maria; Brinkhuis, Matthieu J. S. – International Educational Data Mining Society, 2021
The quality of exams drives the test-taking behavior of examinees and is a proxy for the quality of teaching. Because most university exams have strict time limits, and speededness is an important measure of examinees' cognitive state, speededness might be used to assess the connection between exam quality and examinee performance. The practice of…
Descriptors: Accuracy, Test Items, Tests, Student Behavior