Showing 1 to 15 of 65 results
Peer reviewed
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple-choice (MC) and mixed-format tests within the common-item nonequivalent groups design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
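The entry above compares multidimensional linking methods for a bifactor model; as a simpler point of reference, the sketch below performs classic unidimensional mean/sigma linking on the common items, which is the basic operation those methods generalize. All parameter values are hypothetical.

```python
import numpy as np

# Hypothetical difficulty and discrimination estimates for the common items,
# calibrated separately on the new form (X) and the base form (Y).
b_x = np.array([-0.8, -0.2, 0.4, 1.1, 1.6])
b_y = np.array([-0.5, 0.1, 0.7, 1.5, 2.0])
a_x = np.array([1.2, 0.9, 1.4, 1.0, 0.8])

# Mean/sigma linking: find A, B so that X-scale parameters map onto the
# Y scale (b* = A*b + B, a* = a / A, theta* = A*theta + B).
A = b_y.std(ddof=1) / b_x.std(ddof=1)
B = b_y.mean() - A * b_x.mean()

b_linked = A * b_x + B   # difficulties transformed onto the Y scale
a_linked = a_x / A       # discriminations transformed onto the Y scale

print(f"A = {A:.3f}, B = {B:.3f}")
print("linked b:", np.round(b_linked, 3))
```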
Peer reviewed
Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025
In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…
Descriptors: Scoring, Computer Assisted Testing, Models, Correlation
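The paper reports classification accuracy for scored TTCT-F drawings; the sketch below shows a generic scikit-learn random forest workflow of the kind such a classifier presumably follows. The feature matrix and labels are random stand-ins, not the study's data.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

# Hypothetical stand-ins: each drawing reduced to a fixed-length feature
# vector (e.g., embeddings or hand-crafted image features), with one
# human-assigned originality label per drawing.
X = rng.normal(size=(500, 64))
y = rng.integers(0, 2, size=500)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

clf = RandomForestClassifier(n_estimators=300, random_state=0)
clf.fit(X_tr, y_tr)

print("held-out accuracy:", accuracy_score(y_te, clf.predict(X_te)))
```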
Peer reviewed
Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025
It is common to find mixed-format data resulting from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring and using suitable measurement models to estimate latent abilities. Past research in educational…
Descriptors: Responses, Test Items, Test Format, Grade 8
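To make the mixed-format estimation problem concrete, here is a minimal sketch, assuming a 2PL model for the MC items and a generalized partial credit model for the CR items, that finds one examinee's maximum likelihood ability. The item parameters and responses are invented for illustration.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def p_2pl(theta, a, b):
    """2PL correct-response probability for a dichotomous MC item."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def gpcm_probs(theta, a, thresholds):
    """Generalized partial credit category probabilities for a CR item."""
    steps = a * (theta - np.asarray(thresholds))
    cum = np.concatenate([[0.0], np.cumsum(steps)])
    expcum = np.exp(cum - cum.max())
    return expcum / expcum.sum()

# Hypothetical item parameters and one examinee's responses.
mc_items = [(1.2, -0.5), (0.8, 0.3), (1.5, 1.0)]   # (a, b) per MC item
mc_resp = [1, 1, 0]                                 # 1 = correct
cr_items = [(1.0, [-0.8, 0.2, 1.1])]                # (a, step thresholds)
cr_resp = [2]                                       # score category (0..3)

def neg_loglik(theta):
    ll = 0.0
    for (a, b), x in zip(mc_items, mc_resp):
        p = p_2pl(theta, a, b)
        ll += x * np.log(p) + (1 - x) * np.log(1 - p)
    for (a, th), x in zip(cr_items, cr_resp):
        ll += np.log(gpcm_probs(theta, a, th)[x])
    return -ll

res = minimize_scalar(neg_loglik, bounds=(-4, 4), method="bounded")
print(f"ML ability estimate: {res.x:.3f}")
```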
Peer reviewed
Ulrike Padó; Yunus Eryilmaz; Larissa Kirschner – International Journal of Artificial Intelligence in Education, 2024
Short-Answer Grading (SAG) is a time-consuming task for teachers that automated SAG models have long promised to make easier. However, there are three challenges for their broad-scale adoption: A technical challenge regarding the need for high-quality models, which is exacerbated for languages with fewer resources than English; a usability…
Descriptors: Grading, Automation, Test Format, Computer Assisted Testing
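As a rough illustration of what an SAG model does (not the authors' system), the sketch below grades hypothetical answers by TF-IDF cosine similarity to a reference answer. The 0.3 cut-off is arbitrary, and production systems typically use fine-tuned language models rather than this lexical baseline.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical reference answer and student answers.
reference = "photosynthesis converts light energy into chemical energy stored in glucose"
answers = [
    "plants use light energy to make glucose, storing chemical energy",
    "it is the process where roots absorb water from the soil",
]

vec = TfidfVectorizer().fit([reference] + answers)
ref_vec = vec.transform([reference])

for ans in answers:
    sim = cosine_similarity(ref_vec, vec.transform([ans]))[0, 0]
    # Hypothetical decision rule: similarity above 0.3 counts as acceptable.
    verdict = "accept" if sim > 0.3 else "flag for manual grading"
    print(f"{sim:.2f}  {verdict}  <- {ans!r}")
```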
Peer reviewed
Chang, Minyu; Brainerd, C. J. – Metacognition and Learning, 2023
Making judgments of learning (JOLs) can sometimes modify subsequent memory performance, which is referred to as JOL reactivity. We evaluated two major theoretical explanations of JOL reactivity and used the dual-retrieval model to pinpoint the retrieval processes that are modified by JOLs. The changed-goal hypothesis assumes that JOLs highlight…
Descriptors: Cues, Evaluative Thinking, Models, Recall (Psychology)
Peer reviewed
Anna Filighera; Sebastian Ochs; Tim Steuer; Thomas Tregel – International Journal of Artificial Intelligence in Education, 2024
Automatic grading models are valued for the time and effort saved during the instruction of large student bodies. Especially with the increasing digitization of education and interest in large-scale standardized testing, the popularity of automatic grading has risen to the point where commercial solutions are widely available and used. However,…
Descriptors: Cheating, Grading, Form Classes (Languages), Computer Software
Peer reviewed
Cerullo, Enzo; Jones, Hayley E.; Carter, Olivia; Quinn, Terry J.; Cooper, Nicola J.; Sutton, Alex J. – Research Synthesis Methods, 2022
Standard methods for the meta-analysis of medical tests, without assuming a gold standard, are limited to dichotomous data. Multivariate probit models are used to analyse correlated dichotomous data, and can be extended to model ordinal data. Within the context of an imperfect gold standard, they have previously been used for the analysis of…
Descriptors: Meta Analysis, Test Format, Medicine, Standards
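A multivariate probit links correlated dichotomous test results through a latent multivariate normal. The sketch below, with assumed marginal sensitivities and latent correlation for two hypothetical tests, computes the joint positive probability and shows how the correlation moves it away from the independence product; it is only the bivariate building block, not the paper's full meta-analytic model.

```python
import numpy as np
from scipy.stats import multivariate_normal, norm

# Hypothetical sensitivities of two correlated diagnostic tests in the
# diseased group, expressed on the probit scale, plus their latent
# (tetrachoric) correlation.
mu = np.array([norm.ppf(0.85), norm.ppf(0.75)])  # marginal P(test positive)
rho = 0.4
cov = np.array([[1.0, rho], [rho, 1.0]])

# Joint probability that both tests are positive: the bivariate normal
# CDF evaluated at the two probit means.
p_both = multivariate_normal(mean=[0, 0], cov=cov).cdf(mu)

# Under independence this would just be the product of the marginals.
print(f"P(both positive) = {p_both:.3f}  vs independent: {0.85 * 0.75:.3f}")
```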
Peer reviewed
Wilson, Joseph; Pollard, Benjamin; Aiken, John M.; Lewandowski, H. J. – Physical Review Physics Education Research, 2022
Surveys have long been used in physics education research to understand student reasoning and inform course improvements. However, to make analysis of large sets of responses practical, most surveys use a closed-response format with a small set of potential responses. Open-ended formats, such as written free response, can provide deeper insights…
Descriptors: Natural Language Processing, Science Education, Physics, Artificial Intelligence
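As a minimal illustration of the kind of NLP pipeline such work relies on (not the authors' method), the sketch below clusters a few invented free responses with TF-IDF features and k-means:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

# Hypothetical free responses to a lab-survey prompt.
responses = [
    "I learned how to estimate uncertainty in my measurements",
    "error bars and measurement uncertainty were the main takeaway",
    "the oscilloscope interface was confusing at first",
    "I struggled with setting up the oscilloscope trigger",
]

X = TfidfVectorizer(stop_words="english").fit_transform(responses)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# Responses grouped by cluster label; similar themes should co-occur.
for lab, text in sorted(zip(labels, responses)):
    print(lab, text)
```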
Peer reviewed
Sahu, Archana; Bhowmick, Plaban Kumar – IEEE Transactions on Learning Technologies, 2020
In this paper, we studied different automatic short answer grading (ASAG) systems to provide a comprehensive view of the feature spaces explored by previous works. While the performance reported in previous works has been encouraging, a systematic study of the features is lacking. Apart from providing systematic feature space exploration, we also…
Descriptors: Automation, Grading, Test Format, Artificial Intelligence
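To give a flavor of the lexical corner of that feature space, the sketch below computes three classic ASAG features (token overlap, length ratio, and TF-IDF cosine similarity) for a hypothetical answer pair; the paper surveys a far broader set.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def asag_features(student: str, reference: str) -> np.ndarray:
    """A few classic lexical-overlap features used in ASAG pipelines."""
    s_tokens = set(student.lower().split())
    r_tokens = set(reference.lower().split())
    overlap = len(s_tokens & r_tokens) / max(len(r_tokens), 1)
    length_ratio = len(student.split()) / max(len(reference.split()), 1)
    vec = TfidfVectorizer().fit([student, reference])
    sim = cosine_similarity(vec.transform([student]),
                            vec.transform([reference]))[0, 0]
    return np.array([overlap, length_ratio, sim])

print(asag_features(
    "the mitochondria produces energy for the cell",
    "mitochondria generate most of the cell's chemical energy",
))
```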
Peer reviewed
Huang, Hung-Yu – Educational and Psychological Measurement, 2023
Forced-choice (FC) item formats used in noncognitive tests typically present a set of response options that measure different traits and instruct respondents to make preference judgments among these options, in order to control the response biases commonly observed in normative tests. Diagnostic classification models (DCMs)…
Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making
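The paper develops DCMs for forced-choice items; for orientation, the sketch below shows only the standard DINA item response function for a single dichotomous item, with a made-up attribute profile, Q-matrix row, and guess/slip values.

```python
import numpy as np

def dina_prob(alpha, q, guess, slip):
    """DINA: P(correct) is 1 - slip if all attributes required by the
    item are mastered, otherwise the guessing probability."""
    eta = np.all(alpha[q == 1] == 1)
    return (1 - slip) if eta else guess

alpha = np.array([1, 0, 1])   # hypothetical mastery profile
q = np.array([1, 0, 1])       # attributes required by the item
print(dina_prob(alpha, q, guess=0.2, slip=0.1))   # -> 0.9
```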
Peer reviewed
Stephane E. Collignon; Josey Chacko; Salman Nazir – Journal of Information Systems Education, 2024
Most business schools require students to take at least one technical Management Information System (MIS) course. Due to the technical nature of the material, the course and the assessments tend to be anxiety inducing. With over three out of every five students in US colleges suffering from "overwhelming anxiety" in some form, we study…
Descriptors: Multiple Choice Tests, Test Format, Business Schools, Information Systems
Peer reviewed
Kim, Kyung Yong; Lim, Euijin; Lee, Won-Chan – International Journal of Testing, 2019
For passage-based tests, items that belong to a common passage often violate the local independence assumption of unidimensional item response theory (UIRT). In this case, ignoring local item dependence (LID) and estimating item parameters using a UIRT model could be problematic because doing so might result in inaccurate parameter estimates,…
Descriptors: Item Response Theory, Equated Scores, Test Items, Models
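A common way to screen for the local item dependence this entry describes is Yen's Q3 statistic, the correlation of model residuals across item pairs. The sketch below computes it for simulated locally independent 2PL data; for simplicity it uses the true abilities, which in practice would be estimated.

```python
import numpy as np

def q3_matrix(responses, theta, a, b):
    """Yen's Q3: correlations of 2PL residuals across item pairs.
    Markedly elevated pairs suggest local item dependence."""
    p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b[None, :])))
    residuals = responses - p
    return np.corrcoef(residuals, rowvar=False)

rng = np.random.default_rng(1)
n, k = 1000, 6
theta = rng.normal(size=n)
a = np.full(k, 1.0)
b = np.linspace(-1, 1, k)

# Simulate locally independent responses, then inspect Q3.
p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b[None, :])))
x = (rng.random((n, k)) < p).astype(float)

Q3 = q3_matrix(x, theta, a, b)
print(np.round(Q3, 2))   # off-diagonals near 0 when items are independent
```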
Peer reviewed
Ippel, Lianne; Magis, David – Educational and Psychological Measurement, 2020
In the dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic for evaluating the precision of various ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned, and new ASE formulas were derived from a general…
Descriptors: Item Response Theory, Error of Measurement, Accuracy, Standards
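For the dichotomous 2PL case, the textbook ASE of the maximum likelihood ability estimate is one over the square root of the test information. The sketch below evaluates it at a few ability values with hypothetical item parameters.

```python
import numpy as np

def ase_2pl(theta, a, b):
    """Asymptotic standard error of the ML ability estimate under the
    2PL model: 1 / sqrt(test information)."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    info = np.sum(a**2 * p * (1 - p))   # Fisher information of the test
    return 1.0 / np.sqrt(info)

a = np.array([1.2, 0.8, 1.5, 1.0, 0.9])    # hypothetical discriminations
b = np.array([-1.0, -0.3, 0.2, 0.8, 1.5])  # hypothetical difficulties
for theta in (-2.0, 0.0, 2.0):
    print(f"theta = {theta:+.1f}  ASE = {ase_2pl(theta, a, b):.3f}")
```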
Peer reviewed
Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…
Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format
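The paper's model is Bayesian and multidimensional; purely to illustrate the idea of a persistence-moderated, nonlinear position effect, the sketch below uses a toy parameterization (quadratic decay scaled by one minus persistence) that is an assumption of this example, not the authors' specification.

```python
import numpy as np

def p_correct(theta, persistence, a, b, position, n_items, delta=0.3):
    """Response probability with a nonlinear item position effect: the
    effective ability decays with position, moderated by the
    person-specific persistence dimension (all values hypothetical)."""
    decay = delta * (1 - persistence) * (position / n_items) ** 2
    return 1.0 / (1.0 + np.exp(-a * (theta - decay - b)))

# Same item administered early vs. late for a low-persistence examinee.
for pos in (1, 40):
    p = p_correct(theta=0.5, persistence=0.2, a=1.0, b=0.0,
                  position=pos, n_items=40)
    print(f"position {pos:2d}: P(correct) = {p:.3f}")
```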
Peer reviewed; PDF available on ERIC
Yilmaz, Haci Bayram – International Electronic Journal of Elementary Education, 2019
Open-ended and multiple-choice questions are commonly placed on the same tests; however, the effects of mixing item types on test and item statistics are still debated. This study aims to compare model and item fit statistics in a mixed-format test where multiple-choice and constructed-response items are used together. In this…
Descriptors: Item Response Theory, Models, Goodness of Fit, Elementary School Science
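One standard item fit statistic such comparisons draw on is the outfit mean-square, the average squared standardized residual for an item. The sketch below computes it for a simulated Rasch item, where values near 1 indicate adequate fit; the study's own fit statistics may differ.

```python
import numpy as np

def outfit_msq(x, p):
    """Outfit mean-square for one item: average squared standardized
    residual over examinees; values near 1 indicate good fit."""
    z2 = (x - p) ** 2 / (p * (1 - p))
    return z2.mean()

rng = np.random.default_rng(2)
theta = rng.normal(size=2000)
b = 0.4                                   # hypothetical item difficulty
p = 1.0 / (1.0 + np.exp(-(theta - b)))    # Rasch model probabilities
x = (rng.random(2000) < p).astype(float)  # simulated responses

print(f"outfit MSQ: {outfit_msq(x, p):.3f}")   # should be close to 1.0
```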