ERIC - Search Results

Publication Date

In 2026	0
Since 2025	10
Since 2022 (last 5 years)	96
Since 2017 (last 10 years)	250
Since 2007 (last 20 years)	552

Descriptor

Models	754
Test Items	754
Item Response Theory	342
Test Construction	159
Difficulty Level	145
Foreign Countries	138
Psychometrics	113
Item Analysis	111
Simulation	103
Statistical Analysis	98
Comparative Analysis	96
Goodness of Fit	95
Computation	85
Scores	85
Computer Assisted Testing	84
Test Validity	83
Evaluation Methods	74
Test Bias	65
Scoring	64
Classification	62
Correlation	62
Mathematics Tests	61
Test Reliability	59
Multiple Choice Tests	57
Accuracy	53
More ▼

Education Level

Higher Education	85
Secondary Education	66
Postsecondary Education	58
Elementary Education	54
Elementary Secondary Education	34
Middle Schools	34
Junior High Schools	24
High Schools	20
Intermediate Grades	18
Grade 8	17
Grade 4	15
Grade 7	10
Early Childhood Education	9
Grade 3	9
Primary Education	8
Grade 5	7
Grade 6	6
Grade 2	5
Grade 12	4
Adult Education	3
Grade 1	2
Kindergarten	2
Grade 10	1
Grade 11	1
More ▼

Audience

Practitioners	14
Researchers	8
Teachers	6
Policymakers	3
Students	2
Administrators	1
Parents	1

Location

Germany	14
Canada	11
Taiwan	9
China	8
United States	8
Iran	7
Netherlands	7
South Korea	6
Turkey	6
Indonesia	5
United Kingdom	5
New York	4
Singapore	4
South Africa	4
United Kingdom (England)	4
Australia	3
Belgium	3
Japan	3
Malaysia	3
Massachusetts	3
United Kingdom (Great Britain)	3
Arizona	2
Asia	2
California	2
Denmark	2
More ▼

Laws, Policies, & Programs

Comprehensive Employment and…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 754 results Save | Export

A Historic Review and Empirical Revitalization of the Stages of Concern Questionnaire

Peer reviewed
PDF on ERIC

Download full text

Kent Anderson Seidel – School Leadership Review, 2025

This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…

Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention

Fused SDT/IRT Models for Mixed-Format Exams

Peer reviewed

Direct link

Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024

A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…

Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Direct link

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation

Peer reviewed
PDF on ERIC

Download full text

Leonidas Zotos; Hedderik van Rijn; Malvina Nissim – International Educational Data Mining Society, 2025

In an educational setting, an estimate of the difficulty of Multiple-Choice Questions (MCQs), a commonly used strategy to assess learning progress, constitutes very useful information for both teachers and students. Since human assessment is costly from multiple points of view, automatic approaches to MCQ item difficulty estimation are…

Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Artificial Intelligence

Investigating Item Complexity as a Source of Cross-National DIF in TIMSS Math and Science

Peer reviewed

Direct link

Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024

Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…

Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity

Guesses and Slips as Proficiency-Related Phenomena and Impacts on Parameter Invariance

Peer reviewed

Direct link

Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024

Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…

Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Direct link

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs

Peer reviewed

Direct link

Hyo Jeong Shin; Christoph König; Frederic Robin; Andreas Frey; Kentaro Yamamoto – Journal of Educational Measurement, 2025

Many international large-scale assessments (ILSAs) have switched to multistage adaptive testing (MST) designs to improve measurement efficiency in measuring the skills of the heterogeneous populations around the world. In this context, previous literature has reported the acceptable level of model parameter recovery under the MST designs when the…

Descriptors: Robustness (Statistics), Item Response Theory, Adaptive Testing, Test Construction

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Assessing Model Fit of the Generalized Graded Unfolding Model

Peer reviewed
PDF on ERIC

Download full text

Abdulla Alzarouni; R. J. De Ayala – Practical Assessment, Research & Evaluation, 2025

The assessment of model fit in latent trait modeling is an integral part of correctly applying the model. Still the assessment of model fit has been less utilized for ideal point models such as the Generalized Graded Unfolding Models (GGUM). The current study assesses the performance of the relative fit indices "AIC" and "BIC,"…

Descriptors: Goodness of Fit, Models, Statistical Analysis, Sample Size

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

Generating Multiple Choice Questions from a Textbook: LLMs Match Human Performance on Most Metrics

Peer reviewed
PDF on ERIC

Download full text

Andrew M. Olney – Grantee Submission, 2023

Multiple choice questions are traditionally expensive to produce. Recent advances in large language models (LLMs) have led to fine-tuned LLMs that generate questions competitive with human-authored questions. However, the relative capabilities of ChatGPT-family models have not yet been established for this task. We present a carefully-controlled…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Algorithms

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 51

Educational and Psychological…	67
Journal of Educational…	54
Applied Psychological…	46
ProQuest LLC	37
Journal of Educational and…	34
Psychometrika	29
ETS Research Report Series	23
Applied Measurement in…	21
International Journal of…	20
Measurement:…	19
Grantee Submission	17
Educational Measurement:…	10
College Board	9
Journal of Applied Testing…	9
International Educational…	7
Online Submission	7
Practical Assessment,…	7
International Journal of…	6
Language Assessment Quarterly	6
International Journal of…	5
International Journal of…	5
Journal of Experimental…	5
Language Testing	5
Education and Information…	4
Educational Testing Service	4
More ▼

van der Linden, Wim J.	13
Wang, Wen-Chung	12
Gierl, Mark J.	10
de la Torre, Jimmy	8
von Davier, Matthias	8
Sinharay, Sandip	7
De Boeck, Paul	6
Paek, Insu	6
Suh, Youngsuk	6
Baghaei, Purya	5
Bejar, Isaac I.	5
Lai, Hollis	5
Revuelta, Javier	5
Wilson, Mark	5
Bolt, Daniel M.	4
Cai, Li	4
Chang, Hua-Hua	4
DeMars, Christine E.	4
Goldhammer, Frank	4
Huff, Kristen	4
Janssen, Rianne	4
Jiao, Hong	4
Jin, Kuan-Yu	4
Kahraman, Nilufer	4
More ▼

Journal Articles	563
Reports - Research	436
Reports - Evaluative	153
Reports - Descriptive	84
Speeches/Meeting Papers	64
Dissertations/Theses -…	39
Opinion Papers	18
Tests/Questionnaires	15
Guides - Non-Classroom	7
Numerical/Quantitative Data	7
Information Analyses	6
Non-Print Media	6
Reference Materials - General	6
Guides - Classroom - Teacher	4
Books	3
Collected Works - Proceedings	3
Book/Product Reviews	2
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
Guides - General	1
More ▼

Program for International…	21
Trends in International…	18
National Assessment of…	17
Graduate Record Examinations	8
SAT (College Admission Test)	8
Test of English as a Foreign…	6
Advanced Placement…	5
Armed Services Vocational…	4
Law School Admission Test	3
Raven Advanced Progressive…	3
Alberta Grade Twelve Diploma…	2
California Achievement Tests	2
Hidden Figures Test	2
International English…	2
Medical College Admission Test	2
NEO Personality Inventory	2
Test of English for…	2
Wechsler Adult Intelligence…	2
ACT Assessment	1
Attribution Style…	1
Behavioral Risk Factor…	1
Big Five Inventory	1
Center for Epidemiologic…	1
Eysenck Personality Inventory	1
Florida Comprehensive…	1
More ▼