Showing 1 to 15 of 45 results
Peer reviewed
Direct link
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple-choice (MC) and mixed-format tests within the common-item nonequivalent groups design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
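As background on linking: under a common-item nonequivalent groups design, separately calibrated forms are placed on a single scale by a linear transformation of the latent trait. A minimal Python sketch of the classic unidimensional mean/sigma method (a much simpler special case than the bifactor methods compared in this article; all parameter values are invented for illustration):

    import numpy as np

    # Difficulty estimates for the same common items from two separate
    # calibrations (hypothetical values).
    b_new = np.array([-1.2, -0.4, 0.3, 0.9, 1.6])  # new form's scale
    b_old = np.array([-1.0, -0.1, 0.5, 1.2, 1.9])  # old (target) scale

    # Mean/sigma linking: find A, B with theta_old = A * theta_new + B.
    A = b_old.std() / b_new.std()
    B = b_old.mean() - A * b_new.mean()

    # Rescale the new form: b* = A * b + B; discriminations become a / A.
    b_linked = A * b_new + B
    print(A, B, b_linked)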
Peer reviewed
Direct link
Huang, Hung-Yu – Educational and Psychological Measurement, 2023
Forced-choice (FC) item formats used in noncognitive tests typically present a set of response options that measure different traits and instruct respondents to make preference judgments among these options, in order to control the response biases commonly observed in normative tests. Diagnostic classification models (DCMs)…
Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making
Peer reviewed
Direct link
Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024
This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, the Triplet-2PLM. Fisher's…
Descriptors: Questionnaires, Test Items, Item Response Theory, Models
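For orientation, a MUPP-style pairwise preference probability can be assembled from two 2PL endorsement curves: the respondent prefers statement s over t when s is endorsed and t is rejected, normalized over the two discordant outcomes. A hedged Python sketch of this general form (parameters invented; the article's information functions are not reproduced here):

    import numpy as np

    def p_2pl(theta, a, b):
        """2PL probability of endorsing a single statement."""
        return 1.0 / (1.0 + np.exp(-a * (theta - b)))

    def p_prefer(theta_s, theta_t, a_s, b_s, a_t, b_t):
        """MUPP-style probability of preferring statement s over t."""
        ps = p_2pl(theta_s, a_s, b_s)
        pt = p_2pl(theta_t, a_t, b_t)
        return ps * (1 - pt) / (ps * (1 - pt) + (1 - ps) * pt)

    print(p_prefer(0.5, -0.2, 1.2, 0.0, 0.8, 0.3))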
Peer reviewed
Direct link
Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025
Mixed-format data commonly result from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring and using suitable measurement models to estimate latent abilities. Past research in educational…
Descriptors: Responses, Test Items, Test Format, Grade 8
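A typical way to calibrate such mixed-format data is to combine a dichotomous model for the MC items with a polytomous model for the CR items in a single likelihood. A minimal Python sketch, assuming a 2PL for MC and a generalized partial credit model (GPCM) for CR (illustrative values only):

    import numpy as np

    def p_2pl(theta, a, b):
        """Probability of a correct MC response under the 2PL."""
        return 1 / (1 + np.exp(-a * (theta - b)))

    def p_gpcm(theta, a, steps):
        """GPCM category probabilities for a CR item with len(steps)+1 categories."""
        z = np.concatenate([[0.0], np.cumsum(a * (theta - np.asarray(steps)))])
        ez = np.exp(z - z.max())         # softmax over cumulative step logits
        return ez / ez.sum()

    theta = 0.4
    print(p_2pl(theta, 1.1, -0.3))           # one MC item
    print(p_gpcm(theta, 0.9, [-0.5, 0.6]))   # one 3-category CR item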
Peer reviewed
Direct link
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
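For a sense of how an SDT choice model treats an MC item, the standard equal-variance Gaussian m-alternative formula has the examinee pick the option with the greatest perceived strength, so accuracy grows with the distance d between the target and the distractors. A Python sketch of this generic form (not necessarily the article's exact parameterization):

    import numpy as np
    from scipy.stats import norm
    from scipy.integrate import quad

    def p_correct_mafc(d, m):
        """P(correct) for an m-alternative choice under equal-variance
        Gaussian SDT: the target (mean d) beats all m-1 distractors (mean 0)."""
        integrand = lambda x: norm.pdf(x - d) * norm.cdf(x) ** (m - 1)
        return quad(integrand, -10, 10)[0]

    print(p_correct_mafc(0.0, 4))   # d = 0: chance level, 0.25
    print(p_correct_mafc(1.5, 4))   # larger d: well above chance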
Peer reviewed
Direct link
Kim, Kyung Yong; Lim, Euijin; Lee, Won-Chan – International Journal of Testing, 2019
For passage-based tests, items that belong to a common passage often violate the local independence assumption of unidimensional item response theory (UIRT). In this case, ignoring local item dependence (LID) and estimating item parameters using a UIRT model could be problematic because doing so might result in inaccurate parameter estimates,…
Descriptors: Item Response Theory, Equated Scores, Test Items, Models
Peer reviewed
Direct link
Ippel, Lianne; Magis, David – Educational and Psychological Measurement, 2020
In the dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic for evaluating the precision of various ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned, and new ASE formulas were derived from a general…
Descriptors: Item Response Theory, Error of Measurement, Accuracy, Standards
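The textbook ASE for the maximum likelihood ability estimate is the inverse square root of the test information at that ability. A Python sketch for the 2PL case (the familiar baseline formula, not the newly derived alternatives the article evaluates; item parameters are invented):

    import numpy as np

    def test_information_2pl(theta, a, b):
        """Fisher test information at theta for a set of 2PL items."""
        p = 1 / (1 + np.exp(-a * (theta - b)))
        return np.sum(a**2 * p * (1 - p))

    a = np.array([1.0, 1.4, 0.8, 1.2])
    b = np.array([-0.5, 0.0, 0.4, 1.1])
    theta_hat = 0.3
    ase = 1 / np.sqrt(test_information_2pl(theta_hat, a, b))
    print(ase)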
Peer reviewed
Direct link
Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…
Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format
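Schematically, such a model can let each person's logit include a persistence term scaled by a function of item position. A hedged Python sketch of this general idea (the function f and all values are illustrative assumptions, not the authors' specification):

    import numpy as np

    def p_correct(theta, gamma, a, b, pos, f=np.log1p):
        """Schematic position-effect model: ability theta plus a person-specific
        persistence gamma scaled by a (possibly nonlinear) function of position."""
        return 1 / (1 + np.exp(-a * (theta + gamma * f(pos) - b)))

    # gamma < 0 mimics fatigue: the same person fares worse late in the test.
    print(p_correct(0.5, -0.10, 1.0, 0.0, pos=1))
    print(p_correct(0.5, -0.10, 1.0, 0.0, pos=40))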
Peer reviewed
PDF on ERIC Download full text
Yilmaz, Haci Bayram – International Electronic Journal of Elementary Education, 2019
Open-ended and multiple-choice questions are commonly placed on the same tests; however, there is ongoing discussion of the effects of using different item types on test and item statistics. This study aims to compare model and item fit statistics in a mixed-format test where multiple-choice and constructed-response items are used together. In this…
Descriptors: Item Response Theory, Models, Goodness of Fit, Elementary School Science
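One familiar item-fit index in this setting is a mean-square statistic computed from standardized residuals. A Python sketch of a Rasch-style outfit statistic (an illustrative example of an item-fit statistic, not necessarily among those compared in the study):

    import numpy as np

    def outfit(u, p):
        """Unweighted (outfit) mean square for one item: the average squared
        standardized residual across persons; values near 1 indicate fit."""
        return np.mean((u - p) ** 2 / (p * (1 - p)))

    p = np.array([0.2, 0.5, 0.8, 0.9])   # model-implied probabilities
    u = np.array([0, 1, 1, 1])           # observed responses
    print(outfit(u, p))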
Peer reviewed
Direct link
Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017
The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items from a given set (e.g., to choose and answer one item from a pair), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…
Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis
Peer reviewed
PDF on ERIC Download full text
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
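In the nested logit approach of Suh and Bolt, a 2PL governs the correct response and a multinomial logit distributes the remaining probability across the distractors. A Python sketch of that structure (parameter values invented):

    import numpy as np

    def nlm_probs(theta, a, b, zeta, lam):
        """2PL nested logit: P(correct) from a 2PL; distractor probabilities
        from a multinomial logit conditional on an incorrect response."""
        p_correct = 1 / (1 + np.exp(-a * (theta - b)))
        z = zeta + lam * theta
        p_distractor = (1 - p_correct) * np.exp(z) / np.exp(z).sum()
        return p_correct, p_distractor

    p_c, p_d = nlm_probs(theta=-0.5, a=1.2, b=0.1,
                         zeta=np.array([0.4, 0.0, -0.3]),
                         lam=np.array([-0.6, 0.1, 0.5]))
    print(p_c, p_d, p_c + p_d.sum())   # the four probabilities sum to 1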
Peer reviewed
Direct link
Tao, Wei; Cao, Yi – Applied Measurement in Education, 2016
Current procedures for equating number-correct scores using traditional item response theory (IRT) methods assume local independence. However, when tests are constructed using testlets, one concern is the violation of the local item independence assumption. The testlet response theory (TRT) model is one way to accommodate local item dependence.…
Descriptors: Item Response Theory, Equated Scores, Test Format, Models
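The TRT model handles the dependence by adding a person-by-testlet interaction to the usual logit, so items sharing a passage shift together. A Python sketch of the 2PL-style form (values invented; estimation details omitted):

    import numpy as np

    def p_trt(theta, gamma_jd, a, b):
        """Testlet response theory (2PL form): gamma_jd is the effect of
        testlet d on person j, absorbing within-passage dependence."""
        return 1 / (1 + np.exp(-a * (theta - b - gamma_jd)))

    # Same item and ability; the testlet effect shifts the whole passage.
    print(p_trt(0.2, 0.0, 1.1, -0.1))   # no testlet effect
    print(p_trt(0.2, 0.5, 1.1, -0.1))   # person finds this passage harder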
Peer reviewed
Direct link
Bush, Martin – Assessment & Evaluation in Higher Education, 2015
The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time-consuming to take, and for some subject areas, it can be…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability
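The guesswork concern is easy to quantify with classical arithmetic: blind guessing on an m-option item succeeds with probability 1/m, and the familiar rights-minus-wrongs correction has expectation zero under blind guessing. A small Python illustration (textbook results, not this article's proposals):

    # Classical guessing arithmetic for an m-option multiple-choice test.
    k, m = 40, 4                  # items, options per item
    expected_blind_score = k / m  # pure guessing earns k/m items on average
    print(expected_blind_score)   # 10.0

    # Correction for guessing: right - wrong/(m-1) has expectation 0
    # when every wrong answer comes from blind guessing.
    right, wrong = 16, 24
    print(right - wrong / (m - 1))  # 8.0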
Peer reviewed
Direct link
Baldonado, Angela Argo; Svetina, Dubravka; Gorin, Joanna – Applied Measurement in Education, 2015
Applications of traditional unidimensional item response theory models to passage-based reading comprehension assessment data have been criticized based on potential violations of local independence. However, simple rules for determining dependency, such as including all items associated with a particular passage, may overestimate the dependency…
Descriptors: Reading Tests, Reading Comprehension, Test Items, Item Response Theory
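A common empirical index for such dependency is Yen's Q3, the correlation between two items' IRT residuals. A Python sketch on simulated locally independent data, where Q3 should hover near zero (true abilities stand in for estimates, a simplifying assumption):

    import numpy as np

    rng = np.random.default_rng(0)
    theta = rng.normal(size=2000)

    def p_2pl(theta, a, b):
        return 1 / (1 + np.exp(-a * (theta - b)))

    # Two locally independent 2PL items; Q3 is the correlation of their
    # residuals u - P(theta).
    p1, p2 = p_2pl(theta, 1.2, -0.3), p_2pl(theta, 0.9, 0.5)
    u1, u2 = rng.binomial(1, p1), rng.binomial(1, p2)
    q3 = np.corrcoef(u1 - p1, u2 - p2)[0, 1]
    print(q3)   # near zero when local independence holds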
Peer reviewed
PDF on ERIC Download full text
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation
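A generic way to place rater severity inside an IRT logit is a facets-style shift of the item location by a rater parameter. A schematic Python sketch (not Yao's rater model itself; values invented):

    import numpy as np

    def p_rated(theta, a, b, severity):
        """Schematic rater-effect model: a severe rater (severity > 0)
        lowers the chance a response is scored in the higher category."""
        return 1 / (1 + np.exp(-a * (theta - b - severity)))

    # The same response looks worse under a harsher rater.
    print(p_rated(0.0, 1.0, 0.0, severity=0.0))
    print(p_rated(0.0, 1.0, 0.0, severity=0.7))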