ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	6

Descriptor

Bayesian Statistics	7
Comparative Analysis	7
Item Response Theory	5
Difficulty Level	3
Models	3
Adaptive Testing	2
Error of Measurement	2
Markov Processes	2
Monte Carlo Methods	2
Simulation	2
Test Bias	2
Test Items	2
Ability Identification	1
Accuracy	1
Check Lists	1
Computation	1
Computer Assisted Testing	1
Computer Simulation	1
Computer Software	1
Correlation	1
Data Analysis	1
English	1
Estimation (Mathematics)	1
Ethics	1
Guessing (Tests)	1
More ▼

Source

Journal of Educational…

Author

Ames, Allison	1
De Ayala, R. J.	1
De Boeck, Paul	1
Frederickx, Sofie	1
Isham, Steven	1
Kim, Sooyeon	1
Lee, Soo	1
Magis, David	1
Moses, Tim	1
Shi, Ning-Zhong	1
Smith, Elizabeth	1
Suh, Youngsuk	1
Tao, Jian	1
Tuerlinckx, Francis	1
Wang, Chun	1
Ye, Lei	1
Yoo, Hanwook	1
Zhang, Xue	1
Zwick, Rebecca	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	6
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Bayesian Model Selection Methods for Multilevel IRT Models: A Comparison of Five DIC-Based Indices

Peer reviewed

Direct link

Zhang, Xue; Tao, Jian; Wang, Chun; Shi, Ning-Zhong – Journal of Educational Measurement, 2019

Model selection is important in any statistical analysis, and the primary goal is to find the preferred (or most parsimonious) model, based on certain criteria, from a set of candidate models given data. Several recent publications have employed the deviance information criterion (DIC) to do model selection among different forms of multilevel item…

Descriptors: Bayesian Statistics, Item Response Theory, Measurement, Models

Aggregating Polytomous DIF Results over Multiple Test Administrations

Peer reviewed

Direct link

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018

In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…

Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics

Subjective Priors for Item Response Models: Application of Elicitation by Design

Peer reviewed

Direct link

Ames, Allison; Smith, Elizabeth – Journal of Educational Measurement, 2018

Bayesian methods incorporate model parameter information prior to data collection. Eliciting information from content experts is an option, but has seen little implementation in Bayesian item response theory (IRT) modeling. This study aims to use ethical reasoning content experts to elicit prior information and incorporate this information into…

Descriptors: Item Response Theory, Bayesian Statistics, Ethics, Specialists

Lord's Wald Test for Detecting Dif in Multidimensional Irt Models: A Comparison of Two Estimation Approaches

Peer reviewed

Direct link

Lee, Soo; Suh, Youngsuk – Journal of Educational Measurement, 2018

Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…

Descriptors: Item Response Theory, Sample Size, Models, Error of Measurement

A Comparison of IRT Proficiency Estimation Methods under Adaptive Multistage Testing

Peer reviewed

Direct link

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015

This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…

Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy

RIM: A Random Item Mixture Model to Detect Differential Item Functioning

Peer reviewed

Direct link

Frederickx, Sofie; Tuerlinckx, Francis; De Boeck, Paul; Magis, David – Journal of Educational Measurement, 2010

In this paper we present a new methodology for detecting differential item functioning (DIF). We introduce a DIF model, called the random item mixture (RIM), that is based on a Rasch model with random item difficulties (besides the common random person abilities). In addition, a mixture model is assumed for the item difficulties such that the…

Descriptors: Test Bias, Models, Test Items, Difficulty Level

A Simulation and Comparison of Flexilevel and Bayesian Computerized Adaptive Testing.

Peer reviewed

De Ayala, R. J.; And Others – Journal of Educational Measurement, 1990

F. M. Lord's flexilevel, computerized adaptive testing (CAT) procedure was compared to an item-response theory-based CAT procedure that uses Bayesian ability estimation with various standard errors of estimates used for terminating the test. Ability estimates of flexilevel CATs were as accurate as were those of Bayesian CATs. (TJH)

Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Comparative Analysis