Showing 1 to 15 of 23 results
Peer reviewed
PDF on ERIC
Fatih Orcan – International Journal of Assessment Tools in Education, 2023
Cronbach's alpha and McDonald's omega are among the most commonly used reliability estimates. Alpha is based on inter-item correlations, while omega is based on the results of a factor analysis. This study uses simulated ordinal data sets to test whether alpha and omega produce different estimates. Their performances were compared according to the…
Descriptors: Statistical Analysis, Monte Carlo Methods, Correlation, Factor Analysis
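As a rough illustration of the distinction drawn in the abstract above, the sketch below computes both coefficients for a toy one-factor data set. The loadings passed to the omega function are the generating values rather than estimates from a fitted factor model, so the numbers are only indicative.

```python
import numpy as np

def cronbach_alpha(items):
    """Alpha from item and total-score variances: k/(k-1) * (1 - sum(item variances) / var(total))."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars.sum() / total_var)

def mcdonald_omega(loadings, uniquenesses):
    """Omega from a one-factor model: (sum of loadings)^2 / ((sum of loadings)^2 + sum of uniquenesses)."""
    lam = np.sum(loadings)
    return lam**2 / (lam**2 + np.sum(uniquenesses))

# Toy demonstration with a known one-factor structure.
rng = np.random.default_rng(0)
theta = rng.normal(size=(500, 1))
load = np.array([0.7, 0.6, 0.8, 0.5])
x = theta @ load[None, :] + rng.normal(scale=np.sqrt(1 - load**2), size=(500, 4))
print(cronbach_alpha(x))
print(mcdonald_omega(load, 1 - load**2))  # true loadings used here purely for illustration
```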
Jinjin Huang – ProQuest LLC, 2020
Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the latent trait varies consistently across subgroups; in other words, the mean differences among subgroups are only due to true latent ability differences. Differential item functioning (DIF) occurs when measurement invariance is violated.…
Descriptors: Robustness (Statistics), Item Response Theory, Test Items, Item Analysis
Peer reviewed
Direct link
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
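The reliability coefficient whose sampling variability this article addresses can be illustrated with one common empirical (marginal) definition; the exact formula depends on the ability estimator used, and the bootstrap standard error shown here is only a crude numerical stand-in for the analytic expressions the authors derive.

```python
import numpy as np

def empirical_reliability(theta_hat, se):
    """One common empirical reliability for EAP-type scores:
    score variance / (score variance + mean error variance)."""
    theta_hat = np.asarray(theta_hat, float)
    err_var = np.mean(np.asarray(se, float) ** 2)
    return theta_hat.var(ddof=1) / (theta_hat.var(ddof=1) + err_var)

def bootstrap_se(theta_hat, se, n_boot=1000, seed=0):
    """Naive nonparametric bootstrap SE of the reliability estimate (a stand-in for analytic SEs)."""
    rng = np.random.default_rng(seed)
    theta_hat, se = np.asarray(theta_hat), np.asarray(se)
    n = len(theta_hat)
    reps = [empirical_reliability(theta_hat[idx], se[idx])
            for idx in (rng.integers(0, n, n) for _ in range(n_boot))]
    return np.std(reps, ddof=1)
```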
Peer reviewed
Direct link
Lee, Wooyeol; Cho, Sun-Joo – Applied Measurement in Education, 2017
Utilizing a longitudinal item response model, this study investigated the effect of item parameter drift (IPD) on item parameters and person scores via a Monte Carlo study. Item parameter recovery was investigated for various IPD patterns in terms of bias and root mean-square error (RMSE), and percentage of time the 95% confidence interval covered…
Descriptors: Item Response Theory, Test Items, Bias, Computation
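Bias, RMSE, and 95% confidence-interval coverage, the recovery criteria named in the abstract, reduce to a few lines once the generating value and the per-replication estimates are collected. A minimal sketch:

```python
import numpy as np

def recovery_summary(true_value, estimates, ses, z=1.96):
    """Bias, RMSE, and 95% CI coverage across Monte Carlo replications.

    true_value: generating parameter (scalar)
    estimates : estimated parameter from each replication
    ses       : standard error of the estimate from each replication
    """
    est = np.asarray(estimates, float)
    se = np.asarray(ses, float)
    bias = np.mean(est - true_value)
    rmse = np.sqrt(np.mean((est - true_value) ** 2))
    lower, upper = est - z * se, est + z * se
    coverage = np.mean((lower <= true_value) & (true_value <= upper))
    return {"bias": bias, "rmse": rmse, "coverage": coverage}
```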
Peer reviewed
Direct link
Cao, Mengyang; Tay, Louis; Liu, Yaowu – Educational and Psychological Measurement, 2017
This study examined the performance of a proposed iterative Wald approach for detecting differential item functioning (DIF) between two groups when preknowledge of anchor items is absent. The iterative approach utilizes the Wald-2 approach to identify anchor items and then iteratively tests for DIF items with the Wald-1 approach. Monte Carlo…
Descriptors: Monte Carlo Methods, Test Items, Test Bias, Error of Measurement
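The control flow of the iterative procedure described above can be sketched as follows. Here `fit_two_group_model` and `wald_dif_test` are hypothetical callbacks standing in for whatever IRT software actually computes the Wald-2 and Wald-1 statistics; only the anchor-selection loop is shown.

```python
def iterative_wald_dif(data, items, fit_two_group_model, wald_dif_test, alpha=0.05, max_iter=20):
    """Sketch of an iterative Wald DIF procedure (control flow only).

    fit_two_group_model(data, anchors) and wald_dif_test(model, item) are hypothetical
    placeholders; the latter is assumed to return a p-value for the tested item.
    """
    # Step 1: initial (Wald-2 style) screening with all items treated as anchors.
    model = fit_two_group_model(data, anchors=items)
    flagged = {i for i in items if wald_dif_test(model, i) < alpha}

    # Step 2: iterate Wald-1 style tests, anchoring only on items not currently flagged.
    for _ in range(max_iter):
        anchors = [i for i in items if i not in flagged]
        model = fit_two_group_model(data, anchors=anchors)
        new_flagged = {i for i in items if wald_dif_test(model, i) < alpha}
        if new_flagged == flagged:  # stop once the flagged set stabilizes
            break
        flagged = new_flagged
    return flagged
```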
Peer reviewed
Direct link
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
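The omega index is, in essence, a standardized count of response matches between the suspected copier and the source, with the expectation taken under an item response model evaluated at the copier's estimated ability. The sketch below shows only that general form; the article's contribution, an algorithm for removing suspected copied responses before estimating the copier's ability, is not implemented here.

```python
import numpy as np

def omega_statistic(copier_resp, source_resp, option_probs):
    """Standardized match count between a suspected copier and a source.

    copier_resp, source_resp: chosen option index per item
    option_probs: per-item arrays of option probabilities for the copier under some
                  response model at the copier's estimated ability (model-dependent).
    """
    matches = np.sum(np.asarray(copier_resp) == np.asarray(source_resp))
    # Probability that the copier would independently choose the source's option on each item.
    p = np.array([probs[src] for probs, src in zip(option_probs, source_resp)])
    expected = p.sum()
    variance = (p * (1 - p)).sum()
    return (matches - expected) / np.sqrt(variance)
```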
Peer reviewed
Direct link
Chalmers, R. Philip; Counsell, Alyssa; Flora, David B. – Educational and Psychological Measurement, 2016
Differential test functioning, or DTF, occurs when one or more items in a test demonstrate differential item functioning (DIF) and the aggregate of these effects is observed at the test level. In many applications, DTF can be more important than DIF when the overall effects of DIF at the test level can be quantified. However, optimal statistical…
Descriptors: Test Bias, Sampling, Test Items, Statistical Analysis
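One way to picture DTF as an aggregate of item-level DIF is to compare expected test scores between groups across the ability range. The sketch below does this for a 2PL model on a crude quadrature grid; it is an approximation in the spirit of signed and unsigned DTF summaries, not the article's exact estimators.

```python
import numpy as np

def two_pl(theta, a, b):
    """2PL item response function for a grid of abilities and a set of items."""
    return 1.0 / (1.0 + np.exp(-np.asarray(a) * (np.asarray(theta)[:, None] - np.asarray(b))))

def dtf(a_ref, b_ref, a_foc, b_foc, theta=None):
    """Signed and unsigned differences in expected test scores between groups,
    averaged over an (approximately) standard normal ability distribution."""
    if theta is None:
        theta = np.linspace(-4, 4, 81)
    w = np.exp(-0.5 * theta**2)
    w /= w.sum()
    diff = two_pl(theta, a_ref, b_ref).sum(axis=1) - two_pl(theta, a_foc, b_foc).sum(axis=1)
    signed = np.sum(w * diff)
    unsigned = np.sum(w * np.abs(diff))
    return signed, unsigned
```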
Peer reviewed
Direct link
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
Peer reviewed
Direct link
Padilla, Miguel A.; Divers, Jasmin – Educational and Psychological Measurement, 2013
The performance of the normal theory bootstrap (NTB), the percentile bootstrap (PB), and the bias-corrected and accelerated (BCa) bootstrap confidence intervals (CIs) for coefficient omega was assessed through a Monte Carlo simulation under conditions not previously investigated. Of particular interest were nonnormal Likert-type and binary items.…
Descriptors: Sampling, Statistical Inference, Computation, Statistical Analysis
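A percentile bootstrap interval, the simplest of the three intervals compared, is easy to sketch around any function that returns coefficient omega for a resampled data matrix. The `statistic` callback below is a placeholder for such a function (e.g., one that refits a one-factor model); the normal-theory and BCa variants need additional pieces not shown.

```python
import numpy as np

def percentile_bootstrap_ci(data, statistic, n_boot=2000, level=0.95, seed=0):
    """Percentile bootstrap CI for any statistic of an item-response matrix (rows = examinees).

    `statistic` is a user-supplied function, e.g. one computing coefficient omega
    from a resampled data set.
    """
    rng = np.random.default_rng(seed)
    data = np.asarray(data)
    n = data.shape[0]
    reps = np.array([statistic(data[rng.integers(0, n, n)]) for _ in range(n_boot)])
    lo, hi = np.percentile(reps, [100 * (1 - level) / 2, 100 * (1 + level) / 2])
    return lo, hi
```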
Peer reviewed
Direct link
Wang, Chun; Fan, Zhewen; Chang, Hua-Hua; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2013
The item response times (RTs) collected from computerized testing represent an underutilized type of information about items and examinees. In addition to knowing the examinees' responses to each item, we can investigate the amount of time examinees spend on each item. Current approaches mainly focus on parametric models, which have the…
Descriptors: Reaction Time, Computer Assisted Testing, Test Items, Accuracy
Peer reviewed
Direct link
Finch, W. Holmes – Applied Psychological Measurement, 2012
Increasingly, researchers interested in identifying potentially biased test items are encouraged to use a confirmatory, rather than exploratory, approach. One such method for confirmatory testing is rooted in differential bundle functioning (DBF), where hypotheses regarding potential differential item functioning (DIF) for sets of items (bundles)…
Descriptors: Test Bias, Test Items, Statistical Analysis, Models
Peer reviewed
Direct link
Wolkowitz, Amanda A.; Skorupski, William P. – Educational and Psychological Measurement, 2013
When missing values are present in item response data, there are a number of ways one might impute a correct or incorrect response to a multiple-choice item. There are significantly fewer methods for imputing the actual response option an examinee may have provided if he or she had not omitted the item either purposely or accidentally. This…
Descriptors: Multiple Choice Tests, Statistical Analysis, Models, Accuracy
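To make the distinction concrete, imputing the actual response option (rather than a correct/incorrect score) could be done, naively, by sampling from the option frequencies of examinees who answered the item. This is an assumption-laden illustration only, not one of the methods compared in the study.

```python
import numpy as np

def impute_option(responses, item, rng=None):
    """Sample a plausible option for an omitted item from the empirical option frequencies
    of examinees who answered it. Naive illustration; -1 marks an omitted response."""
    rng = rng or np.random.default_rng(0)
    observed = np.asarray(responses)[:, item]
    observed = observed[observed >= 0]
    options, counts = np.unique(observed, return_counts=True)
    return rng.choice(options, p=counts / counts.sum())
```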
Peer reviewed
PDF on ERIC
Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseph; Ford, Danielle; Fifield, Steve – ETS Research Report Series, 2017
In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than on other items, for…
Descriptors: Item Analysis, Gender Differences, Achievement Gap, Grade 8
Peer reviewed
Direct link
Babcock, Ben – Applied Psychological Measurement, 2011
Relatively little research has been conducted with the noncompensatory class of multidimensional item response theory (MIRT) models. A Monte Carlo simulation study was conducted exploring the estimation of a two-parameter noncompensatory item response theory (IRT) model. The estimation method used was a Metropolis-Hastings within Gibbs algorithm…
Descriptors: Item Response Theory, Sampling, Computation, Statistical Analysis
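In the noncompensatory model class referred to above, the probabilities contributed by each dimension multiply, so high ability on one dimension cannot offset low ability on another. A minimal sketch of such a two-parameter item response function follows; the Metropolis-Hastings within Gibbs estimation itself is not shown.

```python
import numpy as np

def noncompensatory_2p(theta, a, b):
    """Noncompensatory two-parameter item response function for a single item.

    theta: (n_persons, n_dims) abilities; a, b: (n_dims,) discrimination and difficulty.
    The per-dimension 2PL probabilities are multiplied, so every dimension must be
    'passed' for a high overall probability of a correct response.
    """
    per_dim = 1.0 / (1.0 + np.exp(-np.asarray(a) * (np.asarray(theta) - np.asarray(b))))
    return per_dim.prod(axis=1)
```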
Peer reviewed
Direct link
DeCarlo, Lawrence T. – Applied Psychological Measurement, 2011
Cognitive diagnostic models (CDMs) attempt to uncover latent skills or attributes that examinees must possess in order to answer test items correctly. The DINA (deterministic input, noisy "and") model is a popular CDM that has been widely used. It is shown here that a logistic version of the model can easily be fit with standard software for…
Descriptors: Bayesian Statistics, Computation, Cognitive Tests, Diagnostic Tests
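The DINA response probability depends on a latent indicator of whether an examinee masters every attribute an item requires; the logistic version mentioned in the abstract essentially models the logit of this probability as an intercept plus a slope for that indicator. A sketch of the standard slip/guess form, assuming 0/1 mastery profiles and a 0/1 Q-matrix:

```python
import numpy as np

def dina_prob(alpha, q, slip, guess):
    """DINA model response probabilities.

    alpha: (n_persons, n_attrs) 0/1 attribute mastery profiles
    q    : (n_items, n_attrs) 0/1 Q-matrix of required attributes
    slip, guess: per-item slip and guessing parameters
    """
    alpha, q = np.asarray(alpha), np.asarray(q)
    # eta = 1 only if the examinee masters every attribute the item requires
    eta = np.all(alpha[:, None, :] >= q[None, :, :], axis=2).astype(float)
    return eta * (1 - np.asarray(slip)) + (1 - eta) * np.asarray(guess)
```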