ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	7

Descriptor

Computation	7
Data Analysis	7
Item Response Theory	5
Models	4
Correlation	3
Simulation	3
Test Items	3
Bayesian Statistics	2
Evaluation Methods	2
Measurement	2
Statistical Analysis	2
Test Bias	2
Tests	2
Ability	1
Accuracy	1
Educational Testing	1
Elementary Secondary Education	1
Error Patterns	1
Goodness of Fit	1
Grade 4	1
High School Students	1
Intervals	1
Markov Processes	1
Maximum Likelihood Statistics	1
Measurement Techniques	1
More ▼

Source

Applied Psychological…

Author

Chan, Tsze	1
Cohen, Jon	1
Fukuhara, Hirotaka	1
Gu, Fei	1
Hoyle, Larry	1
Jiang, Tao	1
Kamata, Akihito	1
Kingston, Neal M.	1
Lei, Pui-Wa	1
Li, Hongli	1
Seburn, Mary	1
Skorupski, William P.	1
Song, Hao	1
Walker, Cindy M.	1
Zhang, Bo	1
Zhang, Jinming	1
de la Torre, Jimmy	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	6
Reports - Evaluative	1

Education Level

Elementary Secondary Education	1
Grade 4	1
High Schools	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Small-Sample DIF Estimation Using SIBTEST, Cochran's Z, and Log-Linear Smoothing

Peer reviewed

Direct link

Lei, Pui-Wa; Li, Hongli – Applied Psychological Measurement, 2013

Minimum sample sizes of about 200 to 250 per group are often recommended for differential item functioning (DIF) analyses. However, there are times when sample sizes for one or both groups of interest are smaller than 200 due to practical constraints. This study attempts to examine the performance of Simultaneous Item Bias Test (SIBTEST),…

Descriptors: Sample Size, Test Bias, Computation, Accuracy

A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

Peer reviewed

Direct link

Fukuhara, Hirotaka; Kamata, Akihito – Applied Psychological Measurement, 2011

A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

Descriptors: Item Response Theory, Test Bias, Test Items, Bayesian Statistics

Calibration of Response Data Using MIRT Models with Simple and Mixed Structures

Peer reviewed

Direct link

Zhang, Jinming – Applied Psychological Measurement, 2012

It is common to assume during a statistical analysis of a multiscale assessment that the assessment is composed of several unidimensional subtests or that it has simple structure. Under this assumption, the unidimensional and multidimensional approaches can be used to estimate item parameters. These two approaches are equivalent in parameter…

Descriptors: Simulation, Computation, Models, Statistical Analysis

Standard Errors and Confidence Intervals from Bootstrapping for Ramsay-Curve Item Response Theory Model Item Parameters

Peer reviewed

Direct link

Gu, Fei; Skorupski, William P.; Hoyle, Larry; Kingston, Neal M. – Applied Psychological Measurement, 2011

Ramsay-curve item response theory (RC-IRT) is a nonparametric procedure that estimates the latent trait using splines, and no distributional assumption about the latent trait is required. For item parameters of the two-parameter logistic (2-PL), three-parameter logistic (3-PL), and polytomous IRT models, RC-IRT can provide more accurate estimates…

Descriptors: Intervals, Item Response Theory, Models, Evaluation Methods

Simultaneous Estimation of Overall and Domain Abilities: A Higher-Order IRT Model Approach

Peer reviewed

Direct link

de la Torre, Jimmy; Song, Hao – Applied Psychological Measurement, 2009

Assessments consisting of different domains (e.g., content areas, objectives) are typically multidimensional in nature but are commonly assumed to be unidimensional for estimation purposes. The different domains of these assessments are further treated as multi-unidimensional tests for the purpose of obtaining diagnostic information. However, when…

Descriptors: Ability, Tests, Item Response Theory, Data Analysis

Impact of Missing Data on Person-Model Fit and Person Trait Estimation

Peer reviewed

Direct link

Zhang, Bo; Walker, Cindy M. – Applied Psychological Measurement, 2008

The purpose of this research was to examine the effects of missing data on person-model fit and person trait estimation in tests with dichotomous items. Under the missing-completely-at-random framework, four missing data treatment techniques were investigated including pairwise deletion, coding missing responses as incorrect, hotdeck imputation,…

Descriptors: Item Response Theory, Computation, Goodness of Fit, Test Items

Consistent Estimation of Rasch Item Parameters and Their Standard Errors under Complex Sample Designs

Peer reviewed

Direct link

Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008

U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…

Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation