ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	9

Descriptor

Bayesian Statistics	12
Statistical Analysis	12
Test Bias	12
Computation	5
Test Items	4
Achievement Tests	3
Foreign Countries	3
Item Analysis	3
Mathematics Tests	3
Models	3
Sample Size	3
Comparative Analysis	2
Error of Measurement	2
Gender Differences	2
International Assessment	2
Item Response Theory	2
Psychometrics	2
Racial Differences	2
Regression (Statistics)	2
Scores	2
Ability Identification	1
Accuracy	1
Achievement Gap	1
Adaptive Testing	1
Aptitude Tests	1
More ▼

Source

Journal of Educational and…	5
ETS Research Report Series	4
Educational and Psychological…	1
Journal of Experimental…	1

Publication Type

Journal Articles	11
Reports - Research	9
Guides - Classroom - Learner	1
Opinion Papers	1
Reports - Evaluative	1

Education Level

Elementary Education	1
Grade 5	1
Grade 8	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Brazil	1
Canada	1

Laws, Policies, & Programs

Assessments and Surveys

Pre Professional Skills Tests	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Statistical Equivalence Testing Approaches for Mantel-Haenszel DIF Analysis

Peer reviewed

Direct link

Casabianca, Jodi M.; Lewis, Charles – Journal of Educational and Behavioral Statistics, 2018

The null hypothesis test used in differential item functioning (DIF) detection tests for a subgroup difference in item-level performance--if the null hypothesis of "no DIF" is rejected, the item is flagged for DIF. Conversely, an item is kept in the test form if there is insufficient evidence of DIF. We present frequentist and empirical…

Descriptors: Test Bias, Hypothesis Testing, Bayesian Statistics, Statistical Analysis

Detection of Differential Item Functioning Using the Lasso Approach

Peer reviewed

Direct link

Magis, David; Tuerlinckx, Francis; De Boeck, Paul – Journal of Educational and Behavioral Statistics, 2015

This article proposes a novel approach to detect differential item functioning (DIF) among dichotomously scored items. Unlike standard DIF methods that perform an item-by-item analysis, we propose the "LR lasso DIF method": logistic regression (LR) model is formulated for all item responses. The model contains item-specific intercepts,…

Descriptors: Test Bias, Test Items, Regression (Statistics), Scores

Evaluation of Model Selection Strategies for Cross-Level Two-Way Differential Item Functioning Analysis

Peer reviewed

Direct link

Patarapichayatham, Chalie; Kamata, Akihito; Kanjanawasee, Sirichai – Educational and Psychological Measurement, 2012

Model specification issues on the cross-level two-way differential item functioning model were previously investigated by Patarapichayatham et al. (2009). Their study clarified that an incorrect model specification can easily lead to biased estimates of key parameters. The objective of this article is to provide further insights on the issue by…

Descriptors: Test Bias, Models, Bayesian Statistics, Statistical Analysis

Improving Mantel-Haenszel DIF Estimation through Bayesian Updating

Peer reviewed

Direct link

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational and Behavioral Statistics, 2012

This study demonstrates how the stability of Mantel-Haenszel (MH) DIF (differential item functioning) methods can be improved by integrating information across multiple test administrations using Bayesian updating (BU). The authors conducted a simulation that showed that this approach, which is based on earlier work by Zwick, Thayer, and Lewis,…

Descriptors: Test Bias, Computation, Statistical Analysis, Bayesian Statistics

Two Simple Approaches to Overcome a Problem with the Mantel-Haenszel Statistic: Comments on Wang, Bradlow, Wainer, and Muller (2008)

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J. – Journal of Educational and Behavioral Statistics, 2010

The Mantel-Haenszel (MH) procedure (Mantel and Haenszel) is a popular method for estimating and testing a common two-factor association parameter in a 2 x 2 x K table. Holland and Holland and Thayer described how to use the procedure to detect differential item functioning (DIF) for tests with dichotomously scored items. Wang, Bradlow, Wainer, and…

Descriptors: Test Bias, Statistical Analysis, Computation, Bayesian Statistics

Gender and Minority Achievement Gaps in Science in Eighth Grade: Item Analyses of Nationally Representative Data. Research Report. ETS RR-17-36

Peer reviewed
PDF on ERIC

Download full text

Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseoph; Ford, Danielle; Fifield, Steve – ETS Research Report Series, 2017

In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than other items, for…

Descriptors: Item Analysis, Gender Differences, Achievement Gap, Grade 8

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed
PDF on ERIC

Download full text

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

An Integrated Bayesian Model for DIF Analysis

Peer reviewed

Direct link

Soares, Tufi M.; Goncalves, Flavio B.; Gamerman, Dani – Journal of Educational and Behavioral Statistics, 2009

In this article, an integrated Bayesian model for differential item functioning (DIF) analysis is proposed. The model is integrated in the sense of modeling the responses along with the DIF analysis. This approach allows DIF detection and explanation in a simultaneous setup. Previous empirical studies and/or subjective beliefs about the item…

Descriptors: Test Bias, Bayesian Statistics, Models, Item Response Theory

Empirical Bayes versus Standard Mantel-Haenszel Statistics for Detecting Differential Item Functioning under Small Sample Conditions

Peer reviewed

Direct link

Fidalgo, Angel M.; Hashimoto, Kanako; Bartram, Dave; Muniz, Jose – Journal of Experimental Education, 2007

In this study, the authors assess several strategies created on the basis of the Mantel-Haenszel (MH) procedure for conducting differential item functioning (DIF) analysis with small samples. One of the analytical strategies is a loss function (LF) that uses empirical Bayes Mantel-Haenszel estimators, whereas the other strategies use the classical…

Descriptors: Bayesian Statistics, Test Bias, Statistical Analysis, Sample Size

Using Past Data to Enhance Small-Sample DIF Estimation: A Bayesian Approach. Research Report. ETS RR-06-09

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Dorans, Neil J.; Grant, Mary C.; Blew, Edwin O.; Knorr, Colleen M. – ETS Research Report Series, 2006

The application of the Mantel-Haenszel test statistic (and other popular DIF-detection methods) to determine DIF requires large samples, but test administrators often need to detect DIF with small samples. There is no universally agreed upon statistical approach for performing DIF analysis with small samples; hence there is substantial scope of…

Descriptors: Test Bias, Computation, Sample Size, Bayesian Statistics

Model Diagnostics for Bayesian Networks. Research Report. ETS RR-04-17

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip – ETS Research Report Series, 2004

Assessing fit of psychometric models has always been an issue of enormous interest, but there exists no unanimously agreed upon item fit diagnostic for the models. Bayesian networks, frequently used in educational assessments (see, for example, Mislevy, Almond, Yan, & Steinberg, 2001) primarily for learning about students' knowledge and…

Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit

A Primer of Item Response Theory. Technical Report 940279.

Download full text

Warm, Thomas A. – 1978

This primer is an introduction to item response theory (also called item characteristic curve theory, or latent trait theory) as it is used most commonly--for scoring multiple choice achievement or aptitude tests. Written for the testing practitioner with minimum training in statistics and psychometrics, it presents and illustrates the basic…

Descriptors: Ability Identification, Achievement Tests, Adaptive Testing, Aptitude Tests

Sinharay, Sandip	3
Dorans, Neil J.	2
Zwick, Rebecca	2
Bartram, Dave	1
Blew, Edwin O.	1
Casabianca, Jodi M.	1
De Boeck, Paul	1
Fidalgo, Angel M.	1
Fifield, Steve	1
Ford, Danielle	1
Gamerman, Dani	1
Glutting, Joseoph	1
Goncalves, Flavio B.	1
Grant, Mary C.	1
Hashimoto, Kanako	1
Isham, Steven	1
Kamata, Akihito	1
Kanjanawasee, Sirichai	1
Knorr, Colleen M.	1
Lewis, Charles	1
Magis, David	1
Muniz, Jose	1
Nandakumar, Ratna	1
Patarapichayatham, Chalie	1
Qian, Xiaoyu	1
More ▼