Chalmers, R. Philip; Zheng, Guoguo – Applied Measurement in Education, 2023
This article presents generalizations of the SIBTEST and crossing-SIBTEST statistics for differential item functioning (DIF) investigations involving more than two groups. After reviewing the original two-group setup for these statistics, a set of multigroup generalizations that support contrast matrices for joint tests of DIF is presented. To…
Descriptors: Test Bias, Test Items, Item Response Theory, Error of Measurement
Ayse Bilicioglu Gunes; Bayram Bicak – International Journal of Assessment Tools in Education, 2023
The main purpose of this study is to examine the Type I error and statistical power rates of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord's chi-squared, and Raju's Areas…
Descriptors: Test Items, Item Response Theory, Error of Measurement, Test Bias
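The Mantel-Haenszel procedure that these simulation studies compare stratifies examinees by total score and pools the per-stratum 2x2 tables into a common odds ratio and a continuity-corrected chi-square. A minimal pure-Python sketch on synthetic tables; this illustrates the general technique, not the authors' simulation code:

```python
def mantel_haenszel_dif(tables):
    """Mantel-Haenszel DIF statistics from per-stratum 2x2 tables.

    Each table is (A, B, C, D):
      A = reference correct, B = reference incorrect,
      C = focal correct,     D = focal incorrect.
    Returns (common odds ratio alpha_MH, continuity-corrected chi-square).
    """
    num = den = a_sum = e_sum = v_sum = 0.0
    for a, b, c, d in tables:
        t = a + b + c + d
        if t < 2:
            continue  # stratum too small to contribute
        num += a * d / t
        den += b * c / t
        n_r, n_f = a + b, c + d   # group sizes in this stratum
        m1, m0 = a + c, b + d     # correct / incorrect margins
        a_sum += a
        e_sum += n_r * m1 / t     # E[A] under the no-DIF hypothesis
        v_sum += n_r * n_f * m1 * m0 / (t * t * (t - 1))
    alpha_mh = num / den
    chi2 = (abs(a_sum - e_sum) - 0.5) ** 2 / v_sum
    return alpha_mh, chi2

# Synthetic strata in which the reference group is consistently favored.
tables = [(30, 10, 20, 20), (25, 15, 15, 25), (35, 5, 25, 15)]
alpha, chi2 = mantel_haenszel_dif(tables)
print(round(alpha, 2), round(chi2, 2))  # 3.18 15.45
```

An odds ratio well above 1 with a large chi-square, as here, flags the item for DIF favoring the reference group.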
Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…
Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
Kim, Jihye; Oshima, T. C. – Educational and Psychological Measurement, 2013
In a typical differential item functioning (DIF) analysis, a significance test is conducted for each item. As a test consists of multiple items, such multiple testing may increase the possibility of making a Type I error at least once. The goal of this study was to investigate how to control a Type I error rate and power using adjustment…
Descriptors: Test Bias, Test Items, Statistical Analysis, Error of Measurement
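The inflation Kim and Oshima study follows from testing each of k items at level alpha: the familywise Type I error rate can approach 1 - (1 - alpha)^k. Two standard adjustments, sketched generically in Python; the p-values below are hypothetical, not taken from the study:

```python
def bonferroni(pvals):
    """Familywise control: scale each p-value by the number of tests."""
    k = len(pvals)
    return [min(1.0, p * k) for p in pvals]

def benjamini_hochberg(pvals):
    """FDR control: step-up adjustment over the sorted p-values."""
    k = len(pvals)
    order = sorted(range(k), key=lambda i: pvals[i])
    adjusted = [0.0] * k
    prev = 1.0
    for rank in range(k, 0, -1):       # walk from largest to smallest
        i = order[rank - 1]
        prev = min(prev, pvals[i] * k / rank)
        adjusted[i] = prev
    return adjusted

# Item-level DIF p-values for a hypothetical 5-item test.
p = [0.001, 0.012, 0.030, 0.200, 0.700]
print([round(x, 4) for x in bonferroni(p)])          # [0.005, 0.06, 0.15, 1.0, 1.0]
print([round(x, 4) for x in benjamini_hochberg(p)])  # [0.005, 0.03, 0.05, 0.25, 0.7]
```

Bonferroni protects the familywise error rate at the cost of power; Benjamini-Hochberg trades strict familywise control for higher power, which is the tension such studies quantify.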
Quesen, Sarah – ProQuest LLC, 2016
When studying differential item functioning (DIF) with students with disabilities (SWD), focal groups typically suffer from small sample sizes, whereas the reference group population is usually large. This makes it possible for a researcher to select a sample from the reference population that is similar to the focal group on the ability scale. Doing…
Descriptors: Test Items, Academic Accommodations (Disabilities), Testing Accommodations, Disabilities
Li, Yanju; Brooks, Gordon P.; Johanson, George A. – Educational and Psychological Measurement, 2012
In 2009, DeMars stated that when impact exists there will be Type I error inflation, especially with larger sample sizes and larger discrimination parameters for items. One purpose of this study is to present the patterns of Type I error rates using Mantel-Haenszel (MH) and logistic regression (LR) procedures when the mean ability between the…
Descriptors: Error of Measurement, Test Bias, Test Items, Regression (Statistics)
Svetina, Dubravka; Rutkowski, Leslie – Large-scale Assessments in Education, 2014
Background: When studying student performance across different countries or cultures, an important aspect for comparisons is that of score comparability. In other words, it is imperative that the latent variable (i.e., construct of interest) is understood and measured equivalently across all participating groups or countries, if our inferences…
Descriptors: Test Items, Item Response Theory, Item Analysis, Regression (Statistics)
Diakow, Ronli Phyllis – ProQuest LLC, 2013
This dissertation comprises three papers that propose, discuss, and illustrate models to make improved inferences about research questions regarding student achievement in education. Addressing the types of questions common in educational research today requires three different "extensions" to traditional educational assessment: (1)…
Descriptors: Inferences, Educational Assessment, Academic Achievement, Educational Research
Vaughn, Brandon K.; Wang, Qiu – Educational and Psychological Measurement, 2010
A nonparametric tree classification procedure is used to detect differential item functioning for items that are dichotomously scored. Classification trees are shown to be an alternative procedure to detect differential item functioning other than the use of traditional Mantel-Haenszel and logistic regression analysis. A nonparametric…
Descriptors: Test Bias, Classification, Nonparametric Statistics, Regression (Statistics)
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…
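Equipercentile equating, the traditional approach the abstract refers to, maps a form-X score x to the form-Y score with the same percentile rank, e_Y(x) = Q_Y(F_X(x)). A minimal sketch on synthetic score samples, leaving out the smoothing and interpolation details used operationally:

```python
import numpy as np

def equipercentile_equate(scores_x, scores_y, x):
    """Map score x on form X to the form-Y score with equal percentile rank."""
    scores_x = np.sort(np.asarray(scores_x))
    scores_y = np.sort(np.asarray(scores_y))
    # Percentile rank of x in the form-X distribution.
    p = np.searchsorted(scores_x, x, side="right") / len(scores_x)
    # Form-Y quantile at that rank.
    return float(np.quantile(scores_y, min(p, 1.0)))

rng = np.random.default_rng(0)
form_x = rng.normal(50, 10, 5000)  # new form, slightly easier on average
form_y = rng.normal(48, 10, 5000)  # old form
equated = equipercentile_equate(form_x, form_y, 60.0)  # roughly 58
```

Because the samples come from a single population here, a shift of the synthetic-population mean changes the equating function, which is exactly why the synthetic-population choice the article discusses matters.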
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
Culpepper, Steven Andrew – Multivariate Behavioral Research, 2009
This study linked nonlinear profile analysis (NPA) of dichotomous responses with an existing family of item response theory models and generalized latent variable models (GLVM). The NPA method offers several benefits over previous internal profile analysis methods: (a) NPA is estimated with maximum likelihood in a GLVM framework rather than…
Descriptors: Profiles, Item Response Theory, Models, Maximum Likelihood Statistics
Lu, Irene R. R.; Thomas, D. Roland – Structural Equation Modeling: A Multidisciplinary Journal, 2008
This article considers models involving a single structural equation with latent explanatory and/or latent dependent variables where discrete items are used to measure the latent variables. Our primary focus is the use of scores as proxies for the latent variables and carrying out ordinary least squares (OLS) regression on such scores to estimate…
Descriptors: Least Squares Statistics, Computation, Item Response Theory, Structural Equation Models
Haberman, Shelby J. – ETS Research Report Series, 2005
In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean-squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…
Descriptors: Scores, Test Items, Error of Measurement, Computation
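Haberman's guideline compares proportional reductions in mean-squared error (PRMSE): the subscore is worth reporting only if the observed subscore predicts the true subscore better than the total score does. A toy sketch using classical test theory identities; the function name and the simplifying uncorrelated-errors assumption are mine, not Haberman's notation:

```python
def subscore_adds_value(rel_sub, corr_sub_total):
    """Haberman-style check: report the subscore only if the observed
    subscore predicts the true subscore better than the total score does.

    rel_sub        : reliability of the subscore (its PRMSE as a predictor
                     of the true subscore)
    corr_sub_total : observed correlation between subscore and total score
    Assumes uncorrelated errors, so that
    corr(total, true subscore) = corr_sub_total / sqrt(rel_sub).
    """
    prmse_from_subscore = rel_sub
    prmse_from_total = corr_sub_total ** 2 / rel_sub
    return prmse_from_subscore > prmse_from_total

# A reliable subscore that is distinct from the total: worth reporting.
print(subscore_adds_value(rel_sub=0.85, corr_sub_total=0.70))  # True
# A subscore nearly collinear with the total: the total predicts as well.
print(subscore_adds_value(rel_sub=0.70, corr_sub_total=0.80))  # False
```

The second case illustrates the report's core finding: a noisy subscore that tracks the total closely carries no added information beyond the total score.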
