ERIC - Search Results

Publication Date

In 2026	0
Since 2025	12

Descriptor

Sample Size	12
Simulation	8
Models	6
Evaluation Methods	4
Item Response Theory	4
Accuracy	3
Educational Research	3
Statistical Analysis	3
Structural Equation Models	3
Algorithms	2
Bayesian Statistics	2
Causal Models	2
Computation	2
Error of Measurement	2
Factor Analysis	2
Goodness of Fit	2
Intervention	2
Matrices	2
Monte Carlo Methods	2
Programming Languages	2
Psychological Studies	2
Secondary School Students	2
Statistical Bias	2
Test Items	2
Achievement Tests	1
More ▼

Source

Journal of Educational and…	4
International Journal of…	2
Educational and Psychological…	1
Grantee Submission	1
International Educational…	1
Journal of Educational…	1
Practical Assessment,…	1
Structural Equation Modeling:…	1

Publication Type

Journal Articles	11
Reports - Research	11
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Secondary Education	2
Elementary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Longitudinal Study…	1
Program for International…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Toward Sufficient Statistical Power in Algorithmic Bias Assessment: A Test for ABROCA

Peer reviewed
PDF on ERIC

Download full text

Conrad Borchers – International Educational Data Mining Society, 2025

Algorithmic bias is a pressing concern in educational data mining (EDM), as it risks amplifying inequities in learning outcomes. The Area Between ROC Curves (ABROCA) metric is frequently used to measure discrepancies in model performance across demographic groups to quantify overall model fairness. However, its skewed distribution--especially when…

Descriptors: Algorithms, Bias, Statistics, Simulation

Factor Extraction in Exploratory Factor Analysis for Ordinal Indicators: Is Principal Component Analysis the Best Option?

Peer reviewed
PDF on ERIC

Download full text

Tugay Kaçak; Abdullah Faruk Kiliç – International Journal of Assessment Tools in Education, 2025

Researchers continue to choose PCA in scale development and adaptation studies because it is the default setting and overestimates measurement quality. When PCA is utilized in investigations, the explained variance and factor loadings can be exaggerated. PCA, in contrast to the models given in the literature, should be investigated in…

Descriptors: Factor Analysis, Monte Carlo Methods, Mathematical Models, Sample Size

Redefining Item Response Models for Small Samples

Peer reviewed

Direct link

Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025

Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…

Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics

Shrinking Small Sample Problems in Multilevel Structural Equation Modeling via Regularization of the Sample Covariance Matrix

Peer reviewed

Direct link

Julia-Kim Walther; Martin Hecht; Steffen Zitzmann – Structural Equation Modeling: A Multidisciplinary Journal, 2025

Small sample sizes pose a severe threat to convergence and accuracy of between-group level parameter estimates in multilevel structural equation modeling (SEM). However, in certain situations, such as pilot studies or when populations are inherently small, increasing samples sizes is not feasible. As a remedy, we propose a two-stage regularized…

Descriptors: Sample Size, Hierarchical Linear Modeling, Structural Equation Models, Matrices

Using Regularized Methods to Validate Q-Matrix in Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Daoxuan Fu; Chunying Qin; Zhaosheng Luo; Yujun Li; Xiaofeng Yu; Ziyu Ye – Journal of Educational and Behavioral Statistics, 2025

One of the central components of cognitive diagnostic assessment is the Q-matrix, which is an essential loading indicator matrix and is typically constructed by subject matter experts. Nonetheless, to a large extent, the construction of Q-matrix remains a subjective process and might lead to misspecifications. Many researchers have recognized the…

Descriptors: Q Methodology, Matrices, Diagnostic Tests, Cognitive Measurement

Power Analyses for Estimation of Complier Average Causal Effects under Random Encouragement Designs in Education Research: Theory and Guidance

Peer reviewed

Direct link

Peter Z. Schochet – Journal of Educational and Behavioral Statistics, 2025

Random encouragement designs evaluate treatments that aim to increase participation in a program or activity. These randomized controlled trials (RCTs) can also assess the mediated effects of participation itself on longer term outcomes using a complier average causal effect (CACE) estimation framework. This article considers power analysis…

Descriptors: Statistical Analysis, Computation, Causal Models, Research Design

Evaluating Imputation-Based Fit Statistics in Structural Equation Modeling with Ordinal Data: The Mi2S Approach

Peer reviewed

Direct link

Suppanut Sriutaisuk; Yu Liu; Seungwon Chung; Hanjoe Kim; Fei Gu – Educational and Psychological Measurement, 2025

The multiple imputation two-stage (MI2S) approach holds promise for evaluating the model fit of structural equation models for ordinal variables with multiply imputed data. However, previous studies only examined the performance of MI2S-based residual-based test statistics. This study extends previous research by examining the performance of two…

Descriptors: Structural Equation Models, Error of Measurement, Programming Languages, Goodness of Fit

Power Analysis for Moderated Multiple Regression: An Incremental Model-Building Approach Using R

Peer reviewed
PDF on ERIC

Download full text

Ethan C. Brown; Mohammed A. A. Abulela – Practical Assessment, Research & Evaluation, 2025

Moderated multiple regression (MMR) has become a fundamental tool for applied researchers, since many effects are expected to vary based on other variables. However, the inherent complexity of MMR creates formidable challenges for adequately performing power analysis on interaction effects to ensure reliable and replicable research results. Prior…

Descriptors: Statistical Analysis, Multiple Regression Analysis, Models, Programming Languages

Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs

Peer reviewed

Direct link

Hyo Jeong Shin; Christoph König; Frederic Robin; Andreas Frey; Kentaro Yamamoto – Journal of Educational Measurement, 2025

Many international large-scale assessments (ILSAs) have switched to multistage adaptive testing (MST) designs to improve measurement efficiency in measuring the skills of the heterogeneous populations around the world. In this context, previous literature has reported the acceptable level of model parameter recovery under the MST designs when the…

Descriptors: Robustness (Statistics), Item Response Theory, Adaptive Testing, Test Construction

Exploring Number of Response Categories in Factor Analysis: Implications for Sample Size

Peer reviewed
PDF on ERIC

Download full text

Fatih Orçan – International Journal of Assessment Tools in Education, 2025

Factor analysis is a statistical method to explore the relationships among observed variables and identify latent structures. It is crucial in scale development and validity analysis. Key factors affecting the accuracy of factor analysis results include the type of data, sample size, and the number of response categories. While some studies…

Descriptors: Factor Analysis, Factor Structure, Item Response Theory, Sample Size

A Practical Guide to Causal Moderation Analysis for Investigating the Role of Context, Identity, and Culture in Intervention Research

Peer reviewed

Direct link

Nianbo Dong; Keith Herman; Benjamin Kelcey; Sirui Ren; Wendy Reinke; Jessaca Spybrook – Grantee Submission, 2025

Contextual, identity, and cultural factors are not only associated with student outcomes but can also serve to moderate the effects of interventions. However, the conventional analysis of moderation commonly used in school psychology is subject to the selection bias potentially introducing bias into estimated moderator effects. This article…

Descriptors: Causal Models, Statistical Analysis, Context Effect, Intervention

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Direct link

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

Abdullah Faruk Kiliç	1
Andreas Frey	1
Benjamin Kelcey	1
Christoph König	1
Chunying Qin	1
Conrad Borchers	1
Daoxuan Fu	1
Ethan C. Brown	1
Fatih Orçan	1
Fei Gu	1
Frederic Robin	1
Hanjoe Kim	1
Hyo Jeong Shin	1
Jean-Paul Fox	1
Jessaca Spybrook	1
Julia-Kim Walther	1
Keith Herman	1
Kentaro Yamamoto	1
Martin Hecht	1
Mohammed A. A. Abulela	1
Na Shan	1
Nianbo Dong	1
Peter Z. Schochet	1
Ping-Feng Xu	1
Seungwon Chung	1
More ▼