ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	19

Descriptor

Foreign Countries	20
Statistical Analysis	20
Models	12
Computation	10
Secondary School Students	6
Achievement Tests	5
International Assessment	5
Simulation	5
Bayesian Statistics	4
Comparative Analysis	4
Item Response Theory	4
Markov Processes	4
Test Items	4
Educational Research	3
Hierarchical Linear Modeling	3
Monte Carlo Methods	3
Probability	3
Regression (Statistics)	3
Test Bias	3
Accuracy	2
Cheating	2
College Freshmen	2
College Students	2
Correlation	2
Exit Examinations	2
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	20
Reports - Research	13
Reports - Descriptive	6
Reports - Evaluative	2

Education Level

Secondary Education	10
Higher Education	5
Postsecondary Education	4
Elementary Education	3
Grade 5	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Location

Canada	3
Colombia	2
Germany	2
Netherlands	2
United Kingdom (England)	2
Austria (Vienna)	1
Brazil	1
Netherlands (Amsterdam)	1
Puerto Rico	1
South Korea	1
Sweden	1
United Kingdom (London)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	5
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches

Peer reviewed

Direct link

Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022

One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…

Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis

Testing the Within-State Distribution in Mixture Models for Responses and Response Times

Peer reviewed

Direct link

Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021

Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…

Descriptors: Test Items, Responses, Reaction Time, Models

Kernel Equating Using Propensity Scores for Nonequivalent Groups

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2019

When equating two test forms, the equated scores will be biased if the test groups differ in ability. To adjust for the ability imbalance between nonequivalent groups, a set of common items is often used. When no common items are available, it has been suggested to use covariates correlated with the test scores instead. In this article, we reduce…

Descriptors: Equated Scores, Test Items, Probability, College Entrance Examinations

Hybridizing Machine Learning Methods and Finite Mixture Models for Estimating Heterogeneous Treatment Effects in Latent Classes

Peer reviewed

Direct link

Suk, Youmi; Kim, Jee-Seon; Kang, Hyunseung – Journal of Educational and Behavioral Statistics, 2021

There has been increasing interest in exploring heterogeneous treatment effects using machine learning (ML) methods such as causal forests, Bayesian additive regression trees, and targeted maximum likelihood estimation. However, there is little work on applying these methods to estimate treatment effects in latent classes defined by…

Descriptors: Artificial Intelligence, Statistical Analysis, Statistical Inference, Classification

Limitless Regression Discontinuity

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sales, Adam C.; Hansen, Ben B. – Journal of Educational and Behavioral Statistics, 2020

Conventionally, regression discontinuity analysis contrasts a univariate regression's limits as its independent variable, "R," approaches a cut point, "c," from either side. Alternative methods target the average treatment effect in a small region around "c," at the cost of an assumption that treatment assignment,…

Descriptors: Regression (Statistics), Computation, Statistical Inference, Robustness (Statistics)

Statistical Equivalence Testing Approaches for Mantel-Haenszel DIF Analysis

Peer reviewed

Direct link

Casabianca, Jodi M.; Lewis, Charles – Journal of Educational and Behavioral Statistics, 2018

The null hypothesis test used in differential item functioning (DIF) detection tests for a subgroup difference in item-level performance--if the null hypothesis of "no DIF" is rejected, the item is flagged for DIF. Conversely, an item is kept in the test form if there is insufficient evidence of DIF. We present frequentist and empirical…

Descriptors: Test Bias, Hypothesis Testing, Bayesian Statistics, Statistical Analysis

Multiple Imputation of Missing Data at Level 2: A Comparison of Fully Conditional and Joint Modeling in Multilevel Designs

Peer reviewed

Direct link

Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2018

Multiple imputation (MI) can be used to address missing data at Level 2 in multilevel research. In this article, we compare joint modeling (JM) and the fully conditional specification (FCS) of MI as well as different strategies for including auxiliary variables at Level 1 using either their manifest or their latent cluster means. We show with…

Descriptors: Statistical Analysis, Data, Comparative Analysis, Hierarchical Linear Modeling

Avoiding Bias When Estimating the Consistency and Stability of Value-Added School Effects

Peer reviewed

Direct link

Leckie, George – Journal of Educational and Behavioral Statistics, 2018

The traditional approach to estimating the consistency of school effects across subject areas and the stability of school effects across time is to fit separate value-added multilevel models to each subject or cohort and to correlate the resulting empirical Bayes predictions. We show that this gives biased correlations and these biases cannot be…

Descriptors: Value Added Models, Reliability, Statistical Bias, Computation

On the Optimality of Answer-Copying Indices: Theory and Practice

Peer reviewed

Direct link

Romero, Mauricio; Riascos, Álvaro; Jara, Diego – Journal of Educational and Behavioral Statistics, 2015

Multiple-choice exams are frequently used as an efficient and objective method to assess learning, but they are more vulnerable to answer copying than tests based on open questions. Several statistical tests (known as indices in the literature) have been proposed to detect cheating; however, to the best of our knowledge, they all lack mathematical…

Descriptors: Cheating, Multiple Choice Tests, Statistical Analysis, Models

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Direct link

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

Posterior Predictive Checks for Conditional Independence between Response Time and Accuracy

Peer reviewed

Direct link

Bolsinova, Maria; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2016

Conditional independence (CI) between response time and response accuracy is a fundamental assumption of many joint models for time and accuracy used in educational measurement. In this study, posterior predictive checks (PPCs) are proposed for testing this assumption. These PPCs are based on three discrepancy measures reflecting different…

Descriptors: Reaction Time, Accuracy, Statistical Analysis, Robustness (Statistics)

Multiple Imputation of Multilevel Missing Data-Rigor versus Simplicity

Peer reviewed

Direct link

Drechsler, Jörg – Journal of Educational and Behavioral Statistics, 2015

Multiple imputation is widely accepted as the method of choice to address item-nonresponse in surveys. However, research on imputation strategies for the hierarchical structures that are typically found in the data in educational contexts is still limited. While a multilevel imputation model should be preferred from a theoretical point of view if…

Descriptors: Hierarchical Linear Modeling, Statistical Analysis, Educational Research, Statistical Bias

Multilevel Modeling of Social Segregation

Peer reviewed

Direct link

Leckie, George; Pillinger, Rebecca; Jones, Kelvyn; Goldstein, Harvey – Journal of Educational and Behavioral Statistics, 2012

The traditional approach to measuring segregation is based upon descriptive, non-model-based indices. A recently proposed alternative is multilevel modeling. The authors further develop the argument for a multilevel modeling approach by first describing and expanding upon its notable advantages, which include an ability to model segregation at a…

Descriptors: Statistical Analysis, Models, Simulation, Measurement Techniques

A Didactic Presentation of Snijders's "l[subscript z]*" Index of Person Fit with Emphasis on Response Model Selection and Ability Estimation

Peer reviewed

Direct link

Magis, David; Raiche, Gilles; Beland, Sebastien – Journal of Educational and Behavioral Statistics, 2012

This paper focuses on two likelihood-based indices of person fit, the index "l[subscript z]" and the Snijders's modified index "l[subscript z]*". The first one is commonly used in practical assessment of person fit, although its asymptotic standard normal distribution is not valid when true abilities are replaced by sample…

Descriptors: Goodness of Fit, Item Response Theory, Computation, Ability

Accounting for Individual Differences in Bradley-Terry Models by Means of Recursive Partitioning

Peer reviewed

Direct link

Strobl, Carolin; Wickelmaier, Florian; Zeileis, Achim – Journal of Educational and Behavioral Statistics, 2011

The preference scaling of a group of subjects may not be homogeneous, but different groups of subjects with certain characteristics may show different preference scalings, each of which can be derived from paired comparisons by means of the Bradley-Terry model. Usually, either different models are fit in predefined subsets of the sample or the…

Descriptors: Individual Differences, Scaling, Statistical Analysis, Models

Previous Page | Next Page »

Pages: 1 | 2

Goldstein, Harvey	3
Leckie, George	2
Lüdtke, Oliver	2
Robitzsch, Alexander	2
Beland, Sebastien	1
Bolsinova, Maria	1
Bonnet, Gerard	1
Browne, William	1
Casabianca, Jodi M.	1
Cepeda-Cuervo, Edilberto	1
Drechsler, Jörg	1
Fox, Jean-Paul	1
Gamerman, Dani	1
Goncalves, Flavio B.	1
Grund, Simon	1
Hansen, Ben B.	1
Jara, Diego	1
Jones, Kelvyn	1
Kang, Hyunseung	1
Kim, Jee-Seon	1
Kuijpers, Renske E.	1
Lewis, Charles	1
Magis, David	1
Molenaar, Dylan	1
Núñez-Antón, Vicente	1
More ▼