Showing all 10 results
Peer reviewed
San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022
The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms, with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design, rendering…
Descriptors: Tests, Scores, Statistical Analysis, Models
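For readers unfamiliar with the design, the sketch below illustrates chained linear equating under a NEAT setup: form X is linked to the anchor within one group, and the anchor to form Y within the other. All data and the linear method are hypothetical stand-ins, not the authors' treatment of the missingness.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical NEAT data: group P takes form X + anchor A, group Q takes form Y + anchor A.
x_p = rng.normal(50, 10, 2000)             # form X scores, group P
a_p = 0.5 * x_p + rng.normal(0, 4, 2000)   # anchor scores, group P
y_q = rng.normal(55, 11, 2000)             # form Y scores, group Q
a_q = 0.5 * y_q + rng.normal(0, 4, 2000)   # anchor scores, group Q

def linear_link(x, mu_from, sd_from, mu_to, sd_to):
    """Linear equating: match means and SDs of the two score scales."""
    return mu_to + (sd_to / sd_from) * (x - mu_from)

# Chain X -> A (within group P), then A -> Y (within group Q).
a_equiv = linear_link(x_p, x_p.mean(), x_p.std(), a_p.mean(), a_p.std())
y_equiv = linear_link(a_equiv, a_q.mean(), a_q.std(), y_q.mean(), y_q.std())

print(y_equiv[:5])  # form-Y equivalents of the first five form-X scores
```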
Peer reviewed
Liang, Qianru; de la Torre, Jimmy; Law, Nancy – Journal of Educational and Behavioral Statistics, 2023
To expand the use of cognitive diagnosis models (CDMs) to longitudinal assessments, this study proposes a bias-corrected three-step estimation approach for latent transition CDMs with covariates by integrating a general CDM and a latent transition model. The proposed method can be used to assess changes in attribute mastery status and attribute…
Descriptors: Cognitive Measurement, Models, Statistical Bias, Computation
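As context for latent transition CDMs, here is a minimal sketch of a single attribute's mastery status evolving under a hypothetical transition matrix; the article's bias-corrected three-step estimator is not reproduced.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical transition matrix for one attribute over two time points:
# rows = status at time 1 (0 = non-mastery, 1 = mastery), columns = status at time 2.
T = np.array([[0.7, 0.3],    # non-masters: 30% transition to mastery
              [0.1, 0.9]])   # masters: 10% regress

state_t1 = rng.binomial(1, 0.4, size=5000)   # 40% initial mastery
probs_t2 = T[state_t1, 1]                    # P(mastery at t2 | state at t1)
state_t2 = rng.binomial(1, probs_t2)

print("mastery rate t1:", state_t1.mean(), "t2:", state_t2.mean())
```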
Peer reviewed
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. This definition contrasts with Lord's foundational paper, which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
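The score-comparability notion corresponds to classical equipercentile equating, sketched below with hypothetical score samples: a form-X score is mapped to the form-Y score with the same percentile rank.

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(50, 10, 5000)   # hypothetical form-X scores
y = rng.normal(55, 12, 5000)   # hypothetical form-Y scores

def equipercentile(score, x_sample, y_sample):
    """Map a form-X score to the form-Y score with the same percentile rank."""
    p = (x_sample <= score).mean()   # percentile rank of `score` on X
    return np.quantile(y_sample, p)  # form-Y score at that percentile

print(equipercentile(60.0, x, y))
```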
Sinharay, Sandip; Johnson, Matthew S. – Journal of Educational and Behavioral Statistics, 2021
Score differencing is one of the six categories of statistical methods used to detect test fraud (Wollack & Schoenig, 2018) and involves the testing of the null hypothesis that the performance of an examinee is similar over two item sets versus the alternative hypothesis that the performance is better on one of the item sets. We suggest, to…
Descriptors: Probability, Bayesian Statistics, Cheating, Statistical Analysis
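A minimal frequentist sketch of the null hypothesis behind score differencing, using a simple two-proportion z-test on hypothetical item sets; the Bayesian approach the article suggests is not shown here.

```python
import numpy as np
from scipy import stats

# Hypothetical examinee: responses on two item sets (1 = correct).
set1 = np.array([1, 0, 1, 1, 0, 1, 1, 1, 0, 1])   # e.g., operational items
set2 = np.array([1, 1, 1, 1, 1, 1, 1, 1, 1, 0])   # e.g., disputed items

p1, p2 = set1.mean(), set2.mean()
n1, n2 = len(set1), len(set2)
p_pool = (set1.sum() + set2.sum()) / (n1 + n2)
se = np.sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
z = (p2 - p1) / se                   # H0: equal performance on both sets
p_value = 1 - stats.norm.cdf(z)      # one-sided: better on set 2

print(f"z = {z:.2f}, one-sided p = {p_value:.3f}")
```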
Peer reviewed
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
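A sketch of the first step, assuming hypothetical covariates: the propensity score is estimated by logistic regression and used to stratify the two test groups, so that equating can then be carried out within strata.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(3)

# Hypothetical covariates (e.g., grades, prior scores) for two test groups.
n = 1000
covs = np.column_stack([rng.normal(0, 1, 2 * n), rng.normal(0, 1, 2 * n)])
group = np.repeat([0, 1], n)          # 0 = form X group, 1 = form Y group

# Propensity score: P(group = 1 | covariates), a proxy for latent ability.
model = LogisticRegression().fit(covs, group)
ps = model.predict_proba(covs)[:, 1]

# Stratify on the propensity score; equating is done within each stratum.
strata = np.digitize(ps, np.quantile(ps, [0.2, 0.4, 0.6, 0.8]))
print(np.bincount(strata))
```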
Peer reviewed
Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024
A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…
Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics
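For context, the conditional-likelihood machinery that CML estimation rests on can be sketched via elementary symmetric functions; the item difficulties and response pattern below are hypothetical, and the article's four hypothesis tests are not reproduced.

```python
import numpy as np

def esf(eps):
    """Elementary symmetric functions gamma_0..gamma_k of eps_1..eps_k."""
    gamma = np.zeros(len(eps) + 1)
    gamma[0] = 1.0
    for e in eps:
        gamma[1:] = gamma[1:] + e * gamma[:-1]   # recursive update
    return gamma

beta = np.array([-1.0, 0.0, 0.5, 1.2])   # hypothetical Rasch item difficulties
eps = np.exp(-beta)

x = np.array([1, 1, 0, 0])   # one response pattern, raw score r = 2
r = x.sum()
# Conditional probability of the pattern given raw score r (ability cancels out).
p_cond = np.prod(eps ** x) / esf(eps)[r]
print(f"P(pattern | score {r}) = {p_cond:.4f}")
```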
Peer reviewed
Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021
Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…
Descriptors: Test Items, Responses, Reaction Time, Models
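A minimal sketch of the within-state distributional assumption: a two-component lognormal mixture fit to hypothetical response times by EM on the log scale (normalizing constants cancel in the posterior ratio).

```python
import numpy as np

rng = np.random.default_rng(4)

# Hypothetical response times from two latent states (fast guessing vs. solution behavior).
rt = np.concatenate([rng.lognormal(0.5, 0.3, 300), rng.lognormal(1.5, 0.4, 700)])
z = np.log(rt)   # lognormal RTs -> Gaussian on the log scale

# EM for a two-component Gaussian mixture on log response times.
pi, mu, sd = 0.5, np.array([0.0, 2.0]), np.array([1.0, 1.0])
for _ in range(50):
    # E-step: posterior probability of state 1 for each observation.
    d0 = (1 - pi) * np.exp(-0.5 * ((z - mu[0]) / sd[0]) ** 2) / sd[0]
    d1 = pi * np.exp(-0.5 * ((z - mu[1]) / sd[1]) ** 2) / sd[1]
    w = d1 / (d0 + d1)
    # M-step: update mixing weight, means, and SDs.
    pi = w.mean()
    mu = np.array([np.average(z, weights=1 - w), np.average(z, weights=w)])
    sd = np.array([np.sqrt(np.average((z - mu[0]) ** 2, weights=1 - w)),
                   np.sqrt(np.average((z - mu[1]) ** 2, weights=w))])

print("state means on log scale:", mu, "mixing weight:", pi)
```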
Peer reviewed
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences
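One common model for such rankings is Thurstonian pairwise comparison, sketched below with hypothetical latent utilities (Thurstone Case V); the forced-choice model used in the article may differ.

```python
import numpy as np
from scipy.stats import norm

# Hypothetical latent utilities of three statements a rater must rank.
mu = {"statement_A": 1.2, "statement_B": 0.4, "statement_C": -0.3}

def p_prefer(mu_i, mu_j, sd=1.0):
    """Thurstone Case V: P(i preferred to j) with independent normal utilities."""
    return norm.cdf((mu_i - mu_j) / (np.sqrt(2) * sd))

print(f"P(A > B) = {p_prefer(mu['statement_A'], mu['statement_B']):.3f}")
print(f"P(B > C) = {p_prefer(mu['statement_B'], mu['statement_C']):.3f}")
```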
Peer reviewed
Lyu, Weicong; Kim, Jee-Seon; Suk, Youmi – Journal of Educational and Behavioral Statistics, 2023
This article presents a latent class model for multilevel data to identify latent subgroups and estimate heterogeneous treatment effects. Unlike sequential approaches that partition data first and then estimate average treatment effects (ATEs) within classes, we employ a Bayesian procedure to jointly estimate mixing probability, selection, and…
Descriptors: Hierarchical Linear Modeling, Bayesian Statistics, Causal Models, Statistical Inference
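A simulation sketch, with hypothetical classes and effects, of why class-specific estimates matter: pooling over latent classes can mask opposite-signed treatment effects. The article's joint Bayesian estimator is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(5)

# Hypothetical two latent classes with different treatment effects.
n = 4000
cls = rng.binomial(1, 0.4, n)              # latent class membership (unobserved)
treat = rng.binomial(1, 0.5, n)            # randomized treatment
effect = np.where(cls == 1, 3.0, -1.0)     # class-specific treatment effects
y = 10 + effect * treat + rng.normal(0, 2, n)

pooled_ate = y[treat == 1].mean() - y[treat == 0].mean()
ate_c0 = y[(treat == 1) & (cls == 0)].mean() - y[(treat == 0) & (cls == 0)].mean()
ate_c1 = y[(treat == 1) & (cls == 1)].mean() - y[(treat == 0) & (cls == 1)].mean()
print(f"pooled ATE: {pooled_ate:.2f}; class 0: {ate_c0:.2f}; class 1: {ate_c1:.2f}")
```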
Peer reviewed
Li, Xiao; Xu, Hanchen; Zhang, Jinming; Chang, Hua-hua – Journal of Educational and Behavioral Statistics, 2023
The adaptive learning problem concerns how to create an individualized learning plan (also referred to as a learning policy) that chooses the most appropriate learning materials based on a learner's latent traits. In this article, we study an important yet less-addressed adaptive learning problem, one that assumes continuous latent traits…
Descriptors: Learning Processes, Models, Algorithms, Individualized Instruction
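A toy learning policy under a continuous latent trait, assuming hypothetical materials and an Elo-style ability update; this illustrates the problem setting, not the authors' algorithm.

```python
import numpy as np

rng = np.random.default_rng(6)

# Hypothetical materials with difficulties on the same scale as the latent trait.
difficulties = np.linspace(-2, 2, 9)
theta_hat, lr = 0.0, 0.3   # current ability estimate and a step size

for step in range(10):
    # Greedy policy: choose the material whose difficulty is closest to theta_hat.
    m = np.argmin(np.abs(difficulties - theta_hat))
    p_correct = 1 / (1 + np.exp(-(0.8 - difficulties[m])))   # true theta = 0.8
    outcome = rng.binomial(1, p_correct)
    # Elo-style update of the ability estimate from the observed outcome.
    expected = 1 / (1 + np.exp(-(theta_hat - difficulties[m])))
    theta_hat += lr * (outcome - expected)

print(f"final ability estimate: {theta_hat:.2f}")
```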