ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	3
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	12

Descriptor

Accuracy	12
Computation	12
Item Response Theory	5
Models	5
Sample Size	5
Test Items	5
Statistical Analysis	4
Computer Assisted Testing	3
Data Analysis	3
Error of Measurement	3
Monte Carlo Methods	3
Reaction Time	3
Statistical Bias	3
Algorithms	2
Bayesian Statistics	2
Comparative Analysis	2
Computer Software	2
Efficiency	2
Regression (Statistics)	2
Simulation	2
Statistical Inference	2
Tests	2
Adaptive Testing	1
Arithmetic	1
Artificial Intelligence	1
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	12
Reports - Research	8
Reports - Evaluative	3
Reports - Descriptive	1

Education Level

Elementary Education	1
Grade 4	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Longitudinal…	1
National Assessment of…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

DINA-BAG: A Bagging Algorithm for DINA Model Parameter Estimation in Small Samples

Peer reviewed

Direct link

David Arthur; Hua-Hua Chang – Journal of Educational and Behavioral Statistics, 2024

Cognitive diagnosis models (CDMs) are the assessment tools that provide valuable formative feedback about skill mastery at both the individual and population level. Recent work has explored the performance of CDMs with small sample sizes but has focused solely on the estimates of individual profiles. The current research focuses on obtaining…

Descriptors: Algorithms, Models, Computation, Cognitive Measurement

Development of a High-Accuracy and Effective Online Calibration Method in CD-CAT Based on Gini Index

Peer reviewed

Direct link

Tan, Qingrong; Cai, Yan; Luo, Fen; Tu, Dongbo – Journal of Educational and Behavioral Statistics, 2023

To improve the calibration accuracy and calibration efficiency of cognitive diagnostic computerized adaptive testing (CD-CAT) for new items and, ultimately, contribute to the widespread application of CD-CAT in practice, the current article proposed a Gini-based online calibration method that can simultaneously calibrate the Q-matrix and item…

Descriptors: Cognitive Tests, Computer Assisted Testing, Adaptive Testing, Accuracy

Reporting Proficiency Levels for Examinees with Incomplete Data

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022

Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…

Descriptors: Computation, Data Analysis, Educational Testing, Accuracy

Estimation of Latent Regression Item Response Theory Models Using a Second-Order Laplace Approximation

Peer reviewed

Direct link

Andersson, Björn; Xin, Tao – Journal of Educational and Behavioral Statistics, 2021

The estimation of high-dimensional latent regression item response theory (IRT) models is difficult because of the need to approximate integrals in the likelihood function. Proposed solutions in the literature include using stochastic approximations, adaptive quadrature, and Laplace approximations. We propose using a second-order Laplace…

Descriptors: Item Response Theory, Computation, Regression (Statistics), Statistical Bias

Optimizing the Use of Response Times for Item Selection in Computerized Adaptive Testing

Peer reviewed

Direct link

Choe, Edison M.; Kern, Justin L.; Chang, Hua-Hua – Journal of Educational and Behavioral Statistics, 2018

Despite common operationalization, measurement efficiency of computerized adaptive testing should not only be assessed in terms of the number of items administered but also the time it takes to complete the test. To this end, a recent study introduced a novel item selection criterion that maximizes Fisher information per unit of expected response…

Descriptors: Computer Assisted Testing, Reaction Time, Item Response Theory, Test Items

Does the Package Matter? A Comparison of Five Common Multilevel Modeling Software Packages

Peer reviewed

Direct link

McCoach, D. Betsy; Rifenbark, Graham G.; Newton, Sarah D.; Li, Xiaoran; Kooken, Janice; Yomtov, Dani; Gambino, Anthony J.; Bellara, Aarti – Journal of Educational and Behavioral Statistics, 2018

This study compared five common multilevel software packages via Monte Carlo simulation: HLM 7, M"plus" 7.4, R (lme4 V1.1-12), Stata 14.1, and SAS 9.4 to determine how the programs differ in estimation accuracy and speed, as well as convergence, when modeling multiple randomly varying slopes of different magnitudes. Simulated data…

Descriptors: Hierarchical Linear Modeling, Computer Software, Comparative Analysis, Monte Carlo Methods

Posterior Predictive Checks for Conditional Independence between Response Time and Accuracy

Peer reviewed

Direct link

Bolsinova, Maria; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2016

Conditional independence (CI) between response time and response accuracy is a fundamental assumption of many joint models for time and accuracy used in educational measurement. In this study, posterior predictive checks (PPCs) are proposed for testing this assumption. These PPCs are based on three discrepancy measures reflecting different…

Descriptors: Reaction Time, Accuracy, Statistical Analysis, Robustness (Statistics)

Response Styles in Rating Scales: Simultaneous Modeling of Content-Related Effects and the Tendency to Middle or Extreme Categories

Peer reviewed

Direct link

Tutz, Gerhard; Berger, Moritz – Journal of Educational and Behavioral Statistics, 2016

Heterogeneity in response styles can affect the conclusions drawn from rating scale data. In particular, biased estimates can be expected if one ignores a tendency to middle categories or to extreme categories. An adjacent categories model is proposed that simultaneously models the content-related effects and the heterogeneity in response styles.…

Descriptors: Response Style (Tests), Rating Scales, Data Interpretation, Statistical Bias

Improving Mantel-Haenszel DIF Estimation through Bayesian Updating

Peer reviewed

Direct link

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational and Behavioral Statistics, 2012

This study demonstrates how the stability of Mantel-Haenszel (MH) DIF (differential item functioning) methods can be improved by integrating information across multiple test administrations using Bayesian updating (BU). The authors conducted a simulation that showed that this approach, which is based on earlier work by Zwick, Thayer, and Lewis,…

Descriptors: Test Bias, Computation, Statistical Analysis, Bayesian Statistics

A Semiparametric Model for Jointly Analyzing Response Times and Accuracy in Computerized Testing

Peer reviewed

Direct link

Wang, Chun; Fan, Zhewen; Chang, Hua-Hua; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2013

The item response times (RTs) collected from computerized testing represent an underutilized type of information about items and examinees. In addition to knowing the examinees' responses to each item, we can investigate the amount of time examinees spend on each item. Current models for RTs mainly focus on parametric models, which have the…

Descriptors: Reaction Time, Computer Assisted Testing, Test Items, Accuracy

The Consequences of Ignoring Individuals' Mobility in Multilevel Growth Models: A Monte Carlo Study

Peer reviewed

Direct link

Luo, Wen; Kwok, Oi-man – Journal of Educational and Behavioral Statistics, 2012

In longitudinal multilevel studies, especially in educational settings, it is fairly common that participants change their group memberships over time (e.g., students switch to different schools). Participant's mobility changes the multilevel data structure from a purely hierarchical structure with repeated measures nested within individuals and…

Descriptors: Mobility, Statistical Analysis, Models, Longitudinal Studies

The Impact of Variability of Item Parameter Estimators on Test Information Function

Peer reviewed

Direct link

Zhang, Jinming – Journal of Educational and Behavioral Statistics, 2012

The impact of uncertainty about item parameters on test information functions is investigated. The information function of a test is one of the most important tools in item response theory (IRT). Inaccuracy in the estimation of test information can have substantial consequences on data analyses based on IRT. In this article, the major part (called…

Descriptors: Item Response Theory, Tests, Accuracy, Data Analysis

Chang, Hua-Hua	2
Andersson, Björn	1
Bellara, Aarti	1
Berger, Moritz	1
Bolsinova, Maria	1
Cai, Yan	1
Choe, Edison M.	1
David Arthur	1
Douglas, Jeffrey A.	1
Fan, Zhewen	1
Gambino, Anthony J.	1
Hua-Hua Chang	1
Isham, Steven	1
Kern, Justin L.	1
Kooken, Janice	1
Kwok, Oi-man	1
Li, Xiaoran	1
Luo, Fen	1
Luo, Wen	1
McCoach, D. Betsy	1
Newton, Sarah D.	1
Rifenbark, Graham G.	1
Sinharay, Sandip	1
Tan, Qingrong	1
Tijmstra, Jesper	1
More ▼