David Arthur; Hua-Hua Chang – Journal of Educational and Behavioral Statistics, 2024
Cognitive diagnosis models (CDMs) are the assessment tools that provide valuable formative feedback about skill mastery at both the individual and population level. Recent work has explored the performance of CDMs with small sample sizes but has focused solely on the estimates of individual profiles. The current research focuses on obtaining…
Descriptors: Algorithms, Models, Computation, Cognitive Measurement
Development of a High-Accuracy and Effective Online Calibration Method in CD-CAT Based on Gini Index
Tan, Qingrong; Cai, Yan; Luo, Fen; Tu, Dongbo – Journal of Educational and Behavioral Statistics, 2023
To improve the calibration accuracy and calibration efficiency of cognitive diagnostic computerized adaptive testing (CD-CAT) for new items and, ultimately, contribute to the widespread application of CD-CAT in practice, the current article proposed a Gini-based online calibration method that can simultaneously calibrate the Q-matrix and item…
Descriptors: Cognitive Tests, Computer Assisted Testing, Adaptive Testing, Accuracy
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022
Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…
Descriptors: Computation, Data Analysis, Educational Testing, Accuracy
Andersson, Björn; Xin, Tao – Journal of Educational and Behavioral Statistics, 2021
The estimation of high-dimensional latent regression item response theory (IRT) models is difficult because of the need to approximate integrals in the likelihood function. Proposed solutions in the literature include using stochastic approximations, adaptive quadrature, and Laplace approximations. We propose using a second-order Laplace…
Descriptors: Item Response Theory, Computation, Regression (Statistics), Statistical Bias
Choe, Edison M.; Kern, Justin L.; Chang, Hua-Hua – Journal of Educational and Behavioral Statistics, 2018
Despite common operationalization, measurement efficiency of computerized adaptive testing should not only be assessed in terms of the number of items administered but also the time it takes to complete the test. To this end, a recent study introduced a novel item selection criterion that maximizes Fisher information per unit of expected response…
Descriptors: Computer Assisted Testing, Reaction Time, Item Response Theory, Test Items
McCoach, D. Betsy; Rifenbark, Graham G.; Newton, Sarah D.; Li, Xiaoran; Kooken, Janice; Yomtov, Dani; Gambino, Anthony J.; Bellara, Aarti – Journal of Educational and Behavioral Statistics, 2018
This study compared five common multilevel software packages via Monte Carlo simulation: HLM 7, Mplus 7.4, R (lme4 V1.1-12), Stata 14.1, and SAS 9.4 to determine how the programs differ in estimation accuracy and speed, as well as convergence, when modeling multiple randomly varying slopes of different magnitudes. Simulated data…
Descriptors: Hierarchical Linear Modeling, Computer Software, Comparative Analysis, Monte Carlo Methods
Bolsinova, Maria; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2016
Conditional independence (CI) between response time and response accuracy is a fundamental assumption of many joint models for time and accuracy used in educational measurement. In this study, posterior predictive checks (PPCs) are proposed for testing this assumption. These PPCs are based on three discrepancy measures reflecting different…
Descriptors: Reaction Time, Accuracy, Statistical Analysis, Robustness (Statistics)
Tutz, Gerhard; Berger, Moritz – Journal of Educational and Behavioral Statistics, 2016
Heterogeneity in response styles can affect the conclusions drawn from rating scale data. In particular, biased estimates can be expected if one ignores a tendency to middle categories or to extreme categories. An adjacent categories model is proposed that simultaneously models the content-related effects and the heterogeneity in response styles.…
Descriptors: Response Style (Tests), Rating Scales, Data Interpretation, Statistical Bias
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational and Behavioral Statistics, 2012
This study demonstrates how the stability of Mantel-Haenszel (MH) DIF (differential item functioning) methods can be improved by integrating information across multiple test administrations using Bayesian updating (BU). The authors conducted a simulation that showed that this approach, which is based on earlier work by Zwick, Thayer, and Lewis,…
Descriptors: Test Bias, Computation, Statistical Analysis, Bayesian Statistics
Wang, Chun; Fan, Zhewen; Chang, Hua-Hua; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2013
The item response times (RTs) collected from computerized testing represent an underutilized type of information about items and examinees. In addition to knowing the examinees' responses to each item, we can investigate the amount of time examinees spend on each item. Current models for RTs mainly focus on parametric models, which have the…
Descriptors: Reaction Time, Computer Assisted Testing, Test Items, Accuracy
Luo, Wen; Kwok, Oi-man – Journal of Educational and Behavioral Statistics, 2012
In longitudinal multilevel studies, especially in educational settings, it is fairly common that participants change their group memberships over time (e.g., students switch to different schools). Participant's mobility changes the multilevel data structure from a purely hierarchical structure with repeated measures nested within individuals and…
Descriptors: Mobility, Statistical Analysis, Models, Longitudinal Studies
Zhang, Jinming – Journal of Educational and Behavioral Statistics, 2012
The impact of uncertainty about item parameters on test information functions is investigated. The information function of a test is one of the most important tools in item response theory (IRT). Inaccuracy in the estimation of test information can have substantial consequences on data analyses based on IRT. In this article, the major part (called…
Descriptors: Item Response Theory, Tests, Accuracy, Data Analysis

