Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
Descriptor
| Computer Software | 3 |
| Comparative Analysis | 2 |
| Evaluation Methods | 2 |
| Accuracy | 1 |
| Algorithms | 1 |
| Artificial Intelligence | 1 |
| Bayesian Statistics | 1 |
| Diagnostic Tests | 1 |
| Elementary School Students | 1 |
| Essays | 1 |
| Evaluators | 1 |
| More ▼ | |
Source
| Journal of Educational and… | 3 |
Author
| Jackie Eunjung Relyea | 1 |
| James S. Kim | 1 |
| Lei Guo | 1 |
| Luke Miratrix | 1 |
| Paciorek, Christopher J. | 1 |
| Paganin, Sally | 1 |
| Rabe-Hesketh, Sophia | 1 |
| Reagan Mozer | 1 |
| Rodríguez, Abel | 1 |
| Wehrhahn, Claudia | 1 |
| Wenjie Zhou | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 3 |
| Reports - Research | 3 |
Education Level
| Early Childhood Education | 1 |
| Elementary Education | 1 |
| Grade 1 | 1 |
| Grade 2 | 1 |
| Primary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Paganin, Sally; Paciorek, Christopher J.; Wehrhahn, Claudia; Rodríguez, Abel; Rabe-Hesketh, Sophia; de Valpine, Perry – Journal of Educational and Behavioral Statistics, 2023
Item response theory (IRT) models typically rely on a normality assumption for subject-specific latent traits, which is often unrealistic in practice. Semiparametric extensions based on Dirichlet process mixtures (DPMs) offer a more flexible representation of the unknown distribution of the latent trait. However, the use of such models in the IRT…
Descriptors: Bayesian Statistics, Item Response Theory, Guidance, Evaluation Methods
Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024
The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…
Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software
Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…
Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis

Peer reviewed
Direct link
