ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	8
Since 2007 (last 20 years)	10

Descriptor

Models	10
Item Response Theory	4
Psychometrics	4
Data Analysis	3
Diagnostic Tests	3
Classification	2
Computation	2
Data	2
Data Collection	2
Data Interpretation	2
Educational Assessment	2
Glossaries	2
Goodness of Fit	2
Measurement	2
Simulation	2
Accuracy	1
Artificial Intelligence	1
Bayesian Statistics	1
Case Studies	1
Check Lists	1
Coding	1
Cognitive Measurement	1
Comparative Analysis	1
Computer Assisted Testing	1
Computer Simulation	1

Source

Educational Measurement:…

Publication Type

Journal Articles	10
Reports - Descriptive	6
Reports - Evaluative	2
Reports - Research	2

Showing all 10 results

Applications and Modeling of Keystroke Logs in Writing Assessments

Peer reviewed

Zhang, Mo; Deane, Paul; Hoang, Andrew; Guo, Hongwen; Li, Chen – Educational Measurement: Issues and Practice, 2025

In this paper, we describe two empirical studies that demonstrate the application and modeling of keystroke logs in writing assessments. We illustrate two different approaches of modeling differences in writing processes: analysis of mean differences in handcrafted theory-driven features and use of large language models to identify stable personal…

Descriptors: Writing Tests, Computer Assisted Testing, Keyboarding (Data Entry), Writing Processes

Digital Module 04: Diagnostic Measurement: Modeling Checklists for Practitioners https://ncme.elevate.commpartners.com

Peer reviewed

Carragher, Natacha; Templin, Jonathan; Jones, Phillip; Shulruf, Boaz; Velan, Gary – Educational Measurement: Issues and Practice, 2019

In this ITEMS module, we provide a didactic overview of the specification, estimation, evaluation, and interpretation steps for diagnostic measurement/classification models (DCMs), which are a promising psychometric modeling approach. These models can provide detailed skill- or attribute-specific feedback to respondents along multiple latent…

Descriptors: Measurement, Classification, Models, Check Lists

Interpreting Probabilistic Classifications from Diagnostic Psychometric Models

Peer reviewed

Bradshaw, Laine; Levy, Roy – Educational Measurement: Issues and Practice, 2019

Although much research has been conducted on the psychometric properties of cognitive diagnostic models, they are only recently being used in operational settings to provide results to examinees and other stakeholders. Using this newer class of models in practice comes with a fresh challenge for diagnostic assessment developers: effectively…

Descriptors: Data Interpretation, Probability, Classification, Diagnostic Tests

Digital ITEMS Module 1: Reliability in Classical Test Theory

Peer reviewed

Lewis, Charlie; Chajewski, Michael; Rupp, André A. – Educational Measurement: Issues and Practice, 2018

In this ITEMS module, we provide a two-part introduction to the topic of reliability from the perspective of "classical test theory" (CTT). In the first part, which is directed primarily at beginning learners, we review and build on the content presented in the original didactic ITEMS article by Traub and Rowley (1991). Specifically, we…

Descriptors: Test Reliability, Test Theory, Computation, Data Collection

A Technical Note on IRT Simulation Studies: Dealing with Truth, Estimates, Observed Data, and Residuals

Peer reviewed

Luecht, Richard; Ackerman, Terry A. – Educational Measurement: Issues and Practice, 2018

Simulation studies are extremely common in the item response theory (IRT) research literature. This article presents a didactic discussion of "truth" and "error" in IRT-based simulation studies. We ultimately recommend that future research focus less on the simple recovery of parameters from a convenient generating IRT model,…

Descriptors: Item Response Theory, Simulation, Ethics, Error of Measurement

Digital Module 16: Longitudinal Data Analysis

Peer reviewed

Harring, Jeffrey R.; Johnson, Tessa L. – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jeffrey Harring and Ms. Tessa Johnson introduce the linear mixed effects (LME) model as a flexible general framework for simultaneously modeling continuous repeated measures data with a scientifically defensible function that adequately summarizes both individual change and the average response. The module…

Descriptors: Educational Assessment, Data Analysis, Longitudinal Studies, Case Studies

Toward Assessment in the Service of Learning

Peer reviewed

Gordon, Edmund W. – Educational Measurement: Issues and Practice, 2020

Drawing upon his experience, more than 60 years ago, as a psychometric support person to a very special teacher of brain damaged children, the author of this article reflects on the productive use of educational assessments and data from them to educate - assessment in the service of learning. Findings from the Gordon Commission on the Future of…

Descriptors: Psychometrics, Student Evaluation, Special Education Teachers, Educational Assessment

Digital Module 06: Bayesian Psychometrics-Posterior Predictive Model Checking https://ncme.elevate.commpartners.com

Peer reviewed

Ames, Allison; Myers, Aaron – Educational Measurement: Issues and Practice, 2019

Drawing valid inferences from modern measurement models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. As Bayesian estimation is becoming more common, understanding the Bayesian approaches for evaluating model-data fit models…

Descriptors: Bayesian Statistics, Psychometrics, Models, Predictive Measurement

Conducting Simulation Studies in Psychometrics

Peer reviewed

Feinberg, Richard A.; Rubright, Jonathan D. – Educational Measurement: Issues and Practice, 2016

Simulation studies are fundamental to psychometric discourse and play a crucial role in operational and academic research. Yet, resources for psychometricians interested in conducting simulations are scarce. This Instructional Topics in Educational Measurement Series (ITEMS) module is meant to address this deficiency by providing a comprehensive…

Descriptors: Simulation, Psychometrics, Vocabulary, Research Design

How Often Is the Misfit of Item Response Theory Models Practically Significant?

Peer reviewed

Sinharay, Sandip; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2014

Standard 3.9 of the Standards for Educational and Psychological Testing (1999) demands evidence of model fit when item response theory (IRT) models are applied to data from tests. Hambleton and Han (2005) and Sinharay (2005) recommended the assessment of practical significance of misfit of IRT models, but…

Descriptors: Item Response Theory, Goodness of Fit, Models, Tests
