ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	18
Since 2017 (last 10 years)	50
Since 2007 (last 20 years)	110

Descriptor

Models	136
Test Items	136
Item Response Theory	81
Bayesian Statistics	47
Regression (Statistics)	37
Difficulty Level	35
Maximum Likelihood Statistics	33
Computation	32
Simulation	32
Statistical Analysis	27
Foreign Countries	25
Monte Carlo Methods	24
Test Bias	23
Accuracy	20
Correlation	20
Scores	20
Comparative Analysis	19
Psychometrics	18
Goodness of Fit	17
Item Analysis	16
Markov Processes	15
Sample Size	14
Scoring	14
Test Construction	14
Achievement Tests	13
More ▼

Publication Type

Journal Articles	108
Reports - Research	87
Reports - Evaluative	25
Reports - Descriptive	15
Speeches/Meeting Papers	10
Dissertations/Theses -…	6
Collected Works - Proceedings	2
Information Analyses	1
Numerical/Quantitative Data	1
Opinion Papers	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	14
Elementary Secondary Education	10
Secondary Education	10
Elementary Education	8
Postsecondary Education	7
Middle Schools	5
Grade 4	4
Grade 8	3
Junior High Schools	3
Grade 3	2
Intermediate Grades	2
Grade 12	1
Grade 5	1
Grade 7	1
High Schools	1
More ▼

Audience

Researchers

Location

United States	4
Germany	3
Netherlands	3
Taiwan	2
Australia	1
Belgium	1
Canada	1
Europe	1
Indonesia	1
Iran	1
Israel	1
Japan	1
Saudi Arabia	1
Singapore	1
South Africa	1
South Korea	1
Sri Lanka	1
Sweden	1
Texas	1
Turkey	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	7
Program for International…	6
Graduate Record Examinations	5
National Assessment of…	5
Raven Advanced Progressive…	3
Test of English as a Foreign…	2
Wechsler Adult Intelligence…	2
Armed Services Vocational…	1
Big Five Inventory	1
North Carolina End of Course…	1
Texas Assessment of Academic…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 136 results Save | Export

Using Item Scores and Distractors in Person-Fit Assessment

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023

In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…

Descriptors: Test Items, Scores, Goodness of Fit, Statistics

Extending an Identified Four-Parameter IRT Model: The Confirmatory Set-4PNO Model

Peer reviewed

Direct link

Justin L. Kern – Journal of Educational and Behavioral Statistics, 2024

Given the frequent presence of slipping and guessing in item responses, models for the inclusion of their effects are highly important. Unfortunately, the most common model for their inclusion, the four-parameter item response theory model, potentially has severe deficiencies related to its possible unidentifiability. With this issue in mind, the…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Generalization

Effects of the Quantity and Magnitude of Cross-Loading and Model Specification on MIRT Item Parameter Recovery

Peer reviewed

Direct link

Mostafa Hosseinzadeh; Ki Lynn Matlock Cole – Educational and Psychological Measurement, 2024

In real-world situations, multidimensional data may appear on large-scale tests or psychological surveys. The purpose of this study was to investigate the effects of the quantity and magnitude of cross-loadings and model specification on item parameter recovery in multidimensional Item Response Theory (MIRT) models, especially when the model was…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Algorithms

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs

Peer reviewed

Direct link

Hyo Jeong Shin; Christoph König; Frederic Robin; Andreas Frey; Kentaro Yamamoto – Journal of Educational Measurement, 2025

Many international large-scale assessments (ILSAs) have switched to multistage adaptive testing (MST) designs to improve measurement efficiency in measuring the skills of the heterogeneous populations around the world. In this context, previous literature has reported the acceptable level of model parameter recovery under the MST designs when the…

Descriptors: Robustness (Statistics), Item Response Theory, Adaptive Testing, Test Construction

A Factored Regression Model for Composite Scores with Item-Level Missing Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Egamaria Alacam; Craig K. Enders; Han Du; Brian T. Keller – Grantee Submission, 2023

Composite scores are an exceptionally important psychometric tool for behavioral science research applications. A prototypical example occurs with self-report data, where researchers routinely use questionnaires with multiple items that tap into different features of a target construct. Item-level missing data are endemic to composite score…

Descriptors: Regression (Statistics), Scores, Psychometrics, Test Items

A Rating Scale Mixture Model to Account for the Tendency to Middle and Extreme Categories

Peer reviewed

Direct link

Colombi, Roberto; Giordano, Sabrina; Tutz, Gerhard – Journal of Educational and Behavioral Statistics, 2021

A mixture of logit models is proposed that discriminates between responses to rating questions that are affected by a tendency to prefer middle or extremes of the scale regardless of the content of the item (response styles) and purely content-driven preferences. Explanatory variables are used to characterize the content-driven way of answering as…

Descriptors: Rating Scales, Response Style (Tests), Test Items, Models

Modeling Nonlinear Effects of Person-by-Item Covariates in Explanatory Item Response Models: Exploratory Plots and Modeling Using Smooth Functions

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Grantee Submission, 2024

Explanatory item response models (EIRMs) have been applied to investigate the effects of person covariates, item covariates, and their interactions in the fields of reading education and psycholinguistics. In practice, it is often assumed that the relationships between the covariates and the logit transformation of item response probability are…

Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics

Modeling Nonlinear Effects of Person-by-Item Covariates in Explanatory Item Response Models: Exploratory Plots and Modeling Using Smooth Functions

Peer reviewed

Direct link

Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Journal of Educational Measurement, 2024

Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics

Regression with Reduced Rank Predictor Matrices: A Model of Trade-Offs

Peer reviewed
PDF on ERIC

Download full text

Direct link

Davison, Mark L.; Davenport, Ernest C., Jr.; Jia, Hao; Seipel, Ben; Carlson, Sarah E. – Grantee Submission, 2022

A regression model of predictor trade-offs is described. Each regression parameter equals the expected change in Y obtained by trading 1 point from one predictor to a second predictor. The model applies to predictor variables that sum to a constant T for all observations; for example, proportions summing to T=1.0 or percentages summing to T=100…

Descriptors: Regression (Statistics), Prediction, Predictor Variables, Models

Negative Binomial Models for Visual Fixation Counts on Test Items

Peer reviewed

Direct link

Man, Kaiwen; Harring, Jeffrey R. – Educational and Psychological Measurement, 2019

With the development of technology-enhanced learning platforms, eye-tracking biometric indicators can be recorded simultaneously with students item responses. In the current study, visual fixation, an essential eye-tracking indicator, is modeled to reflect the degree of test engagement when a test taker solves a set of test questions. Three…

Descriptors: Test Items, Eye Movements, Models, Regression (Statistics)

Using Machine Learning to Predict Bloom's Taxonomy Level for Certification Exam Items

Peer reviewed

Direct link

Mead, Alan D.; Zhou, Chenxuan – Journal of Applied Testing Technology, 2022

This study fit a Naïve Bayesian classifier to the words of exam items to predict the Bloom's taxonomy level of the items. We addressed five research questions, showing that reasonably good prediction of Bloom's level was possible, but accuracy varies across levels. In our study, performance for Level 2 was poor (Level 2 items were misclassified…

Descriptors: Artificial Intelligence, Prediction, Taxonomy, Natural Language Processing

Beyond Semantic Distance: Automated Scoring of Divergent Thinking Greatly Improves with Large Language Models

Peer reviewed
PDF on ERIC

Download full text

Direct link

Peter Organisciak; Selcuk Acar; Denis Dumas; Kelly Berthiaume – Grantee Submission, 2023

Automated scoring for divergent thinking (DT) seeks to overcome a key obstacle to creativity measurement: the effort, cost, and reliability of scoring open-ended tests. For a common test of DT, the Alternate Uses Task (AUT), the primary automated approach casts the problem as a semantic distance between a prompt and the resulting idea in a text…

Descriptors: Automation, Computer Assisted Testing, Scoring, Creative Thinking

Performance of the S-X[superscript 2] Statistic for the Multidimensional Graded Response Model

Peer reviewed

Direct link

Su, Shiyang; Wang, Chun; Weiss, David J. – Educational and Psychological Measurement, 2021

S-X[superscript 2] is a popular item fit index that is available in commercial software packages such as "flex"MIRT. However, no research has systematically examined the performance of S-X[superscript 2] for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was…

Descriptors: Statistics, Goodness of Fit, Test Items, Models

Bayesian Estimation and Testing of a Linear Logistic Test Model for Learning during the Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021

The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…

Descriptors: Bayesian Statistics, Computation, Learning, Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational and Psychological…	20
Journal of Educational and…	13
Applied Psychological…	12
Journal of Educational…	12
ETS Research Report Series	9
Psychometrika	9
Grantee Submission	8
Applied Measurement in…	6
ProQuest LLC	6
Educational Sciences: Theory…	2
International Journal of…	2
Practical Assessment,…	2
Alberta Journal of…	1
Autism: The International…	1
College Board	1
European Educational Research…	1
Hacettepe University Journal…	1
IEEE Transactions on Education	1
International Group for the…	1
International Journal of…	1
International Journal of…	1
International Working Group…	1
Journal of Applied Testing…	1
Journal of Deaf Studies and…	1
Journal of Memory and Language	1
More ▼

Chang, Hua-Hua	3
De Boeck, Paul	3
Revuelta, Javier	3
Sinharay, Sandip	3
Tao, Jian	3
Wang, Chun	3
Wang, Wen-Chung	3
Amanda Goodwin	2
Bolt, Daniel M.	2
Chun Wang	2
Finch, Holmes	2
He, Wei	2
Hessen, David J.	2
Huang, Hung-Yu	2
Janssen, Rianne	2
Jiao, Hong	2
Jin, Ying	2
Karabatsos, George	2
Lozano, José H.	2
Man, Kaiwen	2
Matthew Naveiras	2
Paek, Insu	2
Paul De Boeck	2
Pohl, Steffi	2
More ▼