ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	7
Since 2017 (last 10 years)	22
Since 2007 (last 20 years)	38

Descriptor

Models	42
Sample Size	42
Test Items	42
Item Response Theory	27
Simulation	19
Test Length	15
Error of Measurement	12
Statistical Analysis	12
Correlation	11
Computation	10
Test Bias	9
Comparative Analysis	8
Goodness of Fit	8
Accuracy	7
Difficulty Level	6
Factor Analysis	6
Scores	5
Classification	4
Item Analysis	4
Maximum Likelihood Statistics	4
Monte Carlo Methods	4
Regression (Statistics)	4
Scoring	4
Statistical Bias	4
Statistical Distributions	4
More ▼

Source

Educational and Psychological…	8
ProQuest LLC	6
Applied Psychological…	4
Journal of Educational…	4
Educational Sciences: Theory…	3
International Journal of…	3
ACT, Inc.	1
American Journal of…	1
ETS Research Report Series	1
Hacettepe University Journal…	1
International Journal of…	1
International Journal of…	1
Journal of Education and…	1
Measurement and Evaluation in…	1
Measurement:…	1
Online Submission	1
Practical Assessment,…	1
Prevention Science	1
More ▼

Publication Type

Journal Articles	32
Reports - Research	28
Dissertations/Theses -…	6
Reports - Evaluative	6
Speeches/Meeting Papers	3
Reports - Descriptive	2
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Turkey

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
Hopkins Symptom Checklist	1
National Assessment of…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 42 results Save | Export

Assessing Model Fit of the Generalized Graded Unfolding Model

Peer reviewed
PDF on ERIC

Download full text

Abdulla Alzarouni; R. J. De Ayala – Practical Assessment, Research & Evaluation, 2025

The assessment of model fit in latent trait modeling is an integral part of correctly applying the model. Still the assessment of model fit has been less utilized for ideal point models such as the Generalized Graded Unfolding Models (GGUM). The current study assesses the performance of the relative fit indices "AIC" and "BIC,"…

Descriptors: Goodness of Fit, Models, Statistical Analysis, Sample Size

Item Parameter Recovery via Traditional 2PL, Testlet and Bi-Factor Models for Testlet-Based Tests

Peer reviewed
PDF on ERIC

Download full text

Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2022

The testlet comprises a set of items based on a common stimulus. When the testlet is used in the tests, there may violate the local independence assumption, and in this case, it would not be appropriate to use traditional item response theory models in the tests in which the testlet is included. When the testlet is discussed, one of the most…

Descriptors: Test Items, Test Theory, Models, Sample Size

Effects of the Quantity and Magnitude of Cross-Loading and Model Specification on MIRT Item Parameter Recovery

Peer reviewed

Direct link

Mostafa Hosseinzadeh; Ki Lynn Matlock Cole – Educational and Psychological Measurement, 2024

In real-world situations, multidimensional data may appear on large-scale tests or psychological surveys. The purpose of this study was to investigate the effects of the quantity and magnitude of cross-loadings and model specification on item parameter recovery in multidimensional Item Response Theory (MIRT) models, especially when the model was…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Algorithms

Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs

Peer reviewed

Direct link

Hyo Jeong Shin; Christoph König; Frederic Robin; Andreas Frey; Kentaro Yamamoto – Journal of Educational Measurement, 2025

Many international large-scale assessments (ILSAs) have switched to multistage adaptive testing (MST) designs to improve measurement efficiency in measuring the skills of the heterogeneous populations around the world. In this context, previous literature has reported the acceptable level of model parameter recovery under the MST designs when the…

Descriptors: Robustness (Statistics), Item Response Theory, Adaptive Testing, Test Construction

Regarding Item Parameter Invariance for the Rasch and the 2-Parameter Logistic Models: An Investigation under Finite Non-Representative Sample Calibrations

Peer reviewed

Direct link

Paek, Insu; Liang, Xinya; Lin, Zhongtian – Measurement: Interdisciplinary Research and Perspectives, 2021

The property of item parameter invariance in item response theory (IRT) plays a pivotal role in the applications of IRT such as test equating. The scope of parameter invariance when using estimates from finite biased samples in the applications of IRT does not appear to be clearly documented in the IRT literature. This article provides information…

Descriptors: Item Response Theory, Computation, Test Items, Bias

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Download full text

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Direct link

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

The Recovery of Correlation between Latent Abilities Using Compensatory and Noncompensatory Multidimensional IRT Models

Peer reviewed

Direct link

Fu, Yanyan; Strachan, Tyler; Ip, Edward H.; Willse, John T.; Chen, Shyh-Huei; Ackerman, Terry – International Journal of Testing, 2020

This research examined correlation estimates between latent abilities when using the two-dimensional and three-dimensional compensatory and noncompensatory item response theory models. Simulation study results showed that the recovery of the latent correlation was best when the test contained 100% of simple structure items for all models and…

Descriptors: Item Response Theory, Models, Test Items, Simulation

Performance of the S-X[superscript 2] Statistic for the Multidimensional Graded Response Model

Peer reviewed

Direct link

Su, Shiyang; Wang, Chun; Weiss, David J. – Educational and Psychological Measurement, 2021

S-X[superscript 2] is a popular item fit index that is available in commercial software packages such as "flex"MIRT. However, no research has systematically examined the performance of S-X[superscript 2] for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was…

Descriptors: Statistics, Goodness of Fit, Test Items, Models

Harmonizing Depression Measures across Studies: A Tutorial for Data Harmonization

Peer reviewed

Direct link

Zhao, Xin; Coxe, Stefany; Sibley, Margaret H.; Zulauf-McCurdy, Courtney; Pettit, Jeremy W. – Prevention Science, 2023

There has been increasing interest in applying integrative data analysis (IDA) to analyze data across multiple studies to increase sample size and statistical power. Measures of a construct are frequently not consistent across studies. This article provides a tutorial on the complex decisions that occur when conducting harmonization of measures…

Descriptors: Data Analysis, Sample Size, Decision Making, Test Items

The Performance of the Semigeneralized Partial Credit Model for Handling Item-Level Missingness

Peer reviewed

Direct link

Zhou, Sherry; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2020

The semi-generalized partial credit model (Semi-GPCM) has been proposed as a unidimensional modeling method for handling not applicable scale responses and neutral scale responses, and it has been suggested that the model may be of use in handling missing data in scale items. The purpose of this study is to evaluate the ability of the…

Descriptors: Models, Statistical Analysis, Response Style (Tests), Test Items

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

The Impact of Different Missing Data Handling Methods on DINA Model

Peer reviewed
PDF on ERIC

Download full text

Sünbül, Seçil Ömür – International Journal of Evaluation and Research in Education, 2018

In this study, it was aimed to investigate the impact of different missing data handling methods on DINA model parameter estimation and classification accuracy. In the study, simulated data were used and the data were generated by manipulating the number of items and sample size. In the generated data, two different missing data mechanisms…

Descriptors: Data, Test Items, Sample Size, Statistical Analysis

Within-Item Interactions in Bifactor Models for Ordered-Categorical Item Responses

Direct link

Fager, Meghan L. – ProQuest LLC, 2019

Recent research in multidimensional item response theory has introduced within-item interaction effects between latent dimensions in the prediction of item responses. The objective of this study was to extend this research to bifactor models to include an interaction effect between the general and specific latent variables measured by an item.…

Descriptors: Test Items, Item Response Theory, Factor Analysis, Simulation

Diagnostic Classification Models: Recent Developments, Practical Issues, and Prospects

Peer reviewed

Direct link

Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020

More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…

Descriptors: Classification, Models, Diagnostic Tests, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3

Paek, Insu	3
Lin, Zhongtian	2
Suh, Youngsuk	2
Willse, John T.	2
de la Torre, Jimmy	2
Abdulla Alzarouni	1
Acar, Tulin	1
Ackerman, Terry	1
Andersson, Björn	1
Andreas Frey	1
Anil, Duygu	1
Atar, Burcu	1
Ayodele, Alicia Nicole	1
Baghaei, Purya	1
Baker, Frank B.	1
Breyer, F. Jay	1
Bulut, Okan	1
Carlson, James E.	1
Chalmers, Robert Philip	1
Chason, Walter M.	1
Chen, Shyh-Huei	1
Chernyshenko, Oleksandr S.	1
Cho, Sun-Joo	1
Christoph König	1
Coxe, Stefany	1
More ▼