Showing 1 to 15 of 24 results
Peer reviewed
Direct link
Güler Yavuz Temel – Journal of Educational Measurement, 2024
The purpose of this study was to investigate multidimensional DIF with simple and nonsimple structures in the context of the multidimensional Graded Response Model (MGRM). The study examined and compared the performance of the IRT-LR and Wald tests using MML-EM and MHRM estimation approaches with different test factors and test structures in…
Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models
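For context, the IRT-LR approach named in this abstract tests DIF by comparing a model whose item parameters are constrained equal across groups against one that frees the studied item's parameters for the focal group. A minimal Python sketch of that comparison (the MGRM fitting itself is omitted, the function name is ours, and the log-likelihood values are hypothetical placeholders):

from scipy.stats import chi2

def lr_dif_test(loglik_constrained, loglik_free, df):
    # Deviance difference between the DIF-free (constrained) model and the
    # model that frees the studied item's parameters for the focal group.
    g2 = 2.0 * (loglik_free - loglik_constrained)
    return g2, chi2.sf(g2, df)

# Hypothetical log-likelihoods; two parameters freed for the focal group.
stat, p = lr_dif_test(-10234.7, -10229.1, df=2)
print(stat, p)

The Wald test reaches the same decision a different way: it compares the freed parameter estimates directly against their estimated covariance matrix, without refitting the constrained model.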
Peer reviewed
Direct link
Henninger, Mirka – Journal of Educational Measurement, 2021
Item Response Theory models with varying thresholds are essential tools for accounting for unknown types of response tendencies in rating data. However, to separate the constructs being measured from response tendencies, specific constraints have to be imposed on the varying thresholds and their interrelations. In this article, a multidimensional…
Descriptors: Response Style (Tests), Item Response Theory, Models, Computation
Peer reviewed
Direct link
Hong, Maxwell; Rebouças, Daniella A.; Cheng, Ying – Journal of Educational Measurement, 2021
Response time has started to play an increasingly important role in educational and psychological testing, which has prompted the proposal of many response time models in recent years. However, response time modeling can be adversely impacted by aberrant response behavior. For example, test speededness can cause response times on certain items to deviate…
Descriptors: Reaction Time, Models, Computation, Robustness (Statistics)
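For readers new to this literature, a common baseline is van der Linden's lognormal response time model, which is representative of the class of models discussed here (the article's specific model may differ). A minimal sketch that simulates a single response time from it (parameter values hypothetical):

import numpy as np

rng = np.random.default_rng(0)

def simulate_rt(tau, beta, alpha, rng):
    # ln T = beta - tau + eps / alpha, eps ~ N(0, 1): slower persons
    # (small tau) and time-intensive items (large beta) yield longer
    # times; alpha controls the spread around the expected log time.
    return np.exp(beta - tau + rng.standard_normal() / alpha)

# Hypothetical values: person speed 0.2, item time intensity 4.0, alpha 1.5.
print(simulate_rt(0.2, 4.0, 1.5, rng))

Speededness of the kind the abstract mentions shows up as response times that fall systematically below what this model predicts for end-of-test items.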
Peer reviewed
Direct link
Cornelis Potgieter; Xin Qiao; Akihito Kamata; Yusuf Kara – Journal of Educational Measurement, 2024
As part of the effort to develop an improved oral reading fluency (ORF) assessment system, Kara et al. estimated ORF scores based on a latent variable psychometric model of accuracy and speed for ORF data via a fully Bayesian approach. This study further investigates likelihood-based estimators for the model-derived ORF scores, including…
Descriptors: Oral Reading, Reading Fluency, Scores, Psychometrics
Peer reviewed
Direct link
Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Journal of Educational Measurement, 2024
Explanatory item response models (EIRMs) have been applied to investigate the effects of person covariates, item covariates, and their interactions in the fields of reading education and psycholinguistics. In practice, it is often assumed that the relationships between the covariates and the logit transformation of item response probability are…
Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics
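As background, an EIRM in the LLTM tradition makes the logit of a correct response linear in the item covariates, which is presumably the kind of assumed relationship the truncated sentence above refers to. A minimal sketch (the function name, covariate choices, and values are ours, purely for illustration):

import numpy as np

def eirm_prob(theta, item_covs, betas):
    # LLTM-style explanatory model: item difficulty is a linear
    # combination of item covariates, so logit P = theta - x_j' beta.
    logit = theta - np.dot(item_covs, betas)
    return 1.0 / (1.0 + np.exp(-logit))

# Hypothetical: two item covariates (e.g., word frequency, word length)
# with effects 0.8 and -0.4, for a person with ability 0.5.
print(eirm_prob(theta=0.5, item_covs=[1.2, 0.3], betas=[0.8, -0.4]))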
Peer reviewed
Direct link
Jin, Kuan-Yu; Wang, Wen-Chung – Journal of Educational Measurement, 2018
The Rasch facets model was developed to account for facet data, such as student essays graded by raters, but it accounts for only one kind of rater effect (severity). In practice, raters may exhibit various tendencies such as using middle or extreme scores in their ratings, which is referred to as the rater centrality/extremity response style. To…
Descriptors: Scoring, Models, Interrater Reliability, Computation
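For context, a rating-scale formulation of the facets model adds a single rater severity term to the usual person and item parameters; centrality/extremity tendencies are exactly what that one term cannot capture. A minimal sketch of the category probabilities (function name and parameter values are ours, for illustration only):

import numpy as np

def facets_probs(theta, beta, severity, taus):
    # Rating-scale facets model: the log-odds of stepping from category
    # k-1 to k is theta - beta - severity - taus[k]; category
    # probabilities come from cumulative sums of these step logits.
    steps = theta - beta - severity - np.asarray(taus, dtype=float)
    logits = np.concatenate(([0.0], np.cumsum(steps)))
    p = np.exp(logits - logits.max())
    return p / p.sum()

# Hypothetical: able examinee, middling essay, a severe rater (+0.8 logits).
print(facets_probs(theta=1.0, beta=0.5, severity=0.8, taus=[-1.0, 0.0, 1.0]))

A severe rater shifts probability toward lower categories for every essay alike, which is why severity alone cannot mimic a rater who simply prefers middle (or extreme) categories.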
Peer reviewed
Direct link
Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019
Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…
Descriptors: Rating Scales, Models, Evaluators, Data Collection
Peer reviewed
Direct link
Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017
The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set (e.g., choosing one item from a pair to answer), always yields incomplete data (i.e., only the selected items are answered, and the others have missing data) that are likely nonignorable. Therefore, using…
Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis
Peer reviewed
Direct link
Wiberg, Marie; González, Jorge – Journal of Educational Measurement, 2016
Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…
Descriptors: Statistical Analysis, Equated Scores, Scores, Models
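As a concrete example of the transformation functions this abstract refers to, an unsmoothed equipercentile function maps a form-X score to the form-Y score with the same percentile rank. A minimal sketch (function name ours, sample data simulated; operational equating presmooths the score distributions first):

import numpy as np

def equipercentile_equate(scores_x, scores_y, x):
    # Percentile rank of x on form X, mapped to the form-Y score with
    # the same percentile rank.
    p = np.mean(np.asarray(scores_x) <= x)
    return np.quantile(scores_y, p)

# Hypothetical score samples from two 40-item forms of unequal difficulty.
rng = np.random.default_rng(1)
form_x = rng.binomial(40, 0.6, size=1000)
form_y = rng.binomial(40, 0.55, size=1000)
print(equipercentile_equate(form_x, form_y, x=25))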
Peer reviewed
Direct link
Li, Xiaomin; Wang, Wen-Chung – Journal of Educational Measurement, 2015
The assessment of differential item functioning (DIF) is routinely conducted to ensure test fairness and validity. Although many DIF assessment methods have been developed in the context of classical test theory and item response theory, they are not applicable for cognitive diagnosis models (CDMs), as the underlying latent attributes of CDMs are…
Descriptors: Test Bias, Models, Cognitive Measurement, Evaluation Methods
Peer reviewed
Direct link
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT)-estimated reliability is derived for four coefficients (Cronbach's α, Feldt-Raju, stratified α, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
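For reference, the classical (non-IRT) version of the first of those four coefficients is straightforward to compute; the article derives IRT-based counterparts. A minimal sketch of classical Cronbach's α (function name and example data are ours):

import numpy as np

def cronbach_alpha(scores):
    # scores: (n_persons, n_items). alpha = k/(k-1) * (1 - sum of item
    # variances / variance of total scores), using sample (ddof=1) variances.
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]
    item_var = scores.var(axis=0, ddof=1).sum()
    total_var = scores.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1.0 - item_var / total_var)

# Hypothetical 0/1 item scores for five persons on four items.
x = [[1, 1, 0, 1], [0, 1, 0, 0], [1, 1, 1, 1], [0, 0, 0, 1], [1, 0, 1, 1]]
print(cronbach_alpha(x))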
Peer reviewed
Direct link
Kahraman, Nilufer – Journal of Educational Measurement, 2013
This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…
Descriptors: Test Items, Item Response Theory, Computation, Models
Peer reviewed
Direct link
González, B. Jorge; von Davier, Matthias – Journal of Educational Measurement, 2013
Based on Lord's criterion of equity of equating, van der Linden (this issue) revisits the so-called local equating method and offers alternative as well as new perspectives on several topics, including the types of transformations, symmetry, reliability, and population invariance appropriate for equating. A remarkable aspect is to define equating…
Descriptors: Equated Scores, Statistical Analysis, Models, Statistical Inference
Peer reviewed
Direct link
von Davier, Matthias; González B., Jorge; von Davier, Alina A. – Journal of Educational Measurement, 2013
Local equating (LE) is based on Lord's criterion of equity. It defines a family of true transformations that aim at the ideal of equitable equating. van der Linden (this issue) offers a detailed discussion of common issues in observed-score equating relative to this local approach. By assuming an underlying item response theory model, one of…
Descriptors: Equated Scores, Transformations (Mathematics), Item Response Theory, Raw Scores
Peer reviewed
Direct link
Huang, Hung-Yu; Wang, Wen-Chung – Journal of Educational Measurement, 2014
The DINA (deterministic inputs, noisy "and" gate) model has been widely used in cognitive diagnosis tests and in the process of test development. The slip and guess parameters are included in the DINA model's item response function, which represents the responses to the items. This study aimed to extend the DINA model by using a random-effect approach to allow…
Descriptors: Models, Guessing (Tests), Probability, Ability
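For context, the fixed-effect DINA item response function that the article extends reduces to just two probabilities per item. A minimal sketch (function name and parameter values are ours, for illustration):

def dina_prob(alpha, q_row, slip, guess):
    # eta = 1 only if the examinee masters every attribute the item
    # requires (per the item's Q-matrix row); then P(correct) = 1 - slip,
    # otherwise P(correct) = guess.
    eta = all(a >= q for a, q in zip(alpha, q_row))
    return 1.0 - slip if eta else guess

# Hypothetical: examinee masters attributes 1-2 of 3; the item requires 1-2.
print(dina_prob(alpha=[1, 1, 0], q_row=[1, 1, 0], slip=0.1, guess=0.2))  # 0.9

Making slip and guess random effects, as the abstract describes, lets these two probabilities vary rather than stay fixed for all examinees.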