Publication Date
  In 2025: 0
  Since 2024: 0
  Since 2021 (last 5 years): 1
  Since 2016 (last 10 years): 5
  Since 2006 (last 20 years): 19

Descriptor
  Models: 21
  Statistical Analysis: 21
  Simulation: 8
  Computation: 6
  Equated Scores: 5
  Test Items: 5
  Comparative Analysis: 4
  Goodness of Fit: 4
  Item Response Theory: 4
  Scores: 4
  Psychometrics: 3

Source
  Journal of Educational…: 21

Author
  de la Torre, Jimmy: 3
  Holland, Paul W.: 2
  Moses, Tim: 2
  Suh, Youngsuk: 2
  Wiberg, Marie: 2
  Armstrong, Ronald D.: 1
  Baird, Jo-Anne: 1
  Bennett, Randy E.: 1
  Bolt, Daniel M.: 1
  Bränberg, Kenny: 1
  Chen, Jinsong: 1

Publication Type
  Journal Articles: 21
  Reports - Research: 13
  Reports - Evaluative: 6
  Information Analyses: 1
  Reports - Descriptive: 1
  Speeches/Meeting Papers: 1

Education Level
  Secondary Education: 3
  Elementary Secondary Education: 1
  Grade 10: 1
  Grade 9: 1
  High Schools: 1
  Higher Education: 1
  Junior High Schools: 1
  Middle Schools: 1
  Postsecondary Education: 1

Location
  United Kingdom (England): 1

Assessments and Surveys
  Program for International…: 1
Deribo, Tobias; Kroehne, Ulf; Goldhammer, Frank – Journal of Educational Measurement, 2021
The increased availability of time-related information as a result of computer-based assessment has enabled new ways to measure test-taking engagement. One of these ways is to distinguish between solution and rapid guessing behavior. Prior research has recommended response-level filtering to deal with rapid guessing. Response-level filtering can…
Descriptors: Guessing (Tests), Models, Reaction Time, Statistical Analysis
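The response-level filtering that Deribo, Kroehne, and Goldhammer discuss can be sketched in a few lines: responses whose response times fall below a rapid-guessing threshold are treated as missing rather than scored. This is a minimal illustration of the general idea only, not the authors' procedure; the function name, the fixed threshold, and the data are made up for the example.

```python
import numpy as np

def filter_rapid_guesses(responses, times, threshold):
    """Return a copy of `responses` with rapid guesses set to NaN.

    responses : 0/1 item scores
    times     : response times in seconds
    threshold : hypothetical cutoff; times below it count as rapid guessing
    """
    responses = np.asarray(responses, dtype=float)
    times = np.asarray(times, dtype=float)
    filtered = responses.copy()
    filtered[times < threshold] = np.nan  # drop rapid-guess responses
    return filtered

# Responses 2 and 4 were given in under 3 seconds, so they are flagged.
filtered = filter_rapid_guesses([1, 0, 1, 1], [12.4, 1.1, 8.9, 0.7], threshold=3.0)
```

In practice the threshold would be set per item (e.g., from the response-time distribution) rather than as a single constant.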
Lee, Sora; Bolt, Daniel M. – Journal of Educational Measurement, 2018
Both the statistical and interpretational shortcomings of the three-parameter logistic (3PL) model in accommodating guessing effects on multiple-choice items are well documented. We consider the use of a residual heteroscedasticity (RH) model as an alternative, and compare its performance to the 3PL with real test data sets and through simulation…
Descriptors: Statistical Analysis, Models, Guessing (Tests), Multiple Choice Tests
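The three-parameter logistic (3PL) model that Lee and Bolt evaluate is the standard item response function with a lower asymptote for guessing. The sketch below shows the textbook form of the model, not anything specific to their residual-heteroscedasticity alternative; the parameter values are arbitrary.

```python
import math

def p_3pl(theta, a, b, c):
    """3PL item response function:
    P(theta) = c + (1 - c) / (1 + exp(-a * (theta - b)))
    a = discrimination, b = difficulty, c = lower asymptote ("guessing").
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# At theta == b the model predicts the midpoint between c and 1.
p_mid = p_3pl(theta=0.0, a=1.2, b=0.0, c=0.2)  # -> 0.6
```

The lower asymptote c is what makes guessing effects hard to interpret: even examinees far below the item difficulty retain success probability c.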
Guo, Hongwen; Deane, Paul D.; van Rijn, Peter W.; Zhang, Mo; Bennett, Randy E. – Journal of Educational Measurement, 2018
The goal of this study is to model pauses extracted from writing keystroke logs as a way of characterizing the processes students use in essay composition. Two kinds of low-level timing data were modeled: the interkey interval and its subtype, the intraword duration, both thought to reflect processes associated with keyboarding skills and composition fluency.…
Descriptors: Writing Processes, Writing (Composition), Essays, Models
Wiberg, Marie; González, Jorge – Journal of Educational Measurement, 2016
Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…
Descriptors: Statistical Analysis, Equated Scores, Scores, Models
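The score transformation Wiberg and González refer to can be illustrated with the simplest case, mean-sigma linear equating: map scores on form X onto form Y's scale so the transformed X scores share Y's mean and standard deviation. This is a generic textbook sketch under made-up data, not the methods compared in the article.

```python
import statistics

def linear_equate(x_scores, y_scores):
    """Return a function mapping form-X scores onto form Y's scale
    via the mean-sigma linear transformation y = slope * x + intercept."""
    mx, my = statistics.fmean(x_scores), statistics.fmean(y_scores)
    sx, sy = statistics.pstdev(x_scores), statistics.pstdev(y_scores)
    slope = sy / sx
    intercept = my - slope * mx
    return lambda x: slope * x + intercept

# Illustrative (fabricated) score samples from two test forms.
eq = linear_equate([10, 12, 14, 16], [20, 24, 28, 32])
```

By construction, the mean of form X maps to the mean of form Y, which is one of the criteria by which the success of an equating can be judged.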
Debeer, Dries; Janssen, Rianne; De Boeck, Paul – Journal of Educational Measurement, 2017
When dealing with missing responses, two types of omissions can be discerned: items can be skipped or not reached by the test taker. When the occurrence of these omissions is related to the proficiency process the missingness is nonignorable. The purpose of this article is to present a tree-based IRT framework for modeling responses and omissions…
Descriptors: Item Response Theory, Test Items, Responses, Testing Problems
González, B. Jorge; von Davier, Matthias – Journal of Educational Measurement, 2013
Based on Lord's criterion of equity of equating, van der Linden (this issue) revisits the so-called local equating method and offers alternative as well as new thoughts on several topics including the types of transformations, symmetry, reliability, and population invariance appropriate for equating. A remarkable aspect is to define equating…
Descriptors: Equated Scores, Statistical Analysis, Models, Statistical Inference
Terzi, Ragip; Suh, Youngsuk – Journal of Educational Measurement, 2015
An odds ratio approach (ORA) under the framework of a nested logit model was proposed for evaluating differential distractor functioning (DDF) in multiple-choice items and was compared with an existing ORA developed under the nominal response model. The performances of the two ORAs for detecting DDF were investigated through an extensive…
Descriptors: Test Bias, Multiple Choice Tests, Test Items, Comparative Analysis
de la Torre, Jimmy; Lee, Young-Sun – Journal of Educational Measurement, 2013
This article used the Wald test to evaluate the item-level fit of a saturated cognitive diagnosis model (CDM) relative to the fits of the reduced models it subsumes. A simulation study was carried out to examine the Type I error and power of the Wald test in the context of the G-DINA model. Results show that when the sample size is small and a…
Descriptors: Statistical Analysis, Test Items, Goodness of Fit, Error of Measurement
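The Wald test that de la Torre and Lee apply to item-level CDM fit has the generic form W = (Rb)' (R V R')⁻¹ (Rb), approximately chi-square with rank(R) degrees of freedom under the null restriction Rb = 0. The sketch below computes this generic statistic with fabricated numbers; it is not the G-DINA-specific setup from the article.

```python
import numpy as np

def wald_statistic(beta, cov, R):
    """Wald statistic for the linear restriction R @ beta = 0,
    given parameter estimates `beta` and their covariance matrix `cov`."""
    beta = np.asarray(beta, dtype=float)
    R = np.atleast_2d(np.asarray(R, dtype=float))
    r = R @ beta                              # restriction evaluated at the estimate
    middle = np.linalg.inv(R @ cov @ R.T)     # inverse covariance of R @ beta
    return float(r @ middle @ r)

# Illustrative values: test whether the second parameter is zero.
beta = np.array([0.9, 0.1])
cov = np.diag([0.04, 0.04])
W = wald_statistic(beta, cov, R=[[0.0, 1.0]])  # W = 0.1**2 / 0.04 = 0.25
```

Comparing a saturated model against a reduced model it subsumes amounts to choosing R so that the restriction collapses the extra parameters of the saturated model.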
Chen, Jinsong; de la Torre, Jimmy; Zhang, Zao – Journal of Educational Measurement, 2013
As with any psychometric models, the validity of inferences from cognitive diagnosis models (CDMs) determines the extent to which these models can be useful. For inferences from CDMs to be valid, it is crucial that the fit of the model to the data is ascertained. Based on a simulation study, this study investigated the sensitivity of various fit…
Descriptors: Models, Psychometrics, Goodness of Fit, Statistical Analysis
Ranger, Jochen; Kuhn, Jörg-Tobias – Journal of Educational Measurement, 2012
The information matrix can equivalently be determined via the expectation of the Hessian matrix or the expectation of the outer product of the score vector. The identity of these two matrices, however, is only valid in case of a correctly specified model. Therefore, differences between the two versions of the observed information matrix indicate…
Descriptors: Goodness of Fit, Item Response Theory, Models, Matrices
Bränberg, Kenny; Wiberg, Marie – Journal of Educational Measurement, 2011
This paper examined observed score linear equating in two different data collection designs, the equivalent groups design and the nonequivalent groups design, when information from covariates (i.e., background variables correlated with the test scores) was included. The main purpose of the study was to examine the effect (i.e., bias, variance, and…
Descriptors: Equated Scores, Data Collection, Models, Accuracy
Leckie, George; Baird, Jo-Anne – Journal of Educational Measurement, 2011
This study examined rater effects on essay scoring in an operational monitoring system from England's 2008 national curriculum English writing test for 14-year-olds. We fitted two multilevel models and analyzed: (1) drift in rater severity effects over time; (2) rater central tendency effects; and (3) differences in rater severity and central…
Descriptors: Scoring, Foreign Countries, National Curriculum, Writing Tests
Wang, Wen-Chung; Wu, Shiu-Lien – Journal of Educational Measurement, 2011
Rating scale items have been widely used in educational and psychological tests. These items require people to make subjective judgments, and these subjective judgments usually involve randomness. To account for this randomness, Wang, Wilson, and Shih proposed the random-effect rating scale model in which the threshold parameters are treated as…
Descriptors: Rating Scales, Models, Statistical Analysis, Computation
Shang, Yi – Journal of Educational Measurement, 2012
Growth models are used extensively in the context of educational accountability to evaluate student-, class-, and school-level growth. However, when error-prone test scores are used as independent variables or right-hand-side controls, the estimation of such growth models can be substantially biased. This article introduces a…
Descriptors: Error of Measurement, Statistical Analysis, Regression (Statistics), Simulation
Suh, Youngsuk; Cho, Sun-Joo; Wollack, James A. – Journal of Educational Measurement, 2012
In the presence of test speededness, the parameter estimates of item response theory models can be poorly estimated due to conditional dependencies among items, particularly for end-of-test items (i.e., speeded items). This article conducted a systematic comparison of five item calibration procedures: a two-parameter logistic (2PL) model, a…
Descriptors: Response Style (Tests), Timed Tests, Test Items, Item Response Theory