ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Descriptor

Comparative Analysis	7
Educational Assessment	7
Item Response Theory	4
Accuracy	3
Equated Scores	3
Models	3
Simulation	3
Test Items	3
Elementary Secondary Education	2
Evaluation Criteria	2
Evaluation Methods	2
Statistical Analysis	2
Test Results	2
Bayesian Statistics	1
Classification	1
College Students	1
Criteria	1
Decision Making	1
Difficulty Level	1
Educational Testing	1
Error of Measurement	1
Evaluation Research	1
Evaluators	1
Higher Education	1
Item Analysis	1
More ▼

Source

Applied Measurement in…

Author

Lee, Won-Chan	2
Linn, Robert L.	2
Ban, Jae-Chun	1
Chang, Hua-Hua	1
Kang, Hyeon-Ah	1
Kiplinger, Vonda L.	1
Koziol, Natalie A.	1
Lu, Ying	1
Plake, Barbara S.	1
Song, Yoon Ah	1

Publication Type

Journal Articles	7
Reports - Research	4
Reports - Evaluative	3
Information Analyses	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Effects of Using Double Ratings as Item Scores on IRT Proficiency Estimation

Peer reviewed

Direct link

Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022

This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…

Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy

IRT Item Parameter Scaling for Developing New Item Pools

Peer reviewed

Direct link

Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017

Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items

Parameter Recovery and Classification Accuracy under Conditions of Testlet Dependency: A Comparison of the Traditional 2PL, Testlet, and Bi-Factor Models

Peer reviewed

Direct link

Koziol, Natalie A. – Applied Measurement in Education, 2016

Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…

Descriptors: Classification, Accuracy, Comparative Analysis, Models

A Comparison of IRT Linking Procedures

Peer reviewed

Direct link

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Linking Results of Distinct Assessments.

Peer reviewed

Linn, Robert L. – Applied Measurement in Education, 1993

The following ways of linking results from distinct assessments to use them for multiple purposes are reviewed: (1) equating; (2) calibration; (3) statistical moderation; (4) prediction; and (5) social moderation. The characteristics of these methods, their requirements, and the comparative inferences they support are described. (SLD)

Descriptors: Comparative Analysis, Educational Assessment, Elementary Secondary Education, Equated Scores

Linking Statewide Tests to the National Assessment of Educational Progress: Stability of Results.

Peer reviewed

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995

The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…

Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

The Performance Domain and the Structure of the Decision Space.

Peer reviewed

Plake, Barbara S. – Applied Measurement in Education, 1995

This article provides a framework for the rest of the articles in this special issue comparing the utility of three standard-setting methods with complex performance assessments. The context of the standard setting study is described, and the methods are outlined. (SLD)

Descriptors: Comparative Analysis, Criteria, Decision Making, Educational Assessment