ERIC - Search Results

Showing all 9 results

A Comparison of the Relative Performance of Four IRT Models on Equating Passage-Based Tests

Peer reviewed

Kim, Kyung Yong; Lim, Euijin; Lee, Won-Chan – International Journal of Testing, 2019

For passage-based tests, items that belong to a common passage often violate the local independence assumption of unidimensional item response theory (UIRT). In this case, ignoring local item dependence (LID) and estimating item parameters using a UIRT model could be problematic because doing so might result in inaccurate parameter estimates,…

Descriptors: Item Response Theory, Equated Scores, Test Items, Models

Subscore Equating and Profile Reporting

Peer reviewed

Lim, Euijin; Lee, Won-Chan – Applied Measurement in Education, 2020

The purpose of this study is to address the necessity of subscore equating and to evaluate the performance of various equating methods for subtests. Assuming the random groups design and number-correct scoring, this paper analyzed real data and simulated data with four study factors including test dimensionality, subtest length, form difference in…

Descriptors: Equated Scores, Test Length, Test Format, Difficulty Level

Classification Consistency and Accuracy for Mixed-Format Tests

Peer reviewed

Kim, Stella Y.; Lee, Won-Chan – Applied Measurement in Education, 2019

This study explores classification consistency and accuracy for mixed-format tests using real and simulated data. In particular, the current study compares six methods of estimating classification consistency and accuracy for seven mixed-format tests. The relative performance of the estimation methods is evaluated using simulated data. Study…

Descriptors: Classification, Reliability, Accuracy, Test Format

IRT Approaches to Modeling Scores on Mixed-Format Tests

Peer reviewed

Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020

This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…

Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests

Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

Peer reviewed

Lee, Guemin; Lee, Won-Chan – Applied Measurement in Education, 2016

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

Descriptors: Test Format, Multidimensional Scaling, Item Response Theory, Equated Scores

An Empirical Investigation of Methods for Assessing Item Fit for Mixed Format Tests

Peer reviewed

Chon, Kyong Hee; Lee, Won-Chan; Ansley, Timothy N. – Applied Measurement in Education, 2013

Empirical information regarding performance of model-fit procedures has been a persistent need in measurement practice. Statistical procedures for evaluating item fit were applied to real test examples that consist of both dichotomously and polytomously scored items. The item fit statistics used in this study included the PARSCALE's G²,…

Descriptors: Test Format, Test Items, Item Analysis, Goodness of Fit

Exploring Equity Properties in Equating Using AP® Examinations. Research Report No. 2012-4

Lee, Eunjung; Lee, Won-Chan; Brennan, Robert L. – College Board, 2012

In almost all high-stakes testing programs, test equating is necessary to ensure that test scores across multiple test administrations are equivalent and can be used interchangeably. Test equating becomes even more challenging in mixed-format tests, such as Advanced Placement Program® (AP®) Exams, that contain both multiple-choice and constructed…

Descriptors: Test Construction, Test Interpretation, Test Norms, Test Reliability

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

An Extension of Four IRT Linking Methods for Mixed-Format Tests

Peer reviewed

Kim, Seonghoon; Lee, Won-Chan – Journal of Educational Measurement, 2006

Under item response theory (IRT), linking proficiency scales from separate calibrations of multiple forms of a test to achieve a common scale is required in many applications. Four IRT linking methods including the mean/mean, mean/sigma, Haebara, and Stocking-Lord methods have been presented for use with single-format tests. This study extends the…

Descriptors: Simulation, Item Response Theory, Test Format, Measures (Individuals)
