Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 4 |
Descriptor
| Methods | 4 |
| Test Format | 4 |
| Accuracy | 2 |
| Item Response Theory | 2 |
| Advanced Placement Programs | 1 |
| Bayesian Statistics | 1 |
| Classification | 1 |
| Comparative Analysis | 1 |
| Computation | 1 |
| Cutting Scores | 1 |
| Data Interpretation | 1 |
Author
| Ali, Usama S. | 1 |
| Debeer, Dries | 1 |
| Kim, Stella Y. | 1 |
| Lee, Won-Chan | 1 |
| Luo, Yong | 1 |
| Stella Yun Kim | 1 |
| Ting Sun | 1 |
| van Rijn, Peter W. | 1 |
Publication Type
| Journal Articles | 4 |
| Reports - Research | 4 |
Education Level
| High Schools | 1 |
| Secondary Education | 1 |
Assessments and Surveys
| Advanced Placement… | 1 |
Ting Sun; Stella Yun Kim – Educational and Psychological Measurement, 2024
Equating is a statistical procedure used to adjust for the difference in form difficulty such that scores on those forms can be used and interpreted comparably. In practice, however, equating methods are often implemented without considering the extent to which two forms differ in difficulty. The study aims to examine the effect of the magnitude…
Descriptors: Difficulty Level, Data Interpretation, Equated Scores, High School Students
Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2021
To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model-selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight…
Descriptors: Item Response Theory, Test Format, Selection, Methods
Kim, Stella Y.; Lee, Won-Chan – Applied Measurement in Education, 2019
This study explores classification consistency and accuracy for mixed-format tests using real and simulated data. In particular, the current study compares six methods of estimating classification consistency and accuracy for seven mixed-format tests. The relative performance of the estimation methods is evaluated using simulated data. Study…
Descriptors: Classification, Reliability, Accuracy, Test Format
Debeer, Dries; Ali, Usama S.; van Rijn, Peter W. – Journal of Educational Measurement, 2017
Test assembly is the process of selecting items from an item pool to form one or more new test forms. Often new test forms are constructed to be parallel with an existing (or an ideal) test. Within the context of item response theory, the test information function (TIF) or the test characteristic curve (TCC) are commonly used as statistical…
Descriptors: Test Format, Test Construction, Statistical Analysis, Comparative Analysis

