Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 5 |
Descriptor
| Evaluation Methods | 6 |
| Test Length | 6 |
| Item Response Theory | 4 |
| Computation | 3 |
| Simulation | 3 |
| Test Items | 3 |
| Correlation | 2 |
| Evaluation Problems | 2 |
| Evaluation Research | 2 |
| Research Methodology | 2 |
| Testing Problems | 2 |
| More ▼ | |
Source
| Journal of Educational… | 2 |
| Applied Psychological… | 1 |
| Educational Research and… | 1 |
| Educational and Psychological… | 1 |
| OECD Publishing (NJ1) | 1 |
Author
| Camilli, Gregory | 1 |
| Cheng, Ying | 1 |
| Cui, Ying | 1 |
| Lathrop, Quinn N. | 1 |
| Leighton, Jacqueline P. | 1 |
| Wang, Wen-Chung | 1 |
| Woods, Carol M. | 1 |
| Wu, Margaret | 1 |
Publication Type
| Reports - Evaluative | 6 |
| Journal Articles | 5 |
| Numerical/Quantitative Data | 1 |
Education Level
| Elementary Secondary Education | 2 |
Audience
Location
| Asia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Program for International… | 1 |
| Trends in International… | 1 |
What Works Clearinghouse Rating
Lathrop, Quinn N.; Cheng, Ying – Journal of Educational Measurement, 2014
When cut scores for classifications occur on the total score scale, popular methods for estimating classification accuracy (CA) and classification consistency (CC) require assumptions about a parametric form of the test scores or about a parametric response model, such as item response theory (IRT). This article develops an approach to estimate CA…
Descriptors: Cutting Scores, Classification, Computation, Nonparametric Statistics
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Woods, Carol M. – Applied Psychological Measurement, 2008
In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…
Descriptors: Test Length, Computation, Item Response Theory, Maximum Likelihood Statistics
Wu, Margaret – OECD Publishing (NJ1), 2010
This paper makes an in-depth comparison of the PISA (OECD) and TIMSS (IEA) mathematics assessments conducted in 2003. First, a comparison of survey methodologies is presented, followed by an examination of the mathematics frameworks in the two studies. The methodologies and the frameworks in the two studies form the basis for providing…
Descriptors: Mathematics Achievement, Foreign Countries, Gender Differences, Comparative Analysis
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Wang, Wen-Chung – Educational and Psychological Measurement, 2004
The Pearson correlation is used to depict effect sizes in the context of item response theory. Amultidimensional Rasch model is used to directly estimate the correlation between latent traits. Monte Carlo simulations were conducted to investigate whether the population correlation could be accurately estimated and whether the bootstrap method…
Descriptors: Test Length, Sampling, Effect Size, Correlation

Peer reviewed
Direct link
