Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 3 |
Descriptor
| Accuracy | 3 |
| Simulation | 3 |
| Comparative Analysis | 2 |
| Bias | 1 |
| Causal Models | 1 |
| Classification | 1 |
| Computation | 1 |
| English | 1 |
| Equated Scores | 1 |
| Error of Measurement | 1 |
| Influences | 1 |
| More ▼ | |
Author
| Bloom, Howard S. | 1 |
| Chen, Hanwei | 1 |
| Cui, Zhongmin | 1 |
| DeCarlo, Lawrence T. | 1 |
| Fang, Yu | 1 |
| Porter, Kristin E. | 1 |
| Reardon, Sean F. | 1 |
| Robinson-Cimpian, Joseph P. | 1 |
| Topczewski, Anna | 1 |
| Unlu, Fatih | 1 |
| Woodruff, David | 1 |
| More ▼ | |
Publication Type
| Numerical/Quantitative Data | 3 |
| Reports - Research | 3 |
| Journal Articles | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Robinson-Cimpian, Joseph P. – MDRC, 2014
A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…
Descriptors: Regression (Statistics), Research Design, Quasiexperimental Design, Research Methodology
Topczewski, Anna; Cui, Zhongmin; Woodruff, David; Chen, Hanwei; Fang, Yu – ACT, Inc., 2013
This paper investigates four methods of linear equating under the common item nonequivalent groups design. Three of the methods are well known: Tucker, Angoff-Levine, and Congeneric-Levine. A fourth method is presented as a variant of the Congeneric-Levine method. Using simulation data generated from the three-parameter logistic IRT model we…
Descriptors: Comparative Analysis, Equated Scores, Methods, Simulation
DeCarlo, Lawrence T. – ETS Research Report Series, 2008
Rater behavior in essay grading can be viewed as a signal-detection task, in that raters attempt to discriminate between latent classes of essays, with the latent classes being defined by a scoring rubric. The present report examines basic aspects of an approach to constructed-response (CR) scoring via a latent-class signal-detection model. The…
Descriptors: Scoring, Responses, Test Format, Bias

Peer reviewed
