Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 5 |
Descriptor
| Sample Size | 5 |
| Computation | 3 |
| Equated Scores | 3 |
| Statistical Bias | 3 |
| Accuracy | 2 |
| Data Analysis | 2 |
| Error of Measurement | 2 |
| Statistical Analysis | 2 |
| Bayesian Statistics | 1 |
| Comparative Analysis | 1 |
| Correlation | 1 |
Source
| Educational Testing Service | 5 |
Author
| Puhan, Gautam | 2 |
| Dorans, Neil | 1 |
| Harris, Ian | 1 |
| Jia, Yue | 1 |
| Lewis, Charles | 1 |
| Livingston, Samuel A. | 1 |
| Miao, Jing | 1 |
| Moses, Tim | 1 |
| Ricker, Kathryn L. | 1 |
| Stokes, Lynne | 1 |
| Tan, Xuan | 1 |
Publication Type
| Reports - Research | 4 |
| Reports - Evaluative | 1 |
Education Level
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 4 | 1 |
| Intermediate Grades | 1 |
Assessments and Surveys
| National Assessment of… | 1 |
Puhan, Gautam – Educational Testing Service, 2011
The study evaluated the effectiveness of log-linear presmoothing (Holland & Thayer, 1987) on the accuracy of small sample chained equipercentile equatings under two conditions (i.e., using small samples that differed randomly in ability from the target population "versus" using small samples that were distinctly different from the…
Descriptors: Equated Scores, Data Analysis, Accuracy, Sample Size
Jia, Yue; Stokes, Lynne; Harris, Ian; Wang, Yan – Educational Testing Service, 2011
Estimation of parameters of random effects models from samples collected via complex multistage designs is considered. One way to reduce estimation bias due to unequal probabilities of selection is to incorporate sampling weights. Many researchers have proposed various weighting methods (Korn & Graubard, 2003; Pfeffermann, Skinner,…
Descriptors: Computation, Statistical Bias, Sampling, Statistical Analysis
Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam – Educational Testing Service, 2010
This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…
Descriptors: Equated Scores, Scoring, Responses, Test Items
Moses, Tim; Miao, Jing; Dorans, Neil – Educational Testing Service, 2010
This study compared the accuracies of four differential item functioning (DIF) estimation methods, where each method makes use of only one of the following: raw data, logistic regression, loglinear models, or kernel smoothing. The major focus was on the estimation strategies' potential for estimating score-level, conditional DIF. A secondary focus…
Descriptors: Test Bias, Statistical Analysis, Computation, Scores
Livingston, Samuel A.; Lewis, Charles – Educational Testing Service, 2009
This report proposes an empirical Bayes approach to the problem of equating scores on test forms taken by very small numbers of test takers. The equated score is estimated separately at each score point, making it unnecessary to model either the score distribution or the equating transformation. Prior information comes from equatings of other…
Descriptors: Test Length, Equated Scores, Bayesian Statistics, Sample Size


