Mang, Julia; Küchenhoff, Helmut; Meinck, Sabine; Prenzel, Manfred – Large-scale Assessments in Education, 2021
Background: Standard methods for analysing data from large-scale assessments (LSA) cannot merely be adopted if hierarchical (or multilevel) regression modelling should be applied. Currently various approaches exist; they all follow generally a design-based model of estimation using the pseudo maximum likelihood method and adjusted weights for the…
Descriptors: Sampling, Hierarchical Linear Modeling, Simulation, Scaling
Sen, Sedat – International Journal of Testing, 2018
Recent research has shown that over-extraction of latent classes can be observed in the Bayesian estimation of the mixed Rasch model when the distribution of ability is non-normal. This study examined the effect of non-normal ability distributions on the number of latent classes in the mixed Rasch model when estimated with maximum likelihood…
Descriptors: Item Response Theory, Comparative Analysis, Computation, Maximum Likelihood Statistics
Rutkowski, Leslie; Rutkowski, David; Zhou, Yan – International Journal of Testing, 2016
Using an empirically-based simulation study, we show that typically used methods of choosing an item calibration sample have significant impacts on achievement bias and system rankings. We examine whether recent PISA accommodations, especially for lower performing participants, can mitigate some of this bias. Our findings indicate that standard…
Descriptors: Simulation, International Programs, Adolescents, Student Evaluation
Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015
The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…
Descriptors: Measurement, Computation, Test Format, Test Items
Rutkowski, Leslie; Zhou, Yan – Journal of Educational Measurement, 2015
Given the importance of large-scale assessments to educational policy conversations, it is critical that subpopulation achievement is estimated reliably and with sufficient precision. Despite this importance, biased subpopulation estimates have been found to occur when variables in the conditioning model side of a latent regression model contain…
Descriptors: Error of Measurement, Error Correction, Regression (Statistics), Computation
Zhang, Zhiyong; Lai, Keke; Lu, Zhenqiu; Tong, Xin – Structural Equation Modeling: A Multidisciplinary Journal, 2013
Despite the widespread popularity of growth curve analysis, few studies have investigated robust growth curve models. In this article, the "t" distribution is applied to model heavy-tailed data and contaminated normal data with outliers for growth curve analysis. The derived robust growth curve models are estimated through Bayesian…
Descriptors: Structural Equation Models, Bayesian Statistics, Statistical Inference, Statistical Distributions
Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2008
Pretest-posttest experimental designs are often used in randomized control trials (RCTs) in the education field to improve the precision of the estimated treatment effects. For logistic reasons, however, pretest data are often collected after random assignment, so that including them in the analysis could bias the posttest impact estimates. Thus,…
Descriptors: Pretests Posttests, Pretesting, Scores, Intervention
Wise, Steven L.; DeMars, Christine E. – Journal of Educational Measurement, 2006
The validity of inferences based on achievement test scores is dependent on the amount of effort that examinees put forth while taking the test. With low-stakes tests, for which this problem is particularly prevalent, there is a consequent need for psychometric models that can take into account differing levels of examinee effort. This article…
Descriptors: Guessing (Tests), Psychometrics, Inferences, Reaction Time

