ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	7

Descriptor

Achievement Tests	8
Computation	8
Simulation	8
Foreign Countries	5
Item Response Theory	4
International Assessment	3
Bayesian Statistics	2
Grade 4	2
Grade 9	2
Hierarchical Linear Modeling	2
Mathematics Achievement	2
Mathematics Tests	2
Maximum Likelihood Statistics	2
Science Tests	2
Scores	2
Secondary School Students	2
Test Bias	2
Test Items	2
Academic Achievement	1
Achievement	1
Adolescents	1
Classification	1
Comparative Analysis	1
Computer Software	1
Error Correction	1
More ▼

Source

International Journal of…	2
Journal of Educational…	2
Educational and Psychological…	1
Large-scale Assessments in…	1
National Center for Education…	1
Structural Equation Modeling:…	1

Publication Type

Journal Articles	7
Reports - Research	7
Reports - Evaluative	1

Education Level

Secondary Education	3
Grade 4	2
Grade 9	2
High Schools	2
Junior High Schools	2
Middle Schools	2
Elementary Education	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 7	1
Grade 8	1
Intermediate Grades	1
More ▼

Audience

Location

Armenia	1
Austria	1
Germany	1
Iran	1
Norway	1
Tunisia	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
National Longitudinal Survey…	1
Peabody Individual…	1
Progress in International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Sampling Weights in Multilevel Modelling: An Investigation Using PISA Sampling Structures

Peer reviewed

Direct link

Mang, Julia; Küchenhoff, Helmut; Meinck, Sabine; Prenzel, Manfred – Large-scale Assessments in Education, 2021

Background: Standard methods for analysing data from large-scale assessments (LSA) cannot merely be adopted if hierarchical (or multilevel) regression modelling should be applied. Currently various approaches exist; they all follow generally a design-based model of estimation using the pseudo maximum likelihood method and adjusted weights for the…

Descriptors: Sampling, Hierarchical Linear Modeling, Simulation, Scaling

Spurious Latent Class Problem in the Mixed Rasch Model: A Comparison of Three Maximum Likelihood Estimation Methods under Different Ability Distributions

Peer reviewed

Direct link

Sen, Sedat – International Journal of Testing, 2018

Recent research has shown that over-extraction of latent classes can be observed in the Bayesian estimation of the mixed Rasch model when the distribution of ability is non-normal. This study examined the effect of non-normal ability distributions on the number of latent classes in the mixed Rasch model when estimated with maximum likelihood…

Descriptors: Item Response Theory, Comparative Analysis, Computation, Maximum Likelihood Statistics

Item Calibration Samples and the Stability of Achievement Estimates and System Rankings: Another Look at the PISA Model

Peer reviewed

Direct link

Rutkowski, Leslie; Rutkowski, David; Zhou, Yan – International Journal of Testing, 2016

Using an empirically-based simulation study, we show that typically used methods of choosing an item calibration sample have significant impacts on achievement bias and system rankings. We examine whether recent PISA accommodations, especially for lower performing participants, can mitigate some of this bias. Our findings indicate that standard…

Descriptors: Simulation, International Programs, Adolescents, Student Evaluation

Effects of Design Properties on Parameter Estimation in Large-Scale Assessments

Peer reviewed

Direct link

Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015

The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…

Descriptors: Measurement, Computation, Test Format, Test Items

Correcting Measurement Error in Latent Regression Covariates via the MC-SIMEX Method

Peer reviewed

Direct link

Rutkowski, Leslie; Zhou, Yan – Journal of Educational Measurement, 2015

Given the importance of large-scale assessments to educational policy conversations, it is critical that subpopulation achievement is estimated reliably and with sufficient precision. Despite this importance, biased subpopulation estimates have been found to occur when variables in the conditioning model side of a latent regression model contain…

Descriptors: Error of Measurement, Error Correction, Regression (Statistics), Computation

Bayesian Inference and Application of Robust Growth Curve Models Using Student's "t" Distribution

Peer reviewed

Direct link

Zhang, Zhiyong; Lai, Keke; Lu, Zhenqiu; Tong, Xin – Structural Equation Modeling: A Multidisciplinary Journal, 2013

Despite the widespread popularity of growth curve analysis, few studies have investigated robust growth curve models. In this article, the "t" distribution is applied to model heavy-tailed data and contaminated normal data with outliers for growth curve analysis. The derived robust growth curve models are estimated through Bayesian…

Descriptors: Structural Equation Models, Bayesian Statistics, Statistical Inference, Statistical Distributions

The Late Pretest Problem in Randomized Control Trials of Education Interventions. NCEE 2009-4033

Peer reviewed
PDF on ERIC

Download full text

Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2008

Pretest-posttest experimental designs are often used in randomized control trials (RCTs) in the education field to improve the precision of the estimated treatment effects. For logistic reasons, however, pretest data are often collected after random assignment, so that including them in the analysis could bias the posttest impact estimates. Thus,…

Descriptors: Pretests Posttests, Pretesting, Scores, Intervention

An Application of Item Response Time: The Effort-Moderated IRT Model

Peer reviewed

Direct link

Wise, Steven L.; DeMars, Christine E. – Journal of Educational Measurement, 2006

The validity of inferences based on achievement test scores is dependent on the amount of effort that examinees put forth while taking the test. With low-stakes tests, for which this problem is particularly prevalent, there is a consequent need for psychometric models that can take into account differing levels of examinee effort. This article…

Descriptors: Guessing (Tests), Psychometrics, Inferences, Reaction Time

Rutkowski, Leslie	2
Zhou, Yan	2
DeMars, Christine E.	1
Frey, Andreas	1
Hecht, Martin	1
Küchenhoff, Helmut	1
Lai, Keke	1
Lu, Zhenqiu	1
Mang, Julia	1
Meinck, Sabine	1
Prenzel, Manfred	1
Rutkowski, David	1
Schochet, Peter Z.	1
Sen, Sedat	1
Siegle, Thilo	1
Tong, Xin	1
Weirich, Sebastian	1
Wise, Steven L.	1
Zhang, Zhiyong	1
More ▼