ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	11

Descriptor

Computation	12
Models	12
Statistical Significance	12
Statistical Analysis	7
Simulation	6
Regression (Statistics)	5
Academic Achievement	4
Equations (Mathematics)	3
Error of Measurement	3
Research Design	3
Research Methodology	3
Bayesian Statistics	2
Comparative Analysis	2
Control Groups	2
Correlation	2
Data Analysis	2
Error Patterns	2
Experiments	2
Foreign Countries	2
Intervention	2
Item Response Theory	2
Prediction	2
Statistical Bias	2
Teacher Effectiveness	2
Test Bias	2
More ▼

Source

Journal of Educational and…	3
Advances in Physiology…	1
Asia Pacific Journal of…	1
Educational and Psychological…	1
Journal of Experimental…	1
Journal of Research on…	1
Mathematica Policy Research,…	1
National Center for Education…	1
National Center for Education…	1
Society for Research on…	1

Publication Type

Journal Articles	8
Reports - Research	7
Reports - Evaluative	3
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1

Education Level

Elementary Education	3
Middle Schools	3
Higher Education	2
Grade 5	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Secondary Education	1

Audience

Location

New Zealand	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Assessing the "Rothstein Falsification Test": Does It Really Show Teacher Value-Added Models Are Biased?

Peer reviewed

Direct link

Goldhaber, Dan; Chaplin, Duncan Dunbar – Journal of Research on Educational Effectiveness, 2015

In an influential paper, Jesse Rothstein (2010) shows that standard value-added models (VAMs) suggest implausible and large future teacher effects on past student achievement. This is the basis of a falsification test that "appears" to indicate bias in typical VAM estimates of teacher contributions to student learning on standardized…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Influence, Models

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

School Composition and the Black-White Achievement Gap: Methodology Companion. NCES 2015-032

Peer reviewed
PDF on ERIC

Download full text

Bohrnstedt, G.; Kitmitto, S.; Ogut, B.; Sherman, D.; Chan, D. – National Center for Education Statistics, 2015

The School Composition and the Black-White Achievement Gap study was undertaken by the National Center for Education Statistics to present both descriptive and associative information on the relationships among the percentage of students in a school who were Black (referred to as "Black student density" or "density"), the…

Descriptors: School Demography, Racial Composition, Achievement Gap, African American Students

Different Tests for a Difference: How Do We Do Research?

Peer reviewed

Direct link

Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012

Most biological scientists conduct experiments to look for effects, and test the results statistically. One of the commonly used test is Student's t test. However, this test concentrates on a very limited question. The authors assume that there is no effect in the experiment, and then estimate the possibility that they could have obtained these…

Descriptors: Statistical Significance, Scientists, Tests, Biology

Alternatives for Mixed-Effects Meta-Regression Models in the Reliability Generalization Approach: A Simulation Study

Peer reviewed

Direct link

López-López, José Antonio; Botella, Juan; Sánchez-Meca, Julio; Marín-Martínez, Fulgencio – Journal of Educational and Behavioral Statistics, 2013

Since heterogeneity between reliability coefficients is usually found in reliability generalization studies, moderator analyses constitute a crucial step for that meta-analytic approach. In this study, different procedures for conducting mixed-effects meta-regression analyses were compared. Specifically, four transformation methods for the…

Descriptors: Reliability, Generalization, Meta Analysis, Regression (Statistics)

Estimating Cross-Site Impact Variation in the Presence of Heteroscedasticity

Peer reviewed
PDF on ERIC

Download full text

Bloom, Howard S.; Porter, Kristin E.; Weiss, Michael J.; Raudenbush, Stephen – Society for Research on Educational Effectiveness, 2013

To date, evaluation research and policy analysis have focused mainly on average program impacts and paid little systematic attention to their variation. Recently, the growing number of multi-site randomized trials that are being planned and conducted make it increasingly feasible to study "cross-site" variation in impacts. Important…

Descriptors: Research Methodology, Policy, Evaluation Research, Randomized Controlled Trials

Assessing the Rothstein Test: Does It Really Show Teacher Value-Added Models Are Biased? Working Paper 5

Download full text

Goldhaber, Dan; Chaplin, Duncan – Mathematica Policy Research, Inc., 2012

In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value-added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…

Descriptors: Value Added Models, Academic Achievement, Teacher Effectiveness, Correlation

University Admission Models that Address Quality and Equity

Peer reviewed

Direct link

Turner, Rolf; Shulruf, Boaz; Li, Meisong; Yuan, Johnson – Asia Pacific Journal of Education, 2012

University entrance criteria can be a contentious topic, particularly in respect of equity. In this paper we discuss studies which demonstrate that revisions of entrance criteria which are designed with no explicit reference to equity issues can have a surprisingly positive impact on the fractions of disadvantaged subgroups admitted. We…

Descriptors: Admission Criteria, Statistical Significance, Foreign Countries, Equal Education

Power Analysis in Two-Level Unbalanced Designs

Peer reviewed

Direct link

Konstantopoulos, Spyros – Journal of Experimental Education, 2010

Previous work on statistical power has discussed mainly single-level designs or 2-level balanced designs with random effects. Although balanced experiments are common, in practice balance cannot always be achieved. Work on class size is one example of unbalanced designs. This study provides methods for power analysis in 2-level unbalanced designs…

Descriptors: Class Size, Computers, Statistical Analysis, Experiments

Correcting a Significance Test for Clustering

Peer reviewed

Direct link

Hedges, Larry V. – Journal of Educational and Behavioral Statistics, 2007

A common mistake in analysis of cluster randomized trials is to ignore the effect of clustering and analyze the data as if each treatment group were a simple random sample. This typically leads to an overstatement of the precision of results and anticonservative conclusions about precision and statistical significance of treatment effects. This…

Descriptors: Statistical Significance, Computation, Cluster Grouping, Statistics

Technical Methods Report: The Estimation of Average Treatment Effects for Clustered RCTs of Education Interventions. NCEE 2009-0061 rev.

Peer reviewed
PDF on ERIC

Download full text

Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2009

This paper examines the estimation of two-stage clustered RCT designs in education research using the Neyman causal inference framework that underlies experiments. The key distinction between the considered causal models is whether potential treatment and control group outcomes are considered to be fixed for the study population (the…

Descriptors: Control Groups, Causal Models, Statistical Significance, Computation

On Chi-Square Difference and z Tests in Mean and Covariance Structure Analysis When the Base Model is Misspecified

Peer reviewed

Direct link

Yuan, Ke-Hai; Bentler, Peter M. – Educational and Psychological Measurement, 2004

In mean and covariance structure analysis, the chi-square difference test is often applied to evaluate the number of factors, cross-group constraints, and other nested model comparisons. Let model M[a] be the base model within which model M[b] is nested. In practice, this test is commonly used to justify M[b] even when M[a] is misspecified. The…

Descriptors: Statistical Significance, Item Response Theory, Computation, Statistical Analysis

Goldhaber, Dan	2
Bentler, Peter M.	1
Bloom, Howard S.	1
Bohrnstedt, G.	1
Botella, Juan	1
Chan, D.	1
Chaplin, Duncan	1
Chaplin, Duncan Dunbar	1
Drummond, Gordon B.	1
Hedges, Larry V.	1
Kitmitto, S.	1
Konstantopoulos, Spyros	1
Li, Meisong	1
López-López, José Antonio	1
Marín-Martínez, Fulgencio	1
Ogut, B.	1
Porter, Kristin E.	1
Raudenbush, Stephen	1
Schochet, Peter Z.	1
Sherman, D.	1
Shulruf, Boaz	1
Sinharay, Sandip	1
Sánchez-Meca, Julio	1
Turner, Rolf	1
Vowler, Sarah L.	1
More ▼