NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)0
Since 2007 (last 20 years)11
Audience
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of…1
What Works Clearinghouse Rating
Showing all 12 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Goldhaber, Dan; Chaplin, Duncan Dunbar – Journal of Research on Educational Effectiveness, 2015
In an influential paper, Jesse Rothstein (2010) shows that standard value-added models (VAMs) suggest implausible and large future teacher effects on past student achievement. This is the basis of a falsification test that "appears" to indicate bias in typical VAM estimates of teacher contributions to student learning on standardized…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Influence, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Bohrnstedt, G.; Kitmitto, S.; Ogut, B.; Sherman, D.; Chan, D. – National Center for Education Statistics, 2015
The School Composition and the Black-White Achievement Gap study was undertaken by the National Center for Education Statistics to present both descriptive and associative information on the relationships among the percentage of students in a school who were Black (referred to as "Black student density" or "density"), the…
Descriptors: School Demography, Racial Composition, Achievement Gap, African American Students
Peer reviewed Peer reviewed
Direct linkDirect link
Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012
Most biological scientists conduct experiments to look for effects, and test the results statistically. One of the commonly used test is Student's t test. However, this test concentrates on a very limited question. The authors assume that there is no effect in the experiment, and then estimate the possibility that they could have obtained these…
Descriptors: Statistical Significance, Scientists, Tests, Biology
Peer reviewed Peer reviewed
Direct linkDirect link
López-López, José Antonio; Botella, Juan; Sánchez-Meca, Julio; Marín-Martínez, Fulgencio – Journal of Educational and Behavioral Statistics, 2013
Since heterogeneity between reliability coefficients is usually found in reliability generalization studies, moderator analyses constitute a crucial step for that meta-analytic approach. In this study, different procedures for conducting mixed-effects meta-regression analyses were compared. Specifically, four transformation methods for the…
Descriptors: Reliability, Generalization, Meta Analysis, Regression (Statistics)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Bloom, Howard S.; Porter, Kristin E.; Weiss, Michael J.; Raudenbush, Stephen – Society for Research on Educational Effectiveness, 2013
To date, evaluation research and policy analysis have focused mainly on average program impacts and paid little systematic attention to their variation. Recently, the growing number of multi-site randomized trials that are being planned and conducted make it increasingly feasible to study "cross-site" variation in impacts. Important…
Descriptors: Research Methodology, Policy, Evaluation Research, Randomized Controlled Trials
Goldhaber, Dan; Chaplin, Duncan – Mathematica Policy Research, Inc., 2012
In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value-added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…
Descriptors: Value Added Models, Academic Achievement, Teacher Effectiveness, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
Turner, Rolf; Shulruf, Boaz; Li, Meisong; Yuan, Johnson – Asia Pacific Journal of Education, 2012
University entrance criteria can be a contentious topic, particularly in respect of equity. In this paper we discuss studies which demonstrate that revisions of entrance criteria which are designed with no explicit reference to equity issues can have a surprisingly positive impact on the fractions of disadvantaged subgroups admitted. We…
Descriptors: Admission Criteria, Statistical Significance, Foreign Countries, Equal Education
Peer reviewed Peer reviewed
Direct linkDirect link
Konstantopoulos, Spyros – Journal of Experimental Education, 2010
Previous work on statistical power has discussed mainly single-level designs or 2-level balanced designs with random effects. Although balanced experiments are common, in practice balance cannot always be achieved. Work on class size is one example of unbalanced designs. This study provides methods for power analysis in 2-level unbalanced designs…
Descriptors: Class Size, Computers, Statistical Analysis, Experiments
Peer reviewed Peer reviewed
Direct linkDirect link
Hedges, Larry V. – Journal of Educational and Behavioral Statistics, 2007
A common mistake in analysis of cluster randomized trials is to ignore the effect of clustering and analyze the data as if each treatment group were a simple random sample. This typically leads to an overstatement of the precision of results and anticonservative conclusions about precision and statistical significance of treatment effects. This…
Descriptors: Statistical Significance, Computation, Cluster Grouping, Statistics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2009
This paper examines the estimation of two-stage clustered RCT designs in education research using the Neyman causal inference framework that underlies experiments. The key distinction between the considered causal models is whether potential treatment and control group outcomes are considered to be fixed for the study population (the…
Descriptors: Control Groups, Causal Models, Statistical Significance, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Yuan, Ke-Hai; Bentler, Peter M. – Educational and Psychological Measurement, 2004
In mean and covariance structure analysis, the chi-square difference test is often applied to evaluate the number of factors, cross-group constraints, and other nested model comparisons. Let model M[a] be the base model within which model M[b] is nested. In practice, this test is commonly used to justify M[b] even when M[a] is misspecified. The…
Descriptors: Statistical Significance, Item Response Theory, Computation, Statistical Analysis