NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 6 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Keselman, H. J.; Miller, Charles W.; Holland, Burt – Psychological Methods, 2011
There have been many discussions of how Type I errors should be controlled when many hypotheses are tested (e.g., all possible comparisons of means, correlations, proportions, the coefficients in hierarchical models, etc.). By and large, researchers have adopted familywise (FWER) control, though this practice certainly is not universal. Familywise…
Descriptors: Validity, Statistical Significance, Probability, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011
Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…
Descriptors: Medical Education, Evaluators, Intervals, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Adedokun, Omolola A.; Childress, Amy L.; Burgess, Wilella D. – American Journal of Evaluation, 2011
A theory-driven approach to evaluation (TDE) emphasizes the development and empirical testing of conceptual models to understand the processes and mechanisms through which programs achieve their intended goals. However, most reported applications of TDE are limited to large-scale experimental/quasi-experimental program evaluation designs. Very few…
Descriptors: Feedback (Response), Program Evaluation, Structural Equation Models, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Gottfried, Michael A. – Elementary School Journal, 2012
This study contributes a novel perspective on grade retention by empirically examining how classroom composition relates to the standardized-testing performance of grade-retained students in their post-retained years. This evaluation employed a sample of entire cohorts of urban elementary school children in the Philadelphia School District over 6…
Descriptors: Grade Repetition, School Holding Power, Evidence, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Shudong; Jiao, Hong; Young, Michael J.; Brooks, Thomas; Olson, John – Educational and Psychological Measurement, 2008
In recent years, computer-based testing (CBT) has grown in popularity, is increasingly being implemented across the United States, and will likely become the primary mode for delivering tests in the future. Although CBT offers many advantages over traditional paper-and-pencil testing, assessment experts, researchers, practitioners, and users have…
Descriptors: Elementary Secondary Education, Reading Achievement, Computer Assisted Testing, Comparative Analysis
Peer reviewed Peer reviewed
Zimmerman, Donald W.; Zumbo, Bruno D. – Educational and Psychological Measurement, 1993
A computer simulation compared significance tests of correlation coefficients calculated from initial scores, from ranks assigned by the Spearman method, and from three kinds of modified ranks. Implications of findings for the idea that rank correlation is a nonparametric correlation method are discussed. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Correlation, Nonparametric Statistics