ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	5

Descriptor

Comparative Analysis	6
Statistical Significance	6
Testing	6
Effect Size	3
Academic Achievement	2
Correlation	2
Evaluation	2
Predictor Variables	2
Reading Achievement	2
Regression (Statistics)	2
Researchers	2
Sample Size	2
Scores	2
Structural Equation Models	2
Academic Failure	1
Achievement Gap	1
Communication Skills	1
Computation	1
Computer Assisted Testing	1
Computer Simulation	1
Educational Experience	1
Elementary School Students	1
Elementary Secondary Education	1
Entrepreneurship	1
Error Correction	1
More ▼

Source

Educational and Psychological…	2
American Journal of Evaluation	1
Elementary School Journal	1
Practical Assessment,…	1
Psychological Methods	1

Author

Adedokun, Omolola A.	1
Brooks, Thomas	1
Burgess, Wilella D.	1
Childress, Amy L.	1
Gottfried, Michael A.	1
Holland, Burt	1
Jiao, Hong	1
Keselman, H. J.	1
Lovato, Chris Y.	1
Miller, Charles W.	1
Olson, John	1
Rusticus, Shayna A.	1
Wang, Shudong	1
Young, Michael J.	1
Zimmerman, Donald W.	1
Zumbo, Bruno D.	1
More ▼

Publication Type

Journal Articles	6
Reports - Evaluative	3
Reports - Research	3

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 2	1
Grade 3	1
Grade 4	1

Audience

Location

Canada	1
Indiana	1
Pennsylvania	1

Laws, Policies, & Programs

Assessments and Surveys

Stanford Achievement Tests

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Many Tests of Significance: New Methods for Controlling Type I Errors

Peer reviewed

Direct link

Keselman, H. J.; Miller, Charles W.; Holland, Burt – Psychological Methods, 2011

There have been many discussions of how Type I errors should be controlled when many hypotheses are tested (e.g., all possible comparisons of means, correlations, proportions, the coefficients in hierarchical models, etc.). By and large, researchers have adopted familywise (FWER) control, though this practice certainly is not universal. Familywise…

Descriptors: Validity, Statistical Significance, Probability, Computation

Applying Tests of Equivalence for Multiple Group Comparisons: Demonstration of the Confidence Interval Approach

Peer reviewed

Direct link

Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011

Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…

Descriptors: Medical Education, Evaluators, Intervals, Testing

Testing Conceptual Frameworks of Nonexperimental Program Evaluation Designs Using Structural Equation Modeling

Peer reviewed

Direct link

Adedokun, Omolola A.; Childress, Amy L.; Burgess, Wilella D. – American Journal of Evaluation, 2011

A theory-driven approach to evaluation (TDE) emphasizes the development and empirical testing of conceptual models to understand the processes and mechanisms through which programs achieve their intended goals. However, most reported applications of TDE are limited to large-scale experimental/quasi-experimental program evaluation designs. Very few…

Descriptors: Feedback (Response), Program Evaluation, Structural Equation Models, Testing

Reframing Retention: New Evidence from within the Elementary School Classroom on Post-Retention Performance

Peer reviewed

Direct link

Gottfried, Michael A. – Elementary School Journal, 2012

This study contributes a novel perspective on grade retention by empirically examining how classroom composition relates to the standardized-testing performance of grade-retained students in their post-retained years. This evaluation employed a sample of entire cohorts of urban elementary school children in the Philadelphia School District over 6…

Descriptors: Grade Repetition, School Holding Power, Evidence, Testing

Comparability of Computer-Based and Paper-and-Pencil Testing in K-12 Reading Assessments: A Meta-Analysis of Testing Mode Effects

Peer reviewed

Direct link

Wang, Shudong; Jiao, Hong; Young, Michael J.; Brooks, Thomas; Olson, John – Educational and Psychological Measurement, 2008

In recent years, computer-based testing (CBT) has grown in popularity, is increasingly being implemented across the United States, and will likely become the primary mode for delivering tests in the future. Although CBT offers many advantages over traditional paper-and-pencil testing, assessment experts, researchers, practitioners, and users have…

Descriptors: Elementary Secondary Education, Reading Achievement, Computer Assisted Testing, Comparative Analysis

Significance Testing of Correlation Using Scores, Ranks, and Modified Ranks.

Peer reviewed

Zimmerman, Donald W.; Zumbo, Bruno D. – Educational and Psychological Measurement, 1993

A computer simulation compared significance tests of correlation coefficients calculated from initial scores, from ranks assigned by the Spearman method, and from three kinds of modified ranks. Implications of findings for the idea that rank correlation is a nonparametric correlation method are discussed. (SLD)

Descriptors: Comparative Analysis, Computer Simulation, Correlation, Nonparametric Statistics