Guo, Hongwen; Robin, Frederic; Dorans, Neil – Journal of Educational Measurement, 2017
The early detection of item drift is an important issue for frequently administered testing programs because items are reused over time. Unfortunately, operational data tend to be very sparse and do not lend themselves to frequent monitoring analyses, particularly for on-demand testing. Building on existing residual analyses, the authors propose…
Descriptors: Testing, Test Items, Identification, Sample Size
Gorard, Stephen; Gorard, Jonathan – International Journal of Social Research Methodology, 2016
This brief paper introduces a new approach to assessing the trustworthiness of research comparisons when expressed numerically. The 'number needed to disturb' a research finding would be the number of counterfactual values that can be added to the smallest arm of any comparison before the difference or 'effect' size disappears, minus the number of…
Descriptors: Statistical Significance, Testing, Sampling, Attrition (Research Studies)
Kromann, C. B.; Bohnstedt, C.; Jensen, M. L.; Ringsted, C. – Advances in Health Sciences Education, 2010
In a recent study we found that testing as a final activity in a skills course increases the learning outcome compared to spending an equal amount of time practicing. Whether this testing effect measured as skills performance can be demonstrated on long-term basis is not known. The research question was: does testing as a final activity in a…
Descriptors: First Aid, Control Groups, Medical Students, Intervention
Wright, Keith D. – ProQuest LLC, 2011
Standardized testing has been part of the American educational system for decades. Controversy from the beginning has plagued standardized testing, is plaguing testing today, and will continue to be controversial. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…
Descriptors: Test Bias, Test Items, Simulation, Testing
Adedokun, Omolola A.; Childress, Amy L.; Burgess, Wilella D. – American Journal of Evaluation, 2011
A theory-driven approach to evaluation (TDE) emphasizes the development and empirical testing of conceptual models to understand the processes and mechanisms through which programs achieve their intended goals. However, most reported applications of TDE are limited to large-scale experimental/quasi-experimental program evaluation designs. Very few…
Descriptors: Feedback (Response), Program Evaluation, Structural Equation Models, Testing
Levine, Timothy R.; Weber, Rene; Park, Hee Sun; Hullett, Craig R. – Human Communication Research, 2008
This paper offers a practical guide to use null hypotheses significance testing and its alternatives. The focus is on improving the quality of statistical inference in quantitative communication research. More consistent reporting of descriptive statistics, estimates of effect size, confidence intervals around effect sizes, and increasing the…
Descriptors: Intervals, Communication Research, Testing, Statistical Significance
Gottfried, Michael A. – Elementary School Journal, 2012
This study contributes a novel perspective on grade retention by empirically examining how classroom composition relates to the standardized-testing performance of grade-retained students in their post-retained years. This evaluation employed a sample of entire cohorts of urban elementary school children in the Philadelphia School District over 6…
Descriptors: Grade Repetition, School Holding Power, Evidence, Testing
Wang, Shudong; Jiao, Hong; Young, Michael J.; Brooks, Thomas; Olson, John – Educational and Psychological Measurement, 2008
In recent years, computer-based testing (CBT) has grown in popularity, is increasingly being implemented across the United States, and will likely become the primary mode for delivering tests in the future. Although CBT offers many advantages over traditional paper-and-pencil testing, assessment experts, researchers, practitioners, and users have…
Descriptors: Elementary Secondary Education, Reading Achievement, Computer Assisted Testing, Comparative Analysis
McClain, Andrew L. – 1995
The present paper discusses criticisms of statistical significance testing from both historical and contemporary perspectives. Statistical significance testing is greatly influenced by sample size and often results in meaningless information being over-reported. Variance-accounted-for-effect sizes are presented as an alternative to statistical…
Descriptors: Correlation, Effect Size, Research Methodology, Sample Size
Onwuegbuzie, Anthony J.; Levin, Joel R.; Leech, Nancy L. – Learning Disabilities: A Contemporary Journal, 2003
Because of criticisms leveled at statistical hypothesis testing, some researchers have argued that measures of effect size should replace the significance-testing practice. We contend that although effect-size measures have logical appeal, they are also associated with a number of limitations that may result in problematic interpretations of them…
Descriptors: Intervals, Psychological Studies, Learning Disabilities, Testing
Thompson, Bruce – 1994
Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…
Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size

