NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)1
Since 2007 (last 20 years)8
Audience
Laws, Policies, & Programs
Assessments and Surveys
Stanford Achievement Tests1
What Works Clearinghouse Rating
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Guo, Hongwen; Robin, Frederic; Dorans, Neil – Journal of Educational Measurement, 2017
The early detection of item drift is an important issue for frequently administered testing programs because items are reused over time. Unfortunately, operational data tend to be very sparse and do not lend themselves to frequent monitoring analyses, particularly for on-demand testing. Building on existing residual analyses, the authors propose…
Descriptors: Testing, Test Items, Identification, Sample Size
Peer reviewed Peer reviewed
Direct linkDirect link
Gorard, Stephen; Gorard, Jonathan – International Journal of Social Research Methodology, 2016
This brief paper introduces a new approach to assessing the trustworthiness of research comparisons when expressed numerically. The 'number needed to disturb' a research finding would be the number of counterfactual values that can be added to the smallest arm of any comparison before the difference or 'effect' size disappears, minus the number of…
Descriptors: Statistical Significance, Testing, Sampling, Attrition (Research Studies)
Peer reviewed Peer reviewed
Direct linkDirect link
Kromann, C. B.; Bohnstedt, C.; Jensen, M. L.; Ringsted, C. – Advances in Health Sciences Education, 2010
In a recent study we found that testing as a final activity in a skills course increases the learning outcome compared to spending an equal amount of time practicing. Whether this testing effect measured as skills performance can be demonstrated on long-term basis is not known. The research question was: does testing as a final activity in a…
Descriptors: First Aid, Control Groups, Medical Students, Intervention
Wright, Keith D. – ProQuest LLC, 2011
Standardized testing has been part of the American educational system for decades. Controversy from the beginning has plagued standardized testing, is plaguing testing today, and will continue to be controversial. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…
Descriptors: Test Bias, Test Items, Simulation, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Adedokun, Omolola A.; Childress, Amy L.; Burgess, Wilella D. – American Journal of Evaluation, 2011
A theory-driven approach to evaluation (TDE) emphasizes the development and empirical testing of conceptual models to understand the processes and mechanisms through which programs achieve their intended goals. However, most reported applications of TDE are limited to large-scale experimental/quasi-experimental program evaluation designs. Very few…
Descriptors: Feedback (Response), Program Evaluation, Structural Equation Models, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Levine, Timothy R.; Weber, Rene; Park, Hee Sun; Hullett, Craig R. – Human Communication Research, 2008
This paper offers a practical guide to use null hypotheses significance testing and its alternatives. The focus is on improving the quality of statistical inference in quantitative communication research. More consistent reporting of descriptive statistics, estimates of effect size, confidence intervals around effect sizes, and increasing the…
Descriptors: Intervals, Communication Research, Testing, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
Gottfried, Michael A. – Elementary School Journal, 2012
This study contributes a novel perspective on grade retention by empirically examining how classroom composition relates to the standardized-testing performance of grade-retained students in their post-retained years. This evaluation employed a sample of entire cohorts of urban elementary school children in the Philadelphia School District over 6…
Descriptors: Grade Repetition, School Holding Power, Evidence, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Shudong; Jiao, Hong; Young, Michael J.; Brooks, Thomas; Olson, John – Educational and Psychological Measurement, 2008
In recent years, computer-based testing (CBT) has grown in popularity, is increasingly being implemented across the United States, and will likely become the primary mode for delivering tests in the future. Although CBT offers many advantages over traditional paper-and-pencil testing, assessment experts, researchers, practitioners, and users have…
Descriptors: Elementary Secondary Education, Reading Achievement, Computer Assisted Testing, Comparative Analysis
McClain, Andrew L. – 1995
The present paper discusses criticisms of statistical significance testing from both historical and contemporary perspectives. Statistical significance testing is greatly influenced by sample size and often results in meaningless information being over-reported. Variance-accounted-for-effect sizes are presented as an alternative to statistical…
Descriptors: Correlation, Effect Size, Research Methodology, Sample Size
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Onwuegbuzie, Anthony J.; Levin, Joel R.; Leech, Nancy L. – Learning Disabilities: A Contemporary Journal, 2003
Because of criticisms leveled at statistical hypothesis testing, some researchers have argued that measures of effect size should replace the significance-testing practice. We contend that although effect-size measures have logical appeal, they are also associated with a number of limitations that may result in problematic interpretations of them…
Descriptors: Intervals, Psychological Studies, Learning Disabilities, Testing
Thompson, Bruce – 1994
Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…
Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size