ERIC - Search Results

Showing all 11 results

Detecting Item Drift in Large-Scale Testing

Peer reviewed

Guo, Hongwen; Robin, Frederic; Dorans, Neil – Journal of Educational Measurement, 2017

The early detection of item drift is an important issue for frequently administered testing programs because items are reused over time. Unfortunately, operational data tend to be very sparse and do not lend themselves to frequent monitoring analyses, particularly for on-demand testing. Building on existing residual analyses, the authors propose…

Descriptors: Testing, Test Items, Identification, Sample Size

What to Do Instead of Significance Testing? Calculating the 'Number of Counterfactual Cases Needed to Disturb a Finding'

Peer reviewed

Gorard, Stephen; Gorard, Jonathan – International Journal of Social Research Methodology, 2016

This brief paper introduces a new approach to assessing the trustworthiness of research comparisons when expressed numerically. The 'number needed to disturb' a research finding would be the number of counterfactual values that can be added to the smallest arm of any comparison before the difference or 'effect' size disappears, minus the number of…

Descriptors: Statistical Significance, Testing, Sampling, Attrition (Research Studies)

The Testing Effect on Skills Learning Might Last 6 Months

Peer reviewed

Kromann, C. B.; Bohnstedt, C.; Jensen, M. L.; Ringsted, C. – Advances in Health Sciences Education, 2010

In a recent study we found that testing as a final activity in a skills course increases the learning outcome compared to spending an equal amount of time practicing. Whether this testing effect measured as skills performance can be demonstrated on long-term basis is not known. The research question was: does testing as a final activity in a…

Descriptors: First Aid, Control Groups, Medical Students, Intervention

Improvements for Differential Functioning of Items and Tests (DFIT): Investigating the Addition of Reporting an Effect Size Measure and Power

Wright, Keith D. – ProQuest LLC, 2011

Standardized testing has been part of the American educational system for decades. Controversy from the beginning has plagued standardized testing, is plaguing testing today, and will continue to be controversial. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…

Descriptors: Test Bias, Test Items, Simulation, Testing

Testing Conceptual Frameworks of Nonexperimental Program Evaluation Designs Using Structural Equation Modeling

Peer reviewed

Adedokun, Omolola A.; Childress, Amy L.; Burgess, Wilella D. – American Journal of Evaluation, 2011

A theory-driven approach to evaluation (TDE) emphasizes the development and empirical testing of conceptual models to understand the processes and mechanisms through which programs achieve their intended goals. However, most reported applications of TDE are limited to large-scale experimental/quasi-experimental program evaluation designs. Very few…

Descriptors: Feedback (Response), Program Evaluation, Structural Equation Models, Testing

A Communication Researchers' Guide to Null Hypothesis Significance Testing and Alternatives

Peer reviewed

Levine, Timothy R.; Weber, Rene; Park, Hee Sun; Hullett, Craig R. – Human Communication Research, 2008

This paper offers a practical guide to using null hypothesis significance testing and its alternatives. The focus is on improving the quality of statistical inference in quantitative communication research. More consistent reporting of descriptive statistics, estimates of effect size, confidence intervals around effect sizes, and increasing the…

Descriptors: Intervals, Communication Research, Testing, Statistical Significance

Reframing Retention: New Evidence from within the Elementary School Classroom on Post-Retention Performance

Peer reviewed

Gottfried, Michael A. – Elementary School Journal, 2012

This study contributes a novel perspective on grade retention by empirically examining how classroom composition relates to the standardized-testing performance of grade-retained students in their post-retained years. This evaluation employed a sample of entire cohorts of urban elementary school children in the Philadelphia School District over 6…

Descriptors: Grade Repetition, School Holding Power, Evidence, Testing

Comparability of Computer-Based and Paper-and-Pencil Testing in K-12 Reading Assessments: A Meta-Analysis of Testing Mode Effects

Peer reviewed

Wang, Shudong; Jiao, Hong; Young, Michael J.; Brooks, Thomas; Olson, John – Educational and Psychological Measurement, 2008

In recent years, computer-based testing (CBT) has grown in popularity, is increasingly being implemented across the United States, and will likely become the primary mode for delivering tests in the future. Although CBT offers many advantages over traditional paper-and-pencil testing, assessment experts, researchers, practitioners, and users have…

Descriptors: Elementary Secondary Education, Reading Achievement, Computer Assisted Testing, Comparative Analysis

Effect Size as an Alternative to Statistical Significance Testing.

McClain, Andrew L. – 1995

The present paper discusses criticisms of statistical significance testing from both historical and contemporary perspectives. Statistical significance testing is greatly influenced by sample size and often results in meaningless information being over-reported. Variance-accounted-for-effect sizes are presented as an alternative to statistical…

Descriptors: Correlation, Effect Size, Research Methodology, Sample Size

Do Effect-Size Measures Measure up?: A Brief Assessment

Peer reviewed

Onwuegbuzie, Anthony J.; Levin, Joel R.; Leech, Nancy L. – Learning Disabilities: A Contemporary Journal, 2003

Because of criticisms leveled at statistical hypothesis testing, some researchers have argued that measures of effect size should replace the significance-testing practice. We contend that although effect-size measures have logical appeal, they are also associated with a number of limitations that may result in problematic interpretations of them…

Descriptors: Intervals, Psychological Studies, Learning Disabilities, Testing

The Concept of Statistical Significance Testing. ERIC/AE Digest.

Thompson, Bruce – 1994

Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…

Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size
