ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	6

Descriptor

Hypothesis Testing	12
Statistical Significance	12
Testing	12
Statistical Analysis	8
Data Analysis	3
Effect Size	3
Research Methodology	3
Sampling	3
Academic Achievement	2
Correlation	2
Decision Making	2
Evaluation Methods	2
Foreign Countries	2
Intervals	2
Matrices	2
Probability	2
Researchers	2
Test Results	2
Adults	1
Bayesian Statistics	1
Bulletins	1
Cheating	1
Children	1
Classification	1
Community Influence	1

Source

Cognitive Research:…	1
Educational and Psychological…	1
Journal of International…	1
Learning Disabilities: A…	1
Multivariate Behavioral…	1
National Center for Education…	1
Practical Assessment,…	1
ProQuest LLC	1

Publication Type

Journal Articles	5
Reports - Research	4
Reports - Evaluative	3
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
ERIC Publications	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Location

Canada	1
Nigeria	1

Showing all 12 results

The Psychological Reality of the Learned "P < .05" Boundary

Peer reviewed

V. N. Vimal Rao; Jeffrey K. Bye; Sashank Varma – Cognitive Research: Principles and Implications, 2024

The 0.05 boundary within Null Hypothesis Statistical Testing (NHST) "has made a lot of people very angry and been widely regarded as a bad move" (to quote Douglas Adams). Here, we move past meta-scientific arguments and ask an empirical question: What is the psychological standing of the 0.05 boundary for statistical significance? We…

Descriptors: Psychological Patterns, Statistical Analysis, Testing, Statistical Significance

Factorial Invariance in Multiple Populations: A Multiple Testing Procedure

Peer reviewed

Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013

A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…

Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance
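The truncated abstract above does not say which false discovery rate procedure the authors use. As a rough illustration of the FDR idea only, the following sketch applies the Benjamini-Hochberg step-up rule, a common FDR procedure assumed here for the example; the function name and the level `q` are hypothetical, not taken from the article.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans: True where the hypothesis is rejected at FDR level q.

    Illustrative sketch of the Benjamini-Hochberg step-up rule (an assumption;
    the article's own procedure is not shown in the abstract).
    """
    m = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k (1-based) with p_(k) <= (k / m) * q.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            k_max = rank
    # Reject the k_max hypotheses with the smallest p-values.
    rejected = [False] * m
    for idx in order[:k_max]:
        rejected[idx] = True
    return rejected
```

Because the rule is step-up, a p-value that misses its own threshold can still be rejected when a larger p-value meets a later threshold, as with `benjamini_hochberg([0.02, 0.03, 0.04])`.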

Improvements for Differential Functioning of Items and Tests (DFIT): Investigating the Addition of Reporting an Effect Size Measure and Power

Wright, Keith D. – ProQuest LLC, 2011

Standardized testing has been part of the American educational system for decades. Controversy from the beginning has plagued standardized testing, is plaguing testing today, and will continue to be controversial. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…

Descriptors: Test Bias, Test Items, Simulation, Testing

Applying Tests of Equivalence for Multiple Group Comparisons: Demonstration of the Confidence Interval Approach

Peer reviewed

Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011

Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…

Descriptors: Medical Education, Evaluators, Intervals, Testing
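The point in the abstract above, that a non-significant difference does not by itself show comparability, can be illustrated with a confidence-interval check for equivalence. This is a minimal sketch, not the article's procedure: it assumes two independent samples, a large-sample normal approximation, and a hypothetical equivalence margin `margin` chosen by the analyst.

```python
from math import sqrt
from statistics import NormalDist, mean, variance

def equivalence_by_ci(a, b, margin, alpha=0.05):
    """Return ((lower, upper), equivalent) for the difference in means of a and b.

    Groups are declared equivalent only if the whole (1 - 2*alpha) confidence
    interval for the mean difference lies inside (-margin, +margin).
    Assumes a normal approximation; `margin` is a hypothetical analyst choice.
    """
    diff = mean(a) - mean(b)
    # Standard error of the difference between two independent sample means.
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    # One-sided critical value; alpha = 0.05 gives a 90% two-sided interval.
    z = NormalDist().inv_cdf(1 - alpha)
    lower, upper = diff - z * se, diff + z * se
    return (lower, upper), (-margin < lower and upper < margin)
```

Under this rule a wide interval (for example, from small samples) fails to establish equivalence even when the observed difference is near zero, which is the distinction a non-significant NHST result misses.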

Influence of Host Community on Industrial Relations Practices and Policies: A Survey of Agbara Community and Power Holding Company of Nigeria (PHCN)

Peer reviewed

Chidi, Christopher O.; Shadare, Oluseyi A. – Journal of International Education Research, 2011

This study investigated the influence of host community on industrial relations practices and policies using Agbara community and Power Holding Company of Nigeria PLC as a case. The study adopted both the qualitative and quantitative methods. A total of 120 samples were drawn from the population using the simple random sampling technique in which…

Descriptors: Testing, Social Sciences, Foreign Countries, Sampling

Technical Methods Report: Guidelines for Multiple Testing in Impact Evaluations. NCEE 2008-4018

Peer reviewed

Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2008

This report presents guidelines for addressing the multiple comparisons problem in impact evaluations in the education area. The problem occurs due to the large number of hypothesis tests that are typically conducted across outcomes and subgroups in these studies, which can lead to spurious statistically significant impact findings. The…

Descriptors: Guidelines, Testing, Hypothesis Testing, Statistical Significance
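To see why many hypothesis tests can produce spurious significant findings, the arithmetic can be sketched as follows. The sketch assumes the tests are independent and every null hypothesis is true, which is a simplification, and it shows the Bonferroni adjustment only as one standard correction; the truncated abstract does not state which adjustments the report recommends. Function names are hypothetical.

```python
def familywise_error_rate(m, alpha=0.05):
    """Probability of at least one false positive across m independent tests,
    each run at level alpha, when every null hypothesis is true."""
    return 1 - (1 - alpha) ** m

def bonferroni_alpha(m, alpha=0.05):
    """Per-test significance level that keeps the familywise rate at or below alpha."""
    return alpha / m

if __name__ == "__main__":
    for m in (1, 5, 10, 20):
        print(m, round(familywise_error_rate(m), 3), bonferroni_alpha(m))
```

With ten tests at the 0.05 level, the chance of at least one false positive under these assumptions is roughly 0.40, which is why impact evaluations with many outcomes and subgroups need an explicit multiple-testing strategy.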

A Statistical Test for Cheating.

Lord, Frederic M. – 1974

A statistical test for cheating is developed. The case of a single examinee who has taken parallel forms of the same selection test on three occasions, obtaining scores x, y, z, is used to illustrate the development. It is assumed that each score is normally distributed with the same known variance, that is, the variance of the errors of…

Descriptors: Cheating, Hypothesis Testing, Statistical Analysis, Statistical Significance

The Sampling Distribution and a Test for the Significance of the Bimultivariate Redundancy Statistic: A Monte Carlo Study

Peer reviewed

Miller, John K. – Multivariate Behavioral Research, 1975

Descriptors: Correlation, Goodness of Fit, Hypothesis Testing, Matrices

Automated Hypothesis Tests and Standard Errors for Nonstandard Problems with Description of Computer Package: A Draft.

Lord, Frederic M.; Stocking, Martha – 1972

A general computer program is described that will compute asymptotic standard errors and carry out significance tests for an endless variety of (standard and) nonstandard large-sample statistical problems, without requiring the statistician to derive asymptotic standard error formulas. The program assumes that the observations have a multinormal…

Descriptors: Bulletins, Computer Programs, Data Processing, Error of Measurement

Do Effect-Size Measures Measure up?: A Brief Assessment

Peer reviewed

Onwuegbuzie, Anthony J.; Levin, Joel R.; Leech, Nancy L. – Learning Disabilities: A Contemporary Journal, 2003

Because of criticisms leveled at statistical hypothesis testing, some researchers have argued that measures of effect size should replace the significance-testing practice. We contend that although effect-size measures have logical appeal, they are also associated with a number of limitations that may result in problematic interpretations of them…

Descriptors: Intervals, Psychological Studies, Learning Disabilities, Testing

The Concept of Statistical Significance Testing. ERIC/AE Digest.

Thompson, Bruce – 1994

Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…

Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size

Some Nonparametric Approaches to the Use of Criterion-Referenced Statewide Test Results in the Evaluation of Local District Educational Programs.

Ascher, Gordon – 1975

The increased use of criterion-referenced statewide testing programs is an outgrowth of the need for more diagnostic information for planning and decision making than is provided by norm-referenced programs. There remains, however, a need for state agencies to compare the results of local districts to a variety of comparison groups for the purpose…

Descriptors: Academic Achievement, Comparative Testing, Correlation, Criterion Referenced Tests

Author

Lord, Frederic M.	2
Ascher, Gordon	1
Chidi, Christopher O.	1
Jeffrey K. Bye	1
Leech, Nancy L.	1
Levin, Joel R.	1
Lovato, Chris Y.	1
Marcoulides, George A.	1
Miller, John K.	1
Millsap, Roger E.	1
Onwuegbuzie, Anthony J.	1
Raykov, Tenko	1
Rusticus, Shayna A.	1
Sashank Varma	1
Schochet, Peter Z.	1
Shadare, Oluseyi A.	1
Stocking, Martha	1
Thompson, Bruce	1
V. N. Vimal Rao	1
Wright, Keith D.	1