Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Hypothesis Testing | 12 |
| Statistical Significance | 12 |
| Testing | 12 |
| Statistical Analysis | 8 |
| Data Analysis | 3 |
| Effect Size | 3 |
| Research Methodology | 3 |
| Sampling | 3 |
| Academic Achievement | 2 |
| Correlation | 2 |
| Decision Making | 2 |
| More ▼ | |
Source
| Cognitive Research:… | 1 |
| Educational and Psychological… | 1 |
| Journal of International… | 1 |
| Learning Disabilities: A… | 1 |
| Multivariate Behavioral… | 1 |
| National Center for Education… | 1 |
| Practical Assessment,… | 1 |
| ProQuest LLC | 1 |
Author
Publication Type
| Journal Articles | 5 |
| Reports - Research | 4 |
| Reports - Evaluative | 3 |
| Dissertations/Theses -… | 1 |
| ERIC Digests in Full Text | 1 |
| ERIC Publications | 1 |
| Reports - Descriptive | 1 |
Education Level
| Elementary Secondary Education | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
V. N. Vimal Rao; Jeffrey K. Bye; Sashank Varma – Cognitive Research: Principles and Implications, 2024
The 0.05 boundary within Null Hypothesis Statistical Testing (NHST) "has made a lot of people very angry and been widely regarded as a bad move" (to quote Douglas Adams). Here, we move past meta-scientific arguments and ask an empirical question: What is the psychological standing of the 0.05 boundary for statistical significance? We…
Descriptors: Psychological Patterns, Statistical Analysis, Testing, Statistical Significance
Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013
A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…
Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance
Wright, Keith D. – ProQuest LLC, 2011
Standardized testing has been part of the American educational system for decades. Controversy from the beginning has plagued standardized testing, is plaguing testing today, and will continue to be controversial. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…
Descriptors: Test Bias, Test Items, Simulation, Testing
Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011
Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…
Descriptors: Medical Education, Evaluators, Intervals, Testing
Chidi, Christopher O.; Shadare, Oluseyi A. – Journal of International Education Research, 2011
This study investigated the influence of host community on industrial relations practices and policies using Agbara community and Power Holding Company of Nigeria PLC as a case. The study adopted both the qualitative and quantitative methods. A total of 120 samples were drawn from the population using the simple random sampling technique in which…
Descriptors: Testing, Social Sciences, Foreign Countries, Sampling
Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2008
This report presents guidelines for addressing the multiple comparisons problem in impact evaluations in the education area. The problem occurs due to the large number of hypothesis tests that are typically conducted across outcomes and subgroups in these studies, which can lead to spurious statistically significant impact findings. The…
Descriptors: Guidelines, Testing, Hypothesis Testing, Statistical Significance
Lord, Frederic M. – 1974
A statistical test for cheating is developed. The case of a single examinee who has taken parallel forms of the same selection test on three occasions, obtaining scores x, y, z, is used to illustrate the development. It is assumed that each score is normally distributed with the same known variance, that is, the variance of the errors of…
Descriptors: Cheating, Hypothesis Testing, Statistical Analysis, Statistical Significance
Peer reviewedMiller, John K. – Multivariate Behavioral Research, 1975
Descriptors: Correlation, Goodness of Fit, Hypothesis Testing, Matrices
Lord, Frederic M.; Stocking, Martha – 1972
A general Computer program is described that will compute asymptotic standard errors and carry out significance tests for an endless variety of (standard and) nonstandard large-sample statistical problems, without requiring the statistician to derive asymptotic standard error formulas. The program assumes that the observations have a multinormal…
Descriptors: Bulletins, Computer Programs, Data Processing, Error of Measurement
Onwuegbuzie, Anthony J.; Levin, Joel R.; Leech, Nancy L. – Learning Disabilities: A Contemporary Journal, 2003
Because of criticisms leveled at statistical hypothesis testing, some researchers have argued that measures of effect size should replace the significance-testing practice. We contend that although effect-size measures have logical appeal, they are also associated with a number of limitations that may result in problematic interpretations of them…
Descriptors: Intervals, Psychological Studies, Learning Disabilities, Testing
Thompson, Bruce – 1994
Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…
Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size
Ascher, Gordon – 1975
The increased use of criterion-referenced statewide testing programs is an outgrowth of the need for more diagnostic information for planning and decision making than is provided by norm-referenced programs. There remains, however, a need for state agencies to compare the results of local districts to a variety of comparison groups for the purpose…
Descriptors: Academic Achievement, Comparative Testing, Correlation, Criterion Referenced Tests

Direct link
