NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
V. N. Vimal Rao; Jeffrey K. Bye; Sashank Varma – Cognitive Research: Principles and Implications, 2024
The 0.05 boundary within Null Hypothesis Statistical Testing (NHST) "has made a lot of people very angry and been widely regarded as a bad move" (to quote Douglas Adams). Here, we move past meta-scientific arguments and ask an empirical question: What is the psychological standing of the 0.05 boundary for statistical significance? We…
Descriptors: Psychological Patterns, Statistical Analysis, Testing, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013
A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…
Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance
Wright, Keith D. – ProQuest LLC, 2011
Standardized testing has been part of the American educational system for decades. Controversy from the beginning has plagued standardized testing, is plaguing testing today, and will continue to be controversial. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…
Descriptors: Test Bias, Test Items, Simulation, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011
Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…
Descriptors: Medical Education, Evaluators, Intervals, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Chidi, Christopher O.; Shadare, Oluseyi A. – Journal of International Education Research, 2011
This study investigated the influence of host community on industrial relations practices and policies using Agbara community and Power Holding Company of Nigeria PLC as a case. The study adopted both the qualitative and quantitative methods. A total of 120 samples were drawn from the population using the simple random sampling technique in which…
Descriptors: Testing, Social Sciences, Foreign Countries, Sampling
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Schochet, Peter Z. – National Center for Education Evaluation and Regional Assistance, 2008
This report presents guidelines for addressing the multiple comparisons problem in impact evaluations in the education area. The problem occurs due to the large number of hypothesis tests that are typically conducted across outcomes and subgroups in these studies, which can lead to spurious statistically significant impact findings. The…
Descriptors: Guidelines, Testing, Hypothesis Testing, Statistical Significance
Lord, Frederic M. – 1974
A statistical test for cheating is developed. The case of a single examinee who has taken parallel forms of the same selection test on three occasions, obtaining scores x, y, z, is used to illustrate the development. It is assumed that each score is normally distributed with the same known variance, that is, the variance of the errors of…
Descriptors: Cheating, Hypothesis Testing, Statistical Analysis, Statistical Significance
Peer reviewed Peer reviewed
Miller, John K. – Multivariate Behavioral Research, 1975
Descriptors: Correlation, Goodness of Fit, Hypothesis Testing, Matrices
Lord, Frederic M.; Stocking, Martha – 1972
A general Computer program is described that will compute asymptotic standard errors and carry out significance tests for an endless variety of (standard and) nonstandard large-sample statistical problems, without requiring the statistician to derive asymptotic standard error formulas. The program assumes that the observations have a multinormal…
Descriptors: Bulletins, Computer Programs, Data Processing, Error of Measurement
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Onwuegbuzie, Anthony J.; Levin, Joel R.; Leech, Nancy L. – Learning Disabilities: A Contemporary Journal, 2003
Because of criticisms leveled at statistical hypothesis testing, some researchers have argued that measures of effect size should replace the significance-testing practice. We contend that although effect-size measures have logical appeal, they are also associated with a number of limitations that may result in problematic interpretations of them…
Descriptors: Intervals, Psychological Studies, Learning Disabilities, Testing
Thompson, Bruce – 1994
Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…
Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size
Ascher, Gordon – 1975
The increased use of criterion-referenced statewide testing programs is an outgrowth of the need for more diagnostic information for planning and decision making than is provided by norm-referenced programs. There remains, however, a need for state agencies to compare the results of local districts to a variety of comparison groups for the purpose…
Descriptors: Academic Achievement, Comparative Testing, Correlation, Criterion Referenced Tests