V. N. Vimal Rao; Jeffrey K. Bye; Sashank Varma – Cognitive Research: Principles and Implications, 2024
The 0.05 boundary within Null Hypothesis Statistical Testing (NHST) "has made a lot of people very angry and been widely regarded as a bad move" (to quote Douglas Adams). Here, we move past meta-scientific arguments and ask an empirical question: What is the psychological standing of the 0.05 boundary for statistical significance? We…
Descriptors: Psychological Patterns, Statistical Analysis, Testing, Statistical Significance
Guo, Hongwen; Robin, Frederic; Dorans, Neil – Journal of Educational Measurement, 2017
The early detection of item drift is an important issue for frequently administered testing programs because items are reused over time. Unfortunately, operational data tend to be very sparse and do not lend themselves to frequent monitoring analyses, particularly for on-demand testing. Building on existing residual analyses, the authors propose…
Descriptors: Testing, Test Items, Identification, Sample Size
Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013
A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…
Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance
Miller, William M. – ProQuest LLC, 2013
Background/Purpose: In response to concerns with increasing rates of childhood obesity, many states have enacted policies that affect physical education. A commonly used approach is state mandated fitness test administration in school-based settings. While this approach is widely debated throughout the literature, one area that lacks research is…
Descriptors: Physical Education, Physical Education Teachers, Teacher Attitudes, State Legislation
Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011
Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…
Descriptors: Medical Education, Evaluators, Intervals, Testing
Chidi, Christopher O.; Shadare, Oluseyi A. – Journal of International Education Research, 2011
This study investigated the influence of host community on industrial relations practices and policies using Agbara community and Power Holding Company of Nigeria PLC as a case. The study adopted both the qualitative and quantitative methods. A total of 120 samples were drawn from the population using the simple random sampling technique in which…
Descriptors: Testing, Social Sciences, Foreign Countries, Sampling
A Critical Assessment of Null Hypothesis Significance Testing in Quantitative Communication Research
Levine, Timothy R.; Weber, Rene; Hullett, Craig; Park, Hee Sun; Lindsey, Lisa L. Massi – Human Communication Research, 2008
Null hypothesis significance testing (NHST) is the most widely accepted and frequently used approach to statistical inference in quantitative communication research. NHST, however, is highly controversial, and several serious problems with the approach have been identified. This paper reviews NHST and the controversy surrounding it. Commonly…
Descriptors: Communication Research, Testing, Statistical Significance, Statistical Inference
Brink, Carole Sanger – ProQuest LLC, 2011
In 2007, Georgia developed a comprehensive framework to define what students need to know. One component of this framework emphasizes the use of both formative and summative assessments as part of an integral and specific component of the teachers' performance evaluation. Georgia administers the Criterion-Referenced Competency Test (CRCT) to every…
Descriptors: Academic Achievement, High Stakes Tests, Educational Strategies, Program Effectiveness
Meijer, Rob R.; Oosterloo, Sebie J. – Measurement: Interdisciplinary Research and Perspectives, 2008
In elementary books on applied statistics (e.g., Siegel, 1988; Agresti, 1990) and books on research methodology in psychology and personality assessment (e.g., Aiken, 1999), it is often suggested that the choice of a statistical test and the choice of statistical operations should be determined by the level of measurement of the data. Although…
Descriptors: Measures (Individuals), Statistical Analysis, Testing, Attitudes
Bodkin-Andrews, Gawaian H.; Ha, My Trinh; Craven, Rhonda G.; Yeung, Alexander Seesing – International Journal of Testing, 2010
This investigation reports on the cross-cultural equivalence testing of the Self-Description Questionnaire II (short version; SDQII-S) for Indigenous and non-Indigenous Australian secondary student samples. A variety of statistical analysis techniques were employed to assess the psychometric properties of the SDQII-S for both the Indigenous and…
Descriptors: Indigenous Populations, Disadvantaged, Testing, Measures (Individuals)
Lord, Frederic M. – 1974
A statistical test for cheating is developed. The case of a single examinee who has taken parallel forms of the same selection test on three occasions, obtaining scores x, y, z, is used to illustrate the development. It is assumed that each score is normally distributed with the same known variance, that is, the variance of the errors of…
Descriptors: Cheating, Hypothesis Testing, Statistical Analysis, Statistical Significance
Gill, Martin – Edinburgh Working Papers in Applied Linguistics, 1993
This paper examines some of the implications of testing for statistical significance. After considering methodological issues raised by two examples from the literature, the paper proceeds to look in detail at a variety of misunderstandings attached to the reporting of "significant" results. It is concluded that significance testing is of limited…
Descriptors: Applied Linguistics, Case Studies, Foreign Countries, Linguistic Theory
Lord, Frederic M.; Stocking, Martha – 1972
A general computer program is described that will compute asymptotic standard errors and carry out significance tests for an endless variety of (standard and) nonstandard large-sample statistical problems, without requiring the statistician to derive asymptotic standard error formulas. The program assumes that the observations have a multinormal…
Descriptors: Bulletins, Computer Programs, Data Processing, Error of Measurement
Thompson, Bruce – 1994
Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…
Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size
Ascher, Gordon – 1975
The increased use of criterion-referenced statewide testing programs is an outgrowth of the need for more diagnostic information for planning and decision making than is provided by norm-referenced programs. There remains, however, a need for state agencies to compare the results of local districts to a variety of comparison groups for the purpose…
Descriptors: Academic Achievement, Comparative Testing, Correlation, Criterion Referenced Tests

