ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	10

Descriptor

Statistical Analysis	15
Statistical Significance	15
Testing	15
Hypothesis Testing	8
Foreign Countries	4
Research Methodology	4
Academic Achievement	3
Measures (Individuals)	3
Criterion Referenced Tests	2
Data Analysis	2
Decision Making	2
Effect Size	2
Factor Analysis	2
Program Evaluation	2
Reliability	2
Researchers	2
Sample Size	2
Sampling	2
Test Results	2
Academic Standards	1
Applied Linguistics	1
Attitudes	1
Behavioral Science Research	1
Bulletins	1
Case Studies	1
More ▼

Source

ProQuest LLC	2
Cognitive Research:…	1
Edinburgh Working Papers in…	1
Educational and Psychological…	1
Human Communication Research	1
International Journal of…	1
Journal of Educational…	1
Journal of International…	1
Measurement:…	1
Practical Assessment,…	1

Publication Type

Journal Articles	9
Reports - Research	5
Reports - Evaluative	3
Dissertations/Theses -…	2
Opinion Papers	2
ERIC Digests in Full Text	1
ERIC Publications	1
Reports - Descriptive	1

Education Level

Early Childhood Education	1
Elementary Education	1
Grade 1	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
More ▼

Audience

Location

Australia	1
Canada	1
Georgia	1
Nigeria	1
West Virginia	1

Laws, Policies, & Programs

Assessments and Surveys

Georgia Criterion Referenced…	1
Iowa Tests of Basic Skills	1
Self Description Questionnaire	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

The Psychological Reality of the Learned "P < .05" Boundary

Peer reviewed

Direct link

V. N. Vimal Rao; Jeffrey K. Bye; Sashank Varma – Cognitive Research: Principles and Implications, 2024

The 0.05 boundary within Null Hypothesis Statistical Testing (NHST) "has made a lot of people very angry and been widely regarded as a bad move" (to quote Douglas Adams). Here, we move past meta-scientific arguments and ask an empirical question: What is the psychological standing of the 0.05 boundary for statistical significance? We…

Descriptors: Psychological Patterns, Statistical Analysis, Testing, Statistical Significance

Detecting Item Drift in Large-Scale Testing

Peer reviewed

Direct link

Guo, Hongwen; Robin, Frederic; Dorans, Neil – Journal of Educational Measurement, 2017

The early detection of item drift is an important issue for frequently administered testing programs because items are reused over time. Unfortunately, operational data tend to be very sparse and do not lend themselves to frequent monitoring analyses, particularly for on-demand testing. Building on existing residual analyses, the authors propose…

Descriptors: Testing, Test Items, Identification, Sample Size

Factorial Invariance in Multiple Populations: A Multiple Testing Procedure

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013

A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…

Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance

West Virginia Physical Education Teacher Perceptions of State Mandated Fitnessgram® Testing and Application of Results

Direct link

Miller, William M. – ProQuest LLC, 2013

Background/Purpose: In response to concerns with increasing rates of childhood obesity, many states have enacted policies that affect physical education. A commonly used approach is state mandated fitness test administration in school-based settings. While this approach is widely debated throughout the literature, one area that lacks research is…

Descriptors: Physical Education, Physical Education Teachers, Teacher Attitudes, State Legislation

Applying Tests of Equivalence for Multiple Group Comparisons: Demonstration of the Confidence Interval Approach

Peer reviewed

Direct link

Rusticus, Shayna A.; Lovato, Chris Y. – Practical Assessment, Research & Evaluation, 2011

Assessing the comparability of different groups is an issue facing many researchers and evaluators in a variety of settings. Commonly, null hypothesis significance testing (NHST) is incorrectly used to demonstrate comparability when a non-significant result is found. This is problematic because a failure to find a difference between groups is not…

Descriptors: Medical Education, Evaluators, Intervals, Testing

Influence of Host Community on Industrial Relations Practices and Policies: A Survey of Agbara Community and Power Holding Company of Nigeria (PHCN)

Peer reviewed

Direct link

Chidi, Christopher O.; Shadare, Oluseyi A. – Journal of International Education Research, 2011

This study investigated the influence of host community on industrial relations practices and policies using Agbara community and Power Holding Company of Nigeria PLC as a case. The study adopted both the qualitative and quantitative methods. A total of 120 samples were drawn from the population using the simple random sampling technique in which…

Descriptors: Testing, Social Sciences, Foreign Countries, Sampling

A Critical Assessment of Null Hypothesis Significance Testing in Quantitative Communication Research

Peer reviewed

Direct link

Levine, Timothy R.; Weber, Rene; Hullett, Craig; Park, Hee Sun; Lindsey, Lisa L. Massi – Human Communication Research, 2008

Null hypothesis significance testing (NHST) is the most widely accepted and frequently used approach to statistical inference in quantitative communication research. NHST, however, is highly controversial, and several serious problems with the approach have been identified. This paper reviews NHST and the controversy surrounding it. Commonly…

Descriptors: Communication Research, Testing, Statistical Significance, Statistical Inference

A Historical Perspective of Testing and Assessment Including the Impact of Summative and Formative Assessment on Student Achievement

Direct link

Brink, Carole Sanger – ProQuest LLC, 2011

In 2007, Georgia developed a comprehensive framework to define what students need to know. One component of this framework emphasizes the use of both formative and summative assessments as part of an integral and specific component of the teachers. performance evaluation. Georgia administers the Criterion-Referenced Competency Test (CRCT) to every…

Descriptors: Academic Achievement, High Stakes Tests, Educational Strategies, Program Effectiveness

A Note on Measurement Scales and Statistical Testing

Peer reviewed

Direct link

Meijer, Rob R.; Oosterloo, Sebie J. – Measurement: Interdisciplinary Research and Perspectives, 2008

In elementary books on applied statistics (e.g., Siegel, 1988; Agresti, 1990) and books on research methodology in psychology and personality assessment (e.g., Aiken, 1999), it is often suggested that the choice of a statistical test and the choice of statistical operations should be determined by the level of measurement of the data. Although…

Descriptors: Measures (Individuals), Statistical Analysis, Testing, Attitudes

Factorial Invariance Testing and Latent Mean Differences for the Self-Description Questionnaire II (Short Version) with Indigenous and Non-Indigenous Australian Secondary School Students

Peer reviewed

Direct link

Bodkin-Andrews, Gawaian H.; Ha, My Trinh; Craven, Rhonda G.; Yeung, Alexander Seesing – International Journal of Testing, 2010

This investigation reports on the cross-cultural equivalence testing of the Self-Description Questionnaire II (short version; SDQII-S) for Indigenous and non-Indigenous Australian secondary student samples. A variety of statistical analysis techniques were employed to assess the psychometric properties of the SDQII-S for both the Indigenous and…

Descriptors: Indigenous Populations, Disadvantaged, Testing, Measures (Individuals)

A Statistical Test for Cheating.

Download full text

Lord, Frederic M. – 1974

A statistical test for cheating is developed. The case of a single examinee who has taken parallel forms of the same selection test on three occasions, obtaining scores x, y, z, is used to illustrate the development. It is assumed that each score is normally distributed with the same known variance, that is, the variance of the errors of…

Descriptors: Cheating, Hypothesis Testing, Statistical Analysis, Statistical Significance

The Significance of "Significance."

Download full text

Gill, Martin – Edinburgh Working Papers in Applied Linguistics, 1993

This paper examines some of the implications of testing for statistical significance. After considering methodological issues raised by two examples from the literature, the paper proceeds to look in detail at a variety of misunderstandings attached to the reporting of "significant" results. It is concluded that significance testing is of limited…

Descriptors: Applied Linguistics, Case Studies, Foreign Countries, Linguistic Theory

Automated Hypothesis Tests and Standard Errors for Nonstandard Problems with Description of Computer Package: A Draft.

Download full text

Lord, Frederic M.; Stocking, Martha – 1972

A general Computer program is described that will compute asymptotic standard errors and carry out significance tests for an endless variety of (standard and) nonstandard large-sample statistical problems, without requiring the statistician to derive asymptotic standard error formulas. The program assumes that the observations have a multinormal…

Descriptors: Bulletins, Computer Programs, Data Processing, Error of Measurement

The Concept of Statistical Significance Testing. ERIC/AE Digest.

Download full text

Thompson, Bruce – 1994

Too few researchers understand what statistical significance testing does and does not do, and consequently their results are misinterpreted. This Digest explains the concept of statistical significance testing and discusses the meaning of probabilities, the concept of statistical significance, arguments against significance testing,…

Descriptors: Data Analysis, Data Interpretation, Decision Making, Effect Size

Some Nonparametric Approaches to the Use of Criterion-Referenced Statewide Test Results in the Evaluation of Local District Educational Programs.

Download full text

Ascher, Gordon – 1975

The increased use of criterion-referenced statewide testing programs is an outgrowth of the need for more diagnostic information for planning and decision making than is provided by norm-referenced programs. There remains, however, a need for state agencies to compare the results of local districts to a variety of comparison groups for the purpose…

Descriptors: Academic Achievement, Comparative Testing, Correlation, Criterion Referenced Tests

Lord, Frederic M.	2
Ascher, Gordon	1
Bodkin-Andrews, Gawaian H.	1
Brink, Carole Sanger	1
Chidi, Christopher O.	1
Craven, Rhonda G.	1
Dorans, Neil	1
Gill, Martin	1
Guo, Hongwen	1
Ha, My Trinh	1
Hullett, Craig	1
Jeffrey K. Bye	1
Levine, Timothy R.	1
Lindsey, Lisa L. Massi	1
Lovato, Chris Y.	1
Marcoulides, George A.	1
Meijer, Rob R.	1
Miller, William M.	1
Millsap, Roger E.	1
Oosterloo, Sebie J.	1
Park, Hee Sun	1
Raykov, Tenko	1
Robin, Frederic	1
Rusticus, Shayna A.	1
Sashank Varma	1
More ▼