ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	7

Descriptor

Sample Size	12
Simulation	12
Statistical Significance	12
Effect Size	8
Statistical Analysis	4
Correlation	3
Test Items	3
Analysis of Covariance	2
Classification	2
Comparative Analysis	2
Educational Research	2
Hypothesis Testing	2
Item Response Theory	2
Meta Analysis	2
Research Methodology	2
Sampling	2
Statistical Distributions	2
Statistical Studies	2
Allied Health Occupations…	1
Analysis of Variance	1
College Instruction	1
College Mathematics	1
Computation	1
Databases	1
Discriminant Analysis	1
More ▼

Source

Educational and Psychological…	3
Advances in Health Sciences…	1
Evaluation & Research in…	1
Journal of Educational…	1
Journal of Statistics…	1
Multivariate Behavioral…	1
Psychological Methods	1
Psychometrika	1

Author

Carvajal, Jorge	1
Cook, David A.	1
Dawson, Robert	1
Debelak, Rudolf	1
Hatala, Rose	1
Henninger, Mirka	1
Lemons, Christopher J.	1
Magee, Kevin N.	1
Mecklin, Christopher J.	1
Overall, John E.	1
Sandler, Andrew B.	1
Saner, Hilary	1
Skorupski, William P.	1
Strobl, Carolin	1
Suh, Youngsuk	1
Wilcox, Rand R.	1
Zou, Guang Yong	1
More ▼

Publication Type

Journal Articles	10
Reports - Research	6
Reports - Evaluative	4
Reports - Descriptive	2
Speeches/Meeting Papers	2

Education Level

Higher Education	2
Postsecondary Education	2
Kindergarten	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Woodcock Reading Mastery Test

What Works Clearinghouse Rating

Showing all 12 results Save | Export

A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning

Peer reviewed

Direct link

Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…

Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance

Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model

Peer reviewed

Direct link

Suh, Youngsuk – Journal of Educational Measurement, 2016

This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…

Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance

Got Power? A Systematic Review of Sample Size Adequacy in Health Professions Education Research

Peer reviewed

Direct link

Cook, David A.; Hatala, Rose – Advances in Health Sciences Education, 2015

Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011,…

Descriptors: Educational Research, Comparative Analysis, Sample Size, Meta Analysis

How Significant Is a Boxplot Outlier?

Peer reviewed

Direct link

Dawson, Robert – Journal of Statistics Education, 2011

It is common to consider Tukey's schematic ("full") boxplot as an informal test for the existence of outliers. While the procedure is useful, it should be used with caution, as at least 30% of samples from a normally-distributed population of any size will be flagged as containing an outlier, while for small samples (N less than 10) even extreme…

Descriptors: Spreadsheets, Educational Technology, Simulation, Mathematics Activities

The Effects of Small Sample Size on Identifying Polytomous DIF Using the Liu-Agresti Estimator of the Cumulative Common Odds Ratio

Peer reviewed

Direct link

Carvajal, Jorge; Skorupski, William P. – Educational and Psychological Measurement, 2010

This study is an evaluation of the behavior of the Liu-Agresti estimator of the cumulative common odds ratio when identifying differential item functioning (DIF) with polytomously scored test items using small samples. The Liu-Agresti estimator has been proposed by Penfield and Algina as a promising approach for the study of polytomous DIF but no…

Descriptors: Test Bias, Sample Size, Test Items, Computation

Replication of Significant Correlations in Small Samples

Peer reviewed

Direct link

Lemons, Christopher J. – Evaluation & Research in Education, 2009

Researchers conducting studies involving individuals with exceptionalities are often prevented from involving large numbers of participants in their study samples. When this is the case, some say significant correlations are likely to replicate because the relation between two variables must be robust enough to be detected even with low…

Descriptors: Correlation, Statistical Significance, Sample Size, Statistical Analysis

Toward Using Confidence Intervals to Compare Correlations

Peer reviewed

Direct link

Zou, Guang Yong – Psychological Methods, 2007

Confidence intervals are widely accepted as a preferred way to present study results. They encompass significance tests and provide an estimate of the magnitude of the effect. However, comparisons of correlations still rely heavily on significance testing. The persistence of this practice is caused primarily by the lack of simple yet accurate…

Descriptors: Intervals, Effect Size, Research Methodology, Correlation

The Use of Equivalence Testing in Conjunction with Standard Hypothesis Testing and Effect Sizes.

Download full text

Mecklin, Christopher J. – 2002

Whether one should use null hypothesis testing, confidence intervals, and/or effect sizes is a source of continuing controversy in educational research. An alternative to testing for statistical significance, known as equivalence testing, is little used in educational research. Equivalence testing is useful in situations where the researcher…

Descriptors: Educational Research, Effect Size, Hypothesis Testing, Sample Size

Testing the Hypothesis of Independence between Two Sets of Variates.

Peer reviewed

Wilcox, Rand R. – Multivariate Behavioral Research, 1995

Five methods for testing the hypothesis of independence between two sets of variates were compared through simulation. Results indicate that two new methods, based on robust measures reflecting the linear association between two random variables, provide reasonably accurate control over Type I errors. Drawbacks to rank-based methods are discussed.…

Descriptors: Analysis of Covariance, Comparative Analysis, Hypothesis Testing, Robustness (Statistics)

A Conservative Inverse Normal Test Procedure for Combining P-Values in Integrative Research.

Peer reviewed

Saner, Hilary – Psychometrika, 1994

The use of p-values in combining results of studies often involves studies that are potentially aberrant. This paper proposes a combined test that permits trimming some of the extreme p-values. The trimmed statistic is based on an inverse cumulative normal transformation of the ordered p-values. (SLD)

Descriptors: Effect Size, Meta Analysis, Research Methodology, Sample Size

Formulae for Estimating Rater Reliability from the Significance of Treatment Effects.

Peer reviewed

Magee, Kevin N.; Overall, John E. – Educational and Psychological Measurement, 1992

Formulae for estimating individual rater reliabilities from analysis of treatment effects are presented and evaluated. Monte Carlo methods illustrate the formulae. Results indicate that large sample sizes, large true treatment effects, and large differences in the actual reliabilities of raters are required for the approach to be useful. (SLD)

Descriptors: Effect Size, Estimation (Mathematics), Experimental Groups, Mathematical Formulas

The Use of Invariance and Bootstrap Procedures as a Method to Establish the Reliability of Research Results.

Sandler, Andrew B. – 1987

Statistical significance is misused in educational and psychological research when it is applied as a method to establish the reliability of research results. Other techniques have been developed which can be correctly utilized to establish the generalizability of findings. Methods that do provide such estimates are known as invariance or…

Descriptors: Analysis of Covariance, Analysis of Variance, Correlation, Discriminant Analysis