NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…
Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
Suh, Youngsuk – Journal of Educational Measurement, 2016
This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…
Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
Cook, David A.; Hatala, Rose – Advances in Health Sciences Education, 2015
Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011,…
Descriptors: Educational Research, Comparative Analysis, Sample Size, Meta Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Dawson, Robert – Journal of Statistics Education, 2011
It is common to consider Tukey's schematic ("full") boxplot as an informal test for the existence of outliers. While the procedure is useful, it should be used with caution, as at least 30% of samples from a normally-distributed population of any size will be flagged as containing an outlier, while for small samples (N less than 10) even extreme…
Descriptors: Spreadsheets, Educational Technology, Simulation, Mathematics Activities
Peer reviewed Peer reviewed
Direct linkDirect link
Carvajal, Jorge; Skorupski, William P. – Educational and Psychological Measurement, 2010
This study is an evaluation of the behavior of the Liu-Agresti estimator of the cumulative common odds ratio when identifying differential item functioning (DIF) with polytomously scored test items using small samples. The Liu-Agresti estimator has been proposed by Penfield and Algina as a promising approach for the study of polytomous DIF but no…
Descriptors: Test Bias, Sample Size, Test Items, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Lemons, Christopher J. – Evaluation & Research in Education, 2009
Researchers conducting studies involving individuals with exceptionalities are often prevented from involving large numbers of participants in their study samples. When this is the case, some say significant correlations are likely to replicate because the relation between two variables must be robust enough to be detected even with low…
Descriptors: Correlation, Statistical Significance, Sample Size, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Zou, Guang Yong – Psychological Methods, 2007
Confidence intervals are widely accepted as a preferred way to present study results. They encompass significance tests and provide an estimate of the magnitude of the effect. However, comparisons of correlations still rely heavily on significance testing. The persistence of this practice is caused primarily by the lack of simple yet accurate…
Descriptors: Intervals, Effect Size, Research Methodology, Correlation
Mecklin, Christopher J. – 2002
Whether one should use null hypothesis testing, confidence intervals, and/or effect sizes is a source of continuing controversy in educational research. An alternative to testing for statistical significance, known as equivalence testing, is little used in educational research. Equivalence testing is useful in situations where the researcher…
Descriptors: Educational Research, Effect Size, Hypothesis Testing, Sample Size
Peer reviewed Peer reviewed
Wilcox, Rand R. – Multivariate Behavioral Research, 1995
Five methods for testing the hypothesis of independence between two sets of variates were compared through simulation. Results indicate that two new methods, based on robust measures reflecting the linear association between two random variables, provide reasonably accurate control over Type I errors. Drawbacks to rank-based methods are discussed.…
Descriptors: Analysis of Covariance, Comparative Analysis, Hypothesis Testing, Robustness (Statistics)
Peer reviewed Peer reviewed
Saner, Hilary – Psychometrika, 1994
The use of p-values in combining results of studies often involves studies that are potentially aberrant. This paper proposes a combined test that permits trimming some of the extreme p-values. The trimmed statistic is based on an inverse cumulative normal transformation of the ordered p-values. (SLD)
Descriptors: Effect Size, Meta Analysis, Research Methodology, Sample Size
Peer reviewed Peer reviewed
Magee, Kevin N.; Overall, John E. – Educational and Psychological Measurement, 1992
Formulae for estimating individual rater reliabilities from analysis of treatment effects are presented and evaluated. Monte Carlo methods illustrate the formulae. Results indicate that large sample sizes, large true treatment effects, and large differences in the actual reliabilities of raters are required for the approach to be useful. (SLD)
Descriptors: Effect Size, Estimation (Mathematics), Experimental Groups, Mathematical Formulas
Sandler, Andrew B. – 1987
Statistical significance is misused in educational and psychological research when it is applied as a method to establish the reliability of research results. Other techniques have been developed which can be correctly utilized to establish the generalizability of findings. Methods that do provide such estimates are known as invariance or…
Descriptors: Analysis of Covariance, Analysis of Variance, Correlation, Discriminant Analysis