Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 28 |
Descriptor
| Simulation | 40 |
| Statistical Significance | 40 |
| Correlation | 12 |
| Effect Size | 11 |
| Computation | 10 |
| Sample Size | 10 |
| Statistical Analysis | 9 |
| Sampling | 8 |
| Item Response Theory | 7 |
| Models | 7 |
| Comparative Analysis | 5 |
| More ▼ | |
Source
Author
| Abraham, W. Todd | 1 |
| Aiken, Leona S. | 1 |
| Ashler, Daniel | 1 |
| Atkins, David C. | 1 |
| Beauchaine, Theodore P. | 1 |
| Bedics, Jamie D. | 1 |
| Buchanan, Taylor L. | 1 |
| Buonasera, Ash K. | 1 |
| Buttery, Paula | 1 |
| Carvajal, Jorge | 1 |
| Cham, Heining | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 40 |
| Reports - Research | 23 |
| Reports - Descriptive | 9 |
| Reports - Evaluative | 7 |
| Information Analyses | 1 |
Education Level
| Higher Education | 8 |
| Postsecondary Education | 5 |
| High Schools | 2 |
| Middle Schools | 2 |
| Elementary Education | 1 |
| Grade 5 | 1 |
| Grade 9 | 1 |
| Intermediate Grades | 1 |
| Kindergarten | 1 |
| Secondary Education | 1 |
Audience
| Teachers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Woodcock Reading Mastery Test | 1 |
What Works Clearinghouse Rating
Elliott, Mark; Buttery, Paula – Educational and Psychological Measurement, 2022
We investigate two non-iterative estimation procedures for Rasch models, the pair-wise estimation procedure (PAIR) and the Eigenvector method (EVM), and identify theoretical issues with EVM for rating scale model (RSM) threshold estimation. We develop a new procedure to resolve these issues--the conditional pairwise adjacent thresholds procedure…
Descriptors: Item Response Theory, Rating Scales, Computation, Simulation
Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…
Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance
Ernesto Sánchez; Victor Nozair García-Ríos; Francisco Sepúlveda – Educational Studies in Mathematics, 2024
Sampling distributions are fundamental for statistical inference, yet their abstract nature poses challenges for students. This research investigates the development of high school students' conceptions of sampling distribution through informal significance tests with the aid of digital technology. The study focuses on how technological tools…
Descriptors: High School Students, Concept Formation, Thinking Skills, Skill Development
Gorard, Stephen – International Journal of Social Research Methodology, 2019
This paper compares the use of confidence intervals (CIs) and a sensitivity analysis called the number needed to disturb (NNTD), in the analysis of research findings expressed as 'effect' sizes. Using 1,000 simulations of randomised trials with up to 1,000 cases in each, the paper shows that both approaches are very similar in outcomes, and each…
Descriptors: Intervals, Statistics, Social Sciences, Foreign Countries
Walker, Cindy M.; Gocer Sahin, Sakine – Educational and Psychological Measurement, 2017
The theoretical reason for the presence of differential item functioning (DIF) is that data are multidimensional and two groups of examinees differ in their underlying ability distribution for the secondary dimension(s). Therefore, the purpose of this study was to determine how much the secondary ability distributions must differ before DIF is…
Descriptors: Item Response Theory, Test Bias, Correlation, Statistical Significance
Suh, Youngsuk – Journal of Educational Measurement, 2016
This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…
Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance
Wollack, James A.; Cohen, Allan S.; Eckerly, Carol A. – Educational and Psychological Measurement, 2015
Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malfeasance. Few statistical approaches exist for detecting fraudulent erasures, and those that do largely do not…
Descriptors: Tests, Cheating, Item Response Theory, Accountability
Leth-Steensen, Craig; Gallitto, Elena – Educational and Psychological Measurement, 2016
A large number of approaches have been proposed for estimating and testing the significance of indirect effects in mediation models. In this study, four sets of Monte Carlo simulations involving full latent variable structural equation models were run in order to contrast the effectiveness of the currently popular bias-corrected bootstrapping…
Descriptors: Mediation Theory, Structural Equation Models, Monte Carlo Methods, Simulation
Cheng, Xusen; Wang, Xueyin; Huang, Jianqing; Zarifis, Alex – International Review of Research in Open and Distributed Learning, 2016
On the one hand, a growing amount of research discusses support for improving online collaborative learning quality, and many indicators are focused to assess its success. On the other hand, thinkLets for designing reputable and valuable collaborative processes have been developed for more than ten years. However, few studies try to apply…
Descriptors: Satisfaction, Electronic Learning, Cooperative Learning, Program Effectiveness
Pinder, Jonathan P. – Decision Sciences Journal of Innovative Education, 2014
Business analytics courses, such as marketing research, data mining, forecasting, and advanced financial modeling, have substantial predictive modeling components. The predictive modeling in these courses requires students to estimate and test many linear regressions. As a result, false positive variable selection ("type I errors") is…
Descriptors: Data Collection, Data Analysis, Regression (Statistics), Predictive Measurement
Cook, David A.; Hatala, Rose – Advances in Health Sciences Education, 2015
Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011,…
Descriptors: Educational Research, Comparative Analysis, Sample Size, Meta Analysis
Goldhaber, Dan; Chaplin, Duncan Dunbar – Journal of Research on Educational Effectiveness, 2015
In an influential paper, Jesse Rothstein (2010) shows that standard value-added models (VAMs) suggest implausible and large future teacher effects on past student achievement. This is the basis of a falsification test that "appears" to indicate bias in typical VAM estimates of teacher contributions to student learning on standardized…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Influence, Models
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Linting, Marielle; van Os, Bart Jan; Meulman, Jacqueline J. – Psychometrika, 2011
In this paper, the statistical significance of the contribution of variables to the principal components in principal components analysis (PCA) is assessed nonparametrically by the use of permutation tests. We compare a new strategy to a strategy used in previous research consisting of permuting the columns (variables) of a data matrix…
Descriptors: Intervals, Simulation, Statistical Significance, Factor Analysis
Buchanan, Taylor L.; Lohse, Keith R. – Measurement in Physical Education and Exercise Science, 2016
We surveyed researchers in the health and exercise sciences to explore different areas and magnitudes of bias in researchers' decision making. Participants were presented with scenarios (testing a central hypothesis with p = 0.06 or p = 0.04) in a random order and surveyed about what they would do in each scenario. Participants showed significant…
Descriptors: Researchers, Attitudes, Statistical Significance, Bias

Peer reviewed
Direct link
