Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 20 |
Descriptor
| Simulation | 35 |
| Statistical Significance | 35 |
| Statistical Analysis | 11 |
| Correlation | 10 |
| Sampling | 10 |
| Effect Size | 9 |
| Computation | 8 |
| Hypothesis Testing | 7 |
| Item Response Theory | 7 |
| Comparative Analysis | 6 |
| Models | 6 |
Author
| Aiken, Leona S. | 1 |
| Ashler, Daniel | 1 |
| Barcikowski, Robert S. | 1 |
| Becker, Betsy Jane | 1 |
| Bloom, Howard S. | 1 |
| Buchanan, Taylor L. | 1 |
| Buonasera, Ash K. | 1 |
| Buttery, Paula | 1 |
| Cham, Heining | 1 |
| Chaplin, Duncan Dunbar | 1 |
| Cheng, Xusen | 1 |
Publication Type
| Reports - Research | 35 |
| Journal Articles | 23 |
| Speeches/Meeting Papers | 6 |
Education Level
| Higher Education | 6 |
| Postsecondary Education | 5 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 5 | 1 |
| High Schools | 1 |
| Intermediate Grades | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Audience
| Researchers | 2 |
Elliott, Mark; Buttery, Paula – Educational and Psychological Measurement, 2022
We investigate two non-iterative estimation procedures for Rasch models, the pair-wise estimation procedure (PAIR) and the Eigenvector method (EVM), and identify theoretical issues with EVM for rating scale model (RSM) threshold estimation. We develop a new procedure to resolve these issues--the conditional pairwise adjacent thresholds procedure…
Descriptors: Item Response Theory, Rating Scales, Computation, Simulation
Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…
Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance
Ernesto Sánchez; Victor Nozair García-Ríos; Francisco Sepúlveda – Educational Studies in Mathematics, 2024
Sampling distributions are fundamental for statistical inference, yet their abstract nature poses challenges for students. This research investigates the development of high school students' conceptions of sampling distribution through informal significance tests with the aid of digital technology. The study focuses on how technological tools…
Descriptors: High School Students, Concept Formation, Thinking Skills, Skill Development
Walker, Cindy M.; Gocer Sahin, Sakine – Educational and Psychological Measurement, 2017
The theoretical reason for the presence of differential item functioning (DIF) is that data are multidimensional and two groups of examinees differ in their underlying ability distribution for the secondary dimension(s). Therefore, the purpose of this study was to determine how much the secondary ability distributions must differ before DIF is…
Descriptors: Item Response Theory, Test Bias, Correlation, Statistical Significance
Suh, Youngsuk – Journal of Educational Measurement, 2016
This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…
Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance
Wollack, James A.; Cohen, Allan S.; Eckerly, Carol A. – Educational and Psychological Measurement, 2015
Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malfeasance. Few statistical approaches exist for detecting fraudulent erasures, and those that do largely do not…
Descriptors: Tests, Cheating, Item Response Theory, Accountability
Leth-Steensen, Craig; Gallitto, Elena – Educational and Psychological Measurement, 2016
A large number of approaches have been proposed for estimating and testing the significance of indirect effects in mediation models. In this study, four sets of Monte Carlo simulations involving full latent variable structural equation models were run in order to contrast the effectiveness of the currently popular bias-corrected bootstrapping…
Descriptors: Mediation Theory, Structural Equation Models, Monte Carlo Methods, Simulation
Cheng, Xusen; Wang, Xueyin; Huang, Jianqing; Zarifis, Alex – International Review of Research in Open and Distributed Learning, 2016
On the one hand, a growing amount of research discusses support for improving online collaborative learning quality, and many indicators are focused to assess its success. On the other hand, thinkLets for designing reputable and valuable collaborative processes have been developed for more than ten years. However, few studies try to apply…
Descriptors: Satisfaction, Electronic Learning, Cooperative Learning, Program Effectiveness
Cook, David A.; Hatala, Rose – Advances in Health Sciences Education, 2015
Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011,…
Descriptors: Educational Research, Comparative Analysis, Sample Size, Meta Analysis
Goldhaber, Dan; Chaplin, Duncan Dunbar – Journal of Research on Educational Effectiveness, 2015
In an influential paper, Jesse Rothstein (2010) shows that standard value-added models (VAMs) suggest implausible and large future teacher effects on past student achievement. This is the basis of a falsification test that "appears" to indicate bias in typical VAM estimates of teacher contributions to student learning on standardized…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Influence, Models
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the χ² statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Franciosi, Stephan J.; Mehring, Jeffrey – Research-publishing.net, 2015
Studies suggest that simulations and games not only improve target language skills, but they can also support knowledge creation regarding a broader variety of topics. Thus, we wanted to explore how playing an online simulation game affected knowledge of energy supply and its relationship to environmental and economic factors among learners of…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Educational Games
Glazerman, Steve; Dotter, Dallas – Mathematica Policy Research, Inc., 2016
We estimate school-choice preferences revealed by the rank-ordered lists submitted by more than 22,000 applicants to a citywide lottery for more than 200 traditional and charter public schools in Washington, DC. The results confirm previously reported findings that commuting distance, school demographics, and academic indicators play important…
Descriptors: Charter Schools, School Choice, Simulation, Selective Admission
Bloom, Howard S.; Porter, Kristin E.; Weiss, Michael J.; Raudenbush, Stephen – Society for Research on Educational Effectiveness, 2013
To date, evaluation research and policy analysis have focused mainly on average program impacts and paid little systematic attention to their variation. Recently, the growing number of multi-site randomized trials that are being planned and conducted make it increasingly feasible to study "cross-site" variation in impacts. Important…
Descriptors: Research Methodology, Policy, Evaluation Research, Randomized Controlled Trials
Buchanan, Taylor L.; Lohse, Keith R. – Measurement in Physical Education and Exercise Science, 2016
We surveyed researchers in the health and exercise sciences to explore different areas and magnitudes of bias in researchers' decision making. Participants were presented with scenarios (testing a central hypothesis with p = 0.06 or p = 0.04) in a random order and surveyed about what they would do in each scenario. Participants showed significant…
Descriptors: Researchers, Attitudes, Statistical Significance, Bias

