NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)3
Since 2017 (last 10 years)6
Since 2007 (last 20 years)46
Audience
Teachers1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 46 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Elliott, Mark; Buttery, Paula – Educational and Psychological Measurement, 2022
We investigate two non-iterative estimation procedures for Rasch models, the pair-wise estimation procedure (PAIR) and the Eigenvector method (EVM), and identify theoretical issues with EVM for rating scale model (RSM) threshold estimation. We develop a new procedure to resolve these issues--the conditional pairwise adjacent thresholds procedure…
Descriptors: Item Response Theory, Rating Scales, Computation, Simulation
Peer reviewed Peer reviewed
Direct linkDirect link
Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…
Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
Ernesto Sánchez; Victor Nozair García-Ríos; Francisco Sepúlveda – Educational Studies in Mathematics, 2024
Sampling distributions are fundamental for statistical inference, yet their abstract nature poses challenges for students. This research investigates the development of high school students' conceptions of sampling distribution through informal significance tests with the aid of digital technology. The study focuses on how technological tools…
Descriptors: High School Students, Concept Formation, Thinking Skills, Skill Development
Peer reviewed Peer reviewed
Direct linkDirect link
Gorard, Stephen – International Journal of Social Research Methodology, 2019
This paper compares the use of confidence intervals (CIs) and a sensitivity analysis called the number needed to disturb (NNTD), in the analysis of research findings expressed as 'effect' sizes. Using 1,000 simulations of randomised trials with up to 1,000 cases in each, the paper shows that both approaches are very similar in outcomes, and each…
Descriptors: Intervals, Statistics, Social Sciences, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Cindy M.; Gocer Sahin, Sakine – Educational and Psychological Measurement, 2017
The theoretical reason for the presence of differential item functioning (DIF) is that data are multidimensional and two groups of examinees differ in their underlying ability distribution for the secondary dimension(s). Therefore, the purpose of this study was to determine how much the secondary ability distributions must differ before DIF is…
Descriptors: Item Response Theory, Test Bias, Correlation, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
Suh, Youngsuk – Journal of Educational Measurement, 2016
This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…
Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance
Brown, Robin T. – ProQuest LLC, 2017
This scholarly project was a non-experimental, pre/post-test design to (a) facilitate the voluntary adoption of the National Early Warning Score (NEWS), and (b) develop clinical decision making (CDM) in one cohort of junior level nursing students participating in a simulation lab. NEWS is an evidence-based predictive scoring tool developed by the…
Descriptors: Nursing Students, Scoring, Evidence Based Practice, Prediction
Peer reviewed Peer reviewed
Direct linkDirect link
Wollack, James A.; Cohen, Allan S.; Eckerly, Carol A. – Educational and Psychological Measurement, 2015
Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malfeasance. Few statistical approaches exist for detecting fraudulent erasures, and those that do largely do not…
Descriptors: Tests, Cheating, Item Response Theory, Accountability
Peer reviewed Peer reviewed
Direct linkDirect link
Leth-Steensen, Craig; Gallitto, Elena – Educational and Psychological Measurement, 2016
A large number of approaches have been proposed for estimating and testing the significance of indirect effects in mediation models. In this study, four sets of Monte Carlo simulations involving full latent variable structural equation models were run in order to contrast the effectiveness of the currently popular bias-corrected bootstrapping…
Descriptors: Mediation Theory, Structural Equation Models, Monte Carlo Methods, Simulation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Cheng, Xusen; Wang, Xueyin; Huang, Jianqing; Zarifis, Alex – International Review of Research in Open and Distributed Learning, 2016
On the one hand, a growing amount of research discusses support for improving online collaborative learning quality, and many indicators are focused to assess its success. On the other hand, thinkLets for designing reputable and valuable collaborative processes have been developed for more than ten years. However, few studies try to apply…
Descriptors: Satisfaction, Electronic Learning, Cooperative Learning, Program Effectiveness
Peer reviewed Peer reviewed
Direct linkDirect link
Pinder, Jonathan P. – Decision Sciences Journal of Innovative Education, 2014
Business analytics courses, such as marketing research, data mining, forecasting, and advanced financial modeling, have substantial predictive modeling components. The predictive modeling in these courses requires students to estimate and test many linear regressions. As a result, false positive variable selection ("type I errors") is…
Descriptors: Data Collection, Data Analysis, Regression (Statistics), Predictive Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Cook, David A.; Hatala, Rose – Advances in Health Sciences Education, 2015
Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011,…
Descriptors: Educational Research, Comparative Analysis, Sample Size, Meta Analysis
Ladd, Melissa – ProQuest LLC, 2016
This study strived to determine the effectiveness of the AR phonics program relative to the effectiveness of the scripted phonics program for developing the letter identification, sound verbalization, and blending abilities of kindergarten students considered at-risk based on state assessments. The researcher was interested in pretest and posttest…
Descriptors: Simulated Environment, Simulation, Phonics, Scripts
Peer reviewed Peer reviewed
Direct linkDirect link
Goldhaber, Dan; Chaplin, Duncan Dunbar – Journal of Research on Educational Effectiveness, 2015
In an influential paper, Jesse Rothstein (2010) shows that standard value-added models (VAMs) suggest implausible and large future teacher effects on past student achievement. This is the basis of a falsification test that "appears" to indicate bias in typical VAM estimates of teacher contributions to student learning on standardized…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Influence, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4