ERIC - Search Results

Showing 1 to 15 of 40 results

Non-Iterative Conditional Pairwise Estimation for the Rating Scale Model

Peer reviewed

Elliott, Mark; Buttery, Paula – Educational and Psychological Measurement, 2022

We investigate two non-iterative estimation procedures for Rasch models, the pair-wise estimation procedure (PAIR) and the Eigenvector method (EVM), and identify theoretical issues with EVM for rating scale model (RSM) threshold estimation. We develop a new procedure to resolve these issues--the conditional pairwise adjacent thresholds procedure…

Descriptors: Item Response Theory, Rating Scales, Computation, Simulation

A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning

Peer reviewed

Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…

Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance

Development of High School Students' Conceptions of Sampling Distribution in the Context of Learning Significance Tests with Technology

Peer reviewed

Sánchez, Ernesto; García-Ríos, Victor Nozair; Sepúlveda, Francisco – Educational Studies in Mathematics, 2024

Sampling distributions are fundamental for statistical inference, yet their abstract nature poses challenges for students. This research investigates the development of high school students' conceptions of sampling distribution through informal significance tests with the aid of digital technology. The study focuses on how technological tools…

Descriptors: High School Students, Concept Formation, Thinking Skills, Skill Development

Do We Really Need Confidence Intervals in the New Statistics?

Peer reviewed

Gorard, Stephen – International Journal of Social Research Methodology, 2019

This paper compares the use of confidence intervals (CIs) and a sensitivity analysis called the number needed to disturb (NNTD), in the analysis of research findings expressed as 'effect' sizes. Using 1,000 simulations of randomised trials with up to 1,000 cases in each, the paper shows that both approaches are very similar in outcomes, and each…

Descriptors: Intervals, Statistics, Social Sciences, Foreign Countries

Using a Multidimensional IRT Framework to Better Understand Differential Item Functioning (DIF): A Tale of Three DIF Detection Procedures

Peer reviewed

Walker, Cindy M.; Gocer Sahin, Sakine – Educational and Psychological Measurement, 2017

The theoretical reason for the presence of differential item functioning (DIF) is that data are multidimensional and two groups of examinees differ in their underlying ability distribution for the secondary dimension(s). Therefore, the purpose of this study was to determine how much the secondary ability distributions must differ before DIF is…

Descriptors: Item Response Theory, Test Bias, Correlation, Statistical Significance

Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model

Peer reviewed

Suh, Youngsuk – Journal of Educational Measurement, 2016

This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…

Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance

Detecting Test Tampering Using Item Response Theory

Peer reviewed

Wollack, James A.; Cohen, Allan S.; Eckerly, Carol A. – Educational and Psychological Measurement, 2015

Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malfeasance. Few statistical approaches exist for detecting fraudulent erasures, and those that do largely do not…

Descriptors: Tests, Cheating, Item Response Theory, Accountability

Testing Mediation in Structural Equation Modeling: The Effectiveness of the Test of Joint Significance

Peer reviewed

Leth-Steensen, Craig; Gallitto, Elena – Educational and Psychological Measurement, 2016

A large number of approaches have been proposed for estimating and testing the significance of indirect effects in mediation models. In this study, four sets of Monte Carlo simulations involving full latent variable structural equation models were run in order to contrast the effectiveness of the currently popular bias-corrected bootstrapping…

Descriptors: Mediation Theory, Structural Equation Models, Monte Carlo Methods, Simulation

An Experimental Study of Satisfaction Response: Evaluation of Online Collaborative Learning

Peer reviewed

Cheng, Xusen; Wang, Xueyin; Huang, Jianqing; Zarifis, Alex – International Review of Research in Open and Distributed Learning, 2016

On the one hand, a growing amount of research discusses support for improving online collaborative learning quality, and many indicators are focused to assess its success. On the other hand, thinkLets for designing reputable and valuable collaborative processes have been developed for more than ten years. However, few studies try to apply…

Descriptors: Satisfaction, Electronic Learning, Cooperative Learning, Program Effectiveness

A Demonstration of Regression False Positive Selection in Data Mining

Peer reviewed

Pinder, Jonathan P. – Decision Sciences Journal of Innovative Education, 2014

Business analytics courses, such as marketing research, data mining, forecasting, and advanced financial modeling, have substantial predictive modeling components. The predictive modeling in these courses requires students to estimate and test many linear regressions. As a result, false positive variable selection ("type I errors") is…

Descriptors: Data Collection, Data Analysis, Regression (Statistics), Predictive Measurement

Got Power? A Systematic Review of Sample Size Adequacy in Health Professions Education Research

Peer reviewed

Cook, David A.; Hatala, Rose – Advances in Health Sciences Education, 2015

Many education research studies employ small samples, which in turn lowers statistical power. We re-analyzed the results of a meta-analysis of simulation-based education to determine study power across a range of effect sizes, and the smallest effect that could be plausibly excluded. We systematically searched multiple databases through May 2011,…

Descriptors: Educational Research, Comparative Analysis, Sample Size, Meta Analysis

Assessing the "Rothstein Falsification Test": Does It Really Show Teacher Value-Added Models Are Biased?

Peer reviewed

Goldhaber, Dan; Chaplin, Duncan Dunbar – Journal of Research on Educational Effectiveness, 2015

In an influential paper, Jesse Rothstein (2010) shows that standard value-added models (VAMs) suggest implausible and large future teacher effects on past student achievement. This is the basis of a falsification test that "appears" to indicate bias in typical VAM estimates of teacher contributions to student learning on standardized…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Influence, Models

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the l_z statistic and the χ² statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Statistical Significance of the Contribution of Variables to the PCA Solution: An Alternative Permutation Strategy

Peer reviewed

Linting, Marielle; van Os, Bart Jan; Meulman, Jacqueline J. – Psychometrika, 2011

In this paper, the statistical significance of the contribution of variables to the principal components in principal components analysis (PCA) is assessed nonparametrically by the use of permutation tests. We compare a new strategy to a strategy used in previous research consisting of permuting the columns (variables) of a data matrix…

Descriptors: Intervals, Simulation, Statistical Significance, Factor Analysis

Researchers' Perceptions of Statistical Significance Contribute to Bias in Health and Exercise Science

Peer reviewed

Buchanan, Taylor L.; Lohse, Keith R. – Measurement in Physical Education and Exercise Science, 2016

We surveyed researchers in the health and exercise sciences to explore different areas and magnitudes of bias in researchers' decision making. Participants were presented with scenarios (testing a central hypothesis with p = 0.06 or p = 0.04) in a random order and surveyed about what they would do in each scenario. Participants showed significant…

Descriptors: Researchers, Attitudes, Statistical Significance, Bias
