ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	3
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	11

Descriptor

Bayesian Statistics	12
Effect Size	12
Evaluation Methods	12
Hypothesis Testing	7
Probability	4
Data Interpretation	3
Goodness of Fit	3
Replication (Evaluation)	3
Statistical Distributions	3
Statistical Inference	3
Computation	2
Data Analysis	2
Educational Research	2
Error of Measurement	2
Evaluation Problems	2
Evidence	2
Experimental Psychology	2
Experiments	2
Measurement Techniques	2
Misconceptions	2
Models	2
Predictive Measurement	2
Program Evaluation	2
Robustness (Statistics)	2
Sample Size	2
More ▼

Source

Psychological Methods	3
Educational and Psychological…	2
Educational Researcher	1
Evaluation and the Health…	1
Grantee Submission	1
Journal of Experimental…	1
Journal of Special Education	1
National Center for Education…	1
Research Synthesis Methods	1

Publication Type

Journal Articles	10
Reports - Research	5
Reports - Descriptive	3
Reports - Evaluative	2
Guides - Non-Classroom	1
Opinion Papers	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Evaluating Local Model Misspecification with Modification Indices in Bayesian Structural Equation Modeling

Peer reviewed

Direct link

Mauricio Garnier-Villarreal; Terrence D. Jorgensen – Grantee Submission, 2024

Model evaluation is a crucial step in SEM, consisting of two broad areas: global and local fit, where local fit indices are use to modify the original model. In the modification process, the modification index (MI) and the standardized expected parameter change (SEPC) are used to select the parameters that can be added to improve the fit. The…

Descriptors: Bayesian Statistics, Structural Equation Models, Goodness of Fit, Indexes

A Tutorial on Aggregating Evidence from Conceptual Replication Studies Using the Product Bayes Factor

Peer reviewed

Direct link

Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024

The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…

Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size

The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers. Toolkit. NCEE 2022-005

Peer reviewed
PDF on ERIC

Download full text

Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022

BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…

Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing

Whose Prior Is It Anyway? A Note on "Rigorous Large-Scale Educational RCTs Are Often Uninformative"

Peer reviewed

Direct link

Simpson, Adrian – Educational Researcher, 2019

A recent paper uses Bayes factors to argue a large minority of rigorous, large-scale education RCTs are "uninformative." The definition of "uninformative" depends on the authors' hypothesis choices for calculating Bayes factors. These arguably overadjust for effect size inflation and involve a fixed prior distribution,…

Descriptors: Randomized Controlled Trials, Bayesian Statistics, Educational Research, Program Evaluation

Hypothesis Testing, "p" Values, Confidence Intervals, Measures of Effect Size, and Bayesian Methods in Light of Modern Robust Techniques

Peer reviewed

Direct link

Wilcox, Rand R.; Serang, Sarfaraz – Educational and Psychological Measurement, 2017

The article provides perspectives on p values, null hypothesis testing, and alternative techniques in light of modern robust statistical methods. Null hypothesis testing and "p" values can provide useful information provided they are interpreted in a sound manner, which includes taking into account insights and advances that have…

Descriptors: Hypothesis Testing, Bayesian Statistics, Computation, Effect Size

Using Visual Analysis to Evaluate and Refine Multilevel Models of Single-Case Studies

Peer reviewed

Direct link

Baek, Eun Kyeng; Petit-Bois, Merlande; Van den Noortgate, Wim; Beretvas, S. Natasha; Ferron, John M. – Journal of Special Education, 2016

In special education, multilevel models of single-case research have been used as a method of estimating treatment effects over time and across individuals. Although multilevel models can accurately summarize the effect, it is known that if the model is misspecified, inferences about the effects can be biased. Concern with the potential for model…

Descriptors: Models, Case Studies, Special Education, Outcomes of Treatment

Bayesian Estimation Supersedes the "t" Test

Peer reviewed

Direct link

Kruschke, John K. – Journal of Experimental Psychology: General, 2013

Bayesian estimation for 2 groups provides complete distributions of credible values for the effect size, group means and their difference, standard deviations and their difference, and the normality of the data. The method handles outliers. The decision rule can accept the null value (unlike traditional "t" tests) when certainty in the estimate is…

Descriptors: Bayesian Statistics, Computation, Evaluation Methods, Computer Software

Evaluation of Two Types of Differential Item Functioning in Factor Mixture Models with Binary Outcomes

Peer reviewed

Direct link

Lee, HwaYoung; Beretvas, S. Natasha – Educational and Psychological Measurement, 2014

Conventional differential item functioning (DIF) detection methods (e.g., the Mantel-Haenszel test) can be used to detect DIF only across observed groups, such as gender or ethnicity. However, research has found that DIF is not typically fully explained by an observed variable. True sources of DIF may include unobserved, latent variables, such as…

Descriptors: Item Analysis, Factor Structure, Bayesian Statistics, Goodness of Fit

Bayes Factor Approaches for Testing Interval Null Hypotheses

Peer reviewed

Direct link

Morey, Richard D.; Rouder, Jeffrey N. – Psychological Methods, 2011

Psychological theories are statements of constraint. The role of hypothesis testing in psychology is to test whether specific theoretical constraints hold in data. Bayesian statistics is well suited to the task of finding supporting evidence for constraint, because it allows for comparing evidence for 2 hypotheses against each another. One issue…

Descriptors: Evidence, Intervals, Testing, Hypothesis Testing

A Model-Averaging Approach to Replication : The Case of "p[subscript rep]"

Peer reviewed

Direct link

Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010

The purpose of the recently proposed "p[subscript rep]" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p[subscript rep]" and recommends its use…

Descriptors: Effect Size, Evaluation Methods, Probability, Experiments

Replication, "p[subscript rep]," and Confidence Intervals: Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Cumming, Geoff – Psychological Methods, 2010

This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…

Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity

A Bayesian Aggregate Meta-Analytic Evaluation Approach.

Peer reviewed

Suen, Hoi K. – Evaluation and the Health Professions, 1984

The Bayesian inferential process is modified for use in an aggregate meta-analytic evaluation. Compared with the average effect size meta-analytic approach, the Bayesian approach was more sensitive, more consistent and more powerful. This approach is recommended when primary data are not available and when all evaluations involve comparisons of…

Descriptors: Bayesian Statistics, Data Interpretation, Effect Size, Evaluation Methods

Beretvas, S. Natasha	2
Baek, Eun Kyeng	1
Caspar J. Van Lissa	1
Cumming, Geoff	1
Deke, John	1
Eli-Boaz Clapper	1
Ferron, John M.	1
Finucane, Mariel	1
Iverson, Geoffrey J.	1
Kruschke, John K.	1
Lee, HwaYoung	1
Lee, Michael D.	1
Mauricio Garnier-Villarreal	1
Morey, Richard D.	1
Petit-Bois, Merlande	1
Rebecca Kuiper	1
Rouder, Jeffrey N.	1
Serang, Sarfaraz	1
Simpson, Adrian	1
Suen, Hoi K.	1
Terrence D. Jorgensen	1
Thal, Daniel	1
Van den Noortgate, Wim	1
Wagenmakers, Eric-Jan	1
Wilcox, Rand R.	1
More ▼