Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 11 |
Descriptor
| Bayesian Statistics | 12 |
| Effect Size | 12 |
| Evaluation Methods | 12 |
| Hypothesis Testing | 7 |
| Probability | 4 |
| Data Interpretation | 3 |
| Goodness of Fit | 3 |
| Replication (Evaluation) | 3 |
| Statistical Distributions | 3 |
| Statistical Inference | 3 |
| Computation | 2 |
| More ▼ | |
Source
Author
| Beretvas, S. Natasha | 2 |
| Baek, Eun Kyeng | 1 |
| Caspar J. Van Lissa | 1 |
| Cumming, Geoff | 1 |
| Deke, John | 1 |
| Eli-Boaz Clapper | 1 |
| Ferron, John M. | 1 |
| Finucane, Mariel | 1 |
| Iverson, Geoffrey J. | 1 |
| Kruschke, John K. | 1 |
| Lee, HwaYoung | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 10 |
| Reports - Research | 5 |
| Reports - Descriptive | 3 |
| Reports - Evaluative | 2 |
| Guides - Non-Classroom | 1 |
| Opinion Papers | 1 |
Education Level
Audience
| Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Mauricio Garnier-Villarreal; Terrence D. Jorgensen – Grantee Submission, 2024
Model evaluation is a crucial step in SEM, consisting of two broad areas: global and local fit, where local fit indices are use to modify the original model. In the modification process, the modification index (MI) and the standardized expected parameter change (SEPC) are used to select the parameters that can be added to improve the fit. The…
Descriptors: Bayesian Statistics, Structural Equation Models, Goodness of Fit, Indexes
Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024
The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…
Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size
Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022
BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…
Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing
Simpson, Adrian – Educational Researcher, 2019
A recent paper uses Bayes factors to argue a large minority of rigorous, large-scale education RCTs are "uninformative." The definition of "uninformative" depends on the authors' hypothesis choices for calculating Bayes factors. These arguably overadjust for effect size inflation and involve a fixed prior distribution,…
Descriptors: Randomized Controlled Trials, Bayesian Statistics, Educational Research, Program Evaluation
Wilcox, Rand R.; Serang, Sarfaraz – Educational and Psychological Measurement, 2017
The article provides perspectives on p values, null hypothesis testing, and alternative techniques in light of modern robust statistical methods. Null hypothesis testing and "p" values can provide useful information provided they are interpreted in a sound manner, which includes taking into account insights and advances that have…
Descriptors: Hypothesis Testing, Bayesian Statistics, Computation, Effect Size
Baek, Eun Kyeng; Petit-Bois, Merlande; Van den Noortgate, Wim; Beretvas, S. Natasha; Ferron, John M. – Journal of Special Education, 2016
In special education, multilevel models of single-case research have been used as a method of estimating treatment effects over time and across individuals. Although multilevel models can accurately summarize the effect, it is known that if the model is misspecified, inferences about the effects can be biased. Concern with the potential for model…
Descriptors: Models, Case Studies, Special Education, Outcomes of Treatment
Kruschke, John K. – Journal of Experimental Psychology: General, 2013
Bayesian estimation for 2 groups provides complete distributions of credible values for the effect size, group means and their difference, standard deviations and their difference, and the normality of the data. The method handles outliers. The decision rule can accept the null value (unlike traditional "t" tests) when certainty in the estimate is…
Descriptors: Bayesian Statistics, Computation, Evaluation Methods, Computer Software
Lee, HwaYoung; Beretvas, S. Natasha – Educational and Psychological Measurement, 2014
Conventional differential item functioning (DIF) detection methods (e.g., the Mantel-Haenszel test) can be used to detect DIF only across observed groups, such as gender or ethnicity. However, research has found that DIF is not typically fully explained by an observed variable. True sources of DIF may include unobserved, latent variables, such as…
Descriptors: Item Analysis, Factor Structure, Bayesian Statistics, Goodness of Fit
Morey, Richard D.; Rouder, Jeffrey N. – Psychological Methods, 2011
Psychological theories are statements of constraint. The role of hypothesis testing in psychology is to test whether specific theoretical constraints hold in data. Bayesian statistics is well suited to the task of finding supporting evidence for constraint, because it allows for comparing evidence for 2 hypotheses against each another. One issue…
Descriptors: Evidence, Intervals, Testing, Hypothesis Testing
Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010
The purpose of the recently proposed "p[subscript rep]" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p[subscript rep]" and recommends its use…
Descriptors: Effect Size, Evaluation Methods, Probability, Experiments
Cumming, Geoff – Psychological Methods, 2010
This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…
Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity
Peer reviewedSuen, Hoi K. – Evaluation and the Health Professions, 1984
The Bayesian inferential process is modified for use in an aggregate meta-analytic evaluation. Compared with the average effect size meta-analytic approach, the Bayesian approach was more sensitive, more consistent and more powerful. This approach is recommended when primary data are not available and when all evaluations involve comparisons of…
Descriptors: Bayesian Statistics, Data Interpretation, Effect Size, Evaluation Methods

Direct link
