Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 58 |
| Since 2017 (last 10 years) | 90 |
| Since 2007 (last 20 years) | 157 |
Descriptor
| Bayesian Statistics | 194 |
| Evaluation Methods | 194 |
| Models | 70 |
| Simulation | 40 |
| Comparative Analysis | 36 |
| Computation | 33 |
| Item Response Theory | 30 |
| Hypothesis Testing | 29 |
| Statistical Analysis | 28 |
| Probability | 26 |
| Data Analysis | 24 |
| More ▼ | |
Source
Author
| Chun Wang | 3 |
| Lee, Michael D. | 3 |
| Lee, Sik-Yum | 3 |
| Bejar, Isaac I. | 2 |
| Beretvas, S. Natasha | 2 |
| David Kaplan | 2 |
| Houston, Walter M. | 2 |
| James Ohisei Uanhoro | 2 |
| Jihong Zhang | 2 |
| Jing Lu | 2 |
| Jiwei Zhang | 2 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 7 |
| Administrators | 1 |
| Students | 1 |
| Teachers | 1 |
Location
| Australia | 2 |
| Germany | 2 |
| Italy | 2 |
| Brazil | 1 |
| California | 1 |
| China | 1 |
| Florida | 1 |
| Florida (Miami) | 1 |
| Iceland | 1 |
| Louisiana | 1 |
| Missouri | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Program for International… | 6 |
| Early Childhood Longitudinal… | 3 |
| National Longitudinal Study… | 1 |
| Trends in International… | 1 |
| Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Gardner, Josh; Brooks, Christopher – Journal of Learning Analytics, 2018
Model evaluation -- the process of making inferences about the performance of predictive models -- is a critical component of predictive modelling research in learning analytics. We survey the state of the practice with respect to model evaluation in learning analytics, which overwhelmingly uses only naïve methods for model evaluation or…
Descriptors: Prediction, Models, Evaluation, Evaluation Methods
Henman, Paul; Brown, Scott D.; Dennis, Simon – Australian Universities' Review, 2017
In 2015, the Australian Government's Excellence in Research for Australia (ERA) assessment of research quality declined to rate 1.5 per cent of submissions from universities. The public debate focused on practices of gaming or "coding errors" within university submissions as the reason for this outcome. The issue was about the…
Descriptors: Rating Scales, Foreign Countries, Universities, Achievement Rating
Hicks, Tyler; Rodríguez-Campos, Liliana; Choi, Jeong Hoon – American Journal of Evaluation, 2018
To begin statistical analysis, Bayesians quantify their confidence in modeling hypotheses with priors. A prior describes the probability of a certain modeling hypothesis apart from the data. Bayesians should be able to defend their choice of prior to a skeptical audience. Collaboration between evaluators and stakeholders could make their choices…
Descriptors: Bayesian Statistics, Evaluation Methods, Statistical Analysis, Hypothesis Testing
Kim, Seohyun; Lu, Zhenqiu; Cohen, Allan S. – Measurement: Interdisciplinary Research and Perspectives, 2018
Bayesian algorithms have been used successfully in the social and behavioral sciences to analyze dichotomous data particularly with complex structural equation models. In this study, we investigate the use of the Polya-Gamma data augmentation method with Gibbs sampling to improve estimation of structural equation models with dichotomous variables.…
Descriptors: Bayesian Statistics, Structural Equation Models, Computation, Social Science Research
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017
This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…
Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics
Okada, Kensuke – Research Synthesis Methods, 2015
This paper proposes a new method to evaluate informative hypotheses for meta-analysis of Cronbach's coefficient alpha using a Bayesian approach. The coefficient alpha is one of the most widely used reliability indices. In meta-analyses of reliability, researchers typically form specific informative hypotheses beforehand, such as "alpha of…
Descriptors: Correlation, Bayesian Statistics, Meta Analysis, Hypothesis Testing
De Marsico, Maria; Sciarrone, Filippo; Sterbini, Andrea; Temperini, Marco – EURASIA Journal of Mathematics, Science & Technology Education, 2017
We show an approach to semi-automatic grading of answers given by students to open ended questions (open answers). We use both peer-evaluation and teacher evaluation. A learner is modeled by her Knowledge and her assessments quality (Judgment). The data generated by the peer- and teacher-evaluations, and by the learner models is represented by a…
Descriptors: Evaluation Methods, Peer Evaluation, Models, Grading
Christ, Theodore J.; Desjardins, Christopher David – Journal of Psychoeducational Assessment, 2018
Curriculum-Based Measurement of Oral Reading (CBM-R) is often used to monitor student progress and guide educational decisions. Ordinary least squares regression (OLSR) is the most widely used method to estimate the slope, or rate of improvement (ROI), even though published research demonstrates OLSR's lack of validity and reliability, and…
Descriptors: Bayesian Statistics, Curriculum Based Assessment, Oral Reading, Least Squares Statistics
Finch, William Holmes; Hernandez Finch, Maria E. – AERA Online Paper Repository, 2017
High dimensional multivariate data, where the number of variables approaches or exceeds the sample size, is an increasingly common occurrence for social scientists. Several tools exist for dealing with such data in the context of univariate regression, including regularization methods such as Lasso, Elastic net, Ridge Regression, as well as the…
Descriptors: Multivariate Analysis, Regression (Statistics), Sampling, Sample Size
Ames, Allison J.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2015
Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing…
Descriptors: Item Response Theory, Goodness of Fit, Models, Evaluation Methods
Martin-Fernandez, Manuel; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
This study compares the performance of two estimation algorithms of new usage, the Metropolis-Hastings Robins-Monro (MHRM) and the Hamiltonian MCMC (HMC), with two consolidated algorithms in the psychometric literature, the marginal likelihood via EM algorithm (MML-EM) and the Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…
Descriptors: Bayesian Statistics, Item Response Theory, Models, Comparative Analysis
Falakmasir, Mohammad; Yudelson, Michael; Ritter, Steve; Koedinger, Ken – International Educational Data Mining Society, 2015
Bayesian Knowledge Tracing (BKT) has been in wide use for modeling student skill acquisition in Intelligent Tutoring Systems (ITS). BKT tracks and updates student's latent mastery of a skill as a probability distribution of a binary variable. BKT does so by accounting for observed student successes in applying the skill correctly, where success is…
Descriptors: Bayesian Statistics, Models, Skill Development, Intelligent Tutoring Systems
Martori, Francesc; Cuadros, Jordi; González-Sabaté, Lucinio – International Educational Data Mining Society, 2015
Student modeling can help guide the behavior of a cognitive tutor system and provide insight to researchers on understanding how students learn. In this context, Bayesian Knowledge Tracing (BKT) is one of the most popular knowledge inference models due to its predictive accuracy, interpretability and ability to infer student knowledge. However,…
Descriptors: Bayesian Statistics, Inferences, Prediction, Accuracy
Bloom, Howard S.; Raudenbush, Stephen W.; Weiss, Michael J.; Porter, Kristin – Journal of Research on Educational Effectiveness, 2017
The present article considers a fundamental question in evaluation research: "By how much do program effects vary across sites?" The article first presents a theoretical model of cross-site impact variation and a related estimation model with a random treatment coefficient and fixed site-specific intercepts. This approach eliminates…
Descriptors: Evaluation Research, Program Evaluation, Welfare Services, Employment
Baek, Eun Kyeng; Petit-Bois, Merlande; Van den Noortgate, Wim; Beretvas, S. Natasha; Ferron, John M. – Journal of Special Education, 2016
In special education, multilevel models of single-case research have been used as a method of estimating treatment effects over time and across individuals. Although multilevel models can accurately summarize the effect, it is known that if the model is misspecified, inferences about the effects can be biased. Concern with the potential for model…
Descriptors: Models, Case Studies, Special Education, Outcomes of Treatment

Peer reviewed
Direct link
