Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 58 |
| Since 2017 (last 10 years) | 90 |
| Since 2007 (last 20 years) | 157 |
Descriptor
| Bayesian Statistics | 194 |
| Evaluation Methods | 194 |
| Models | 70 |
| Simulation | 40 |
| Comparative Analysis | 36 |
| Computation | 33 |
| Item Response Theory | 30 |
| Hypothesis Testing | 29 |
| Statistical Analysis | 28 |
| Probability | 26 |
| Data Analysis | 24 |
| More ▼ | |
Source
Author
| Chun Wang | 3 |
| Lee, Michael D. | 3 |
| Lee, Sik-Yum | 3 |
| Bejar, Isaac I. | 2 |
| Beretvas, S. Natasha | 2 |
| David Kaplan | 2 |
| Houston, Walter M. | 2 |
| James Ohisei Uanhoro | 2 |
| Jihong Zhang | 2 |
| Jing Lu | 2 |
| Jiwei Zhang | 2 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 7 |
| Administrators | 1 |
| Students | 1 |
| Teachers | 1 |
Location
| Australia | 2 |
| Germany | 2 |
| Italy | 2 |
| Brazil | 1 |
| California | 1 |
| China | 1 |
| Florida | 1 |
| Florida (Miami) | 1 |
| Iceland | 1 |
| Louisiana | 1 |
| Missouri | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Program for International… | 6 |
| Early Childhood Longitudinal… | 3 |
| National Longitudinal Study… | 1 |
| Trends in International… | 1 |
| Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Levy, Roy – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
Digital games offer an appealing environment for assessing student proficiencies, including skills and misconceptions in a diagnostic setting. This paper proposes a dynamic Bayesian network modeling approach for observations of student performance from an educational video game. A Bayesian approach to model construction, calibration, and use in…
Descriptors: Video Games, Educational Games, Bayesian Statistics, Observation
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods
Lu, Hongjing; Chen, Dawn; Holyoak, Keith J. – Psychological Review, 2012
How can humans acquire relational representations that enable analogical inference and other forms of high-level reasoning? Using comparative relations as a model domain, we explore the possibility that bottom-up learning mechanisms applied to objects coded as feature vectors can yield representations of relations sufficient to solve analogy…
Descriptors: Inferences, Thinking Skills, Comparative Analysis, Models
Morey, Richard D.; Rouder, Jeffrey N. – Psychological Methods, 2011
Psychological theories are statements of constraint. The role of hypothesis testing in psychology is to test whether specific theoretical constraints hold in data. Bayesian statistics is well suited to the task of finding supporting evidence for constraint, because it allows for comparing evidence for 2 hypotheses against each another. One issue…
Descriptors: Evidence, Intervals, Testing, Hypothesis Testing
Perfors, Amy; Tenenbaum, Joshua B.; Griffiths, Thomas L.; Xu, Fei – Cognition, 2011
We present an introduction to Bayesian inference as it is used in probabilistic models of cognitive development. Our goal is to provide an intuitive and accessible guide to the "what", the "how", and the "why" of the Bayesian approach: what sorts of problems and data the framework is most relevant for, and how and why it may be useful for…
Descriptors: Bayesian Statistics, Cognitive Psychology, Inferences, Cognitive Development
Kuiper, Rebecca M.; Hoijtink, Herbert – Psychological Methods, 2010
This article discusses comparisons of means using exploratory and confirmatory approaches. Three methods are discussed: hypothesis testing, model selection based on information criteria, and Bayesian model selection. Throughout the article, an example is used to illustrate and evaluate the two approaches and the three methods. We demonstrate that…
Descriptors: Models, Testing, Hypothesis Testing, Probability
Klugkist, Irene; van Wesel, Floryt; Bullens, Jessie – International Journal of Behavioral Development, 2011
Null hypothesis testing (NHT) is the most commonly used tool in empirical psychological research even though it has several known limitations. It is argued that since the hypotheses evaluated with NHT do not reflect the research-question or theory of the researchers, conclusions from NHT must be formulated with great modesty, that is, they cannot…
Descriptors: Psychological Studies, Hypothesis Testing, Researchers, Evaluation Methods
Zhang, Zhidong; Lu, Jingyan – International Education Studies, 2014
The changes of learning environments and the advancement of learning theories have increasingly demanded for feedback that can describe learning progress trajectories. Effective assessment should be able to evaluate how learners acquire knowledge and develop problem solving skills. Additionally, it should identify what issues these learners have…
Descriptors: Medical Students, Student Evaluation, Feedback (Response), Task Analysis
Jenkins, Melissa M.; Youngstrom, Eric A.; Youngstrom, Jennifer Kogos; Feeny, Norah C.; Findling, Robert L. – Psychological Assessment, 2012
Bipolar disorder is frequently clinically diagnosed in youths who do not actually satisfy Diagnostic and Statistical Manual of Mental Disorders (4th ed., text revision; DSM-IV-TR; American Psychiatric Association, 1994) criteria, yet cases that would satisfy full DSM-IV-TR criteria are often undetected clinically. Evidence-based assessment methods…
Descriptors: Evidence, Mental Health, Mental Disorders, Clinical Diagnosis
Bolfarine, Heleno; Bazan, Jorge Luis – Journal of Educational and Behavioral Statistics, 2010
A Bayesian inference approach using Markov Chain Monte Carlo (MCMC) is developed for the logistic positive exponent (LPE) model proposed by Samejima and for a new skewed Logistic Item Response Theory (IRT) model, named Reflection LPE model. Both models lead to asymmetric item characteristic curves (ICC) and can be appropriate because a symmetric…
Descriptors: Markov Processes, Item Response Theory, Bayesian Statistics, Monte Carlo Methods
Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010
In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…
Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance
Ruscio, John – Assessment, 2009
Determining whether individuals belong to different latent classes (taxa) or vary along one or more latent factors (dimensions) has implications for assessment. For example, no instrument can simultaneously maximize the efficiency of categorical and continuous measurement. Methods such as taxometric analysis can test the relative fit of taxonic…
Descriptors: Classification, Measurement, Measurement Techniques, Evaluation Research
Zajonc, Tristan – ProQuest LLC, 2012
Effective policymaking requires understanding the causal effects of competing proposals. Relevant causal quantities include proposals' expected effect on different groups of recipients, the impact of policies over time, the potential trade-offs between competing objectives, and, ultimately, the optimal policy. This dissertation studies causal…
Descriptors: Public Policy, Policy Formation, Bayesian Statistics, Economic Development
Kaplan, David; Turner, Alyn – OECD Publishing (NJ1), 2012
The OECD Program for International Student Assessment (PISA) and the OECD Teaching and Learning International Survey (TALIS) constitute two of the largest ongoing international student and teacher surveys presently underway. Data generated from these surveys offer researchers and policy-makers opportunities to identify particular educational…
Descriptors: Outcomes of Education, Teacher Surveys, Policy Analysis, Educational Change
Killeen, Peter R. – Psychological Methods, 2010
Lecoutre, Lecoutre, and Poitevineau (2010) have provided sophisticated grounding for "p[subscript rep]." Computing it precisely appears, fortunately, no more difficult than doing so approximately. Their analysis will help move predictive inference into the mainstream. Iverson, Wagenmakers, and Lee (2010) have also validated…
Descriptors: Replication (Evaluation), Measurement Techniques, Research Design, Research Methodology

Peer reviewed
Direct link
