NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 5 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Verhavert, San; Bouwer, Renske; Donche, Vincent; De Maeyer, Sven – Assessment in Education: Principles, Policy & Practice, 2019
Comparative Judgement (CJ) aims to improve the quality of performance-based assessments by letting multiple assessors judge pairs of performances. CJ is generally associated with high levels of reliability, but there is also a large variation in reliability between assessments. This study investigates which assessment characteristics influence the…
Descriptors: Meta Analysis, Reliability, Comparative Analysis, Value Judgment
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Xiong, Xiaolu; Zhao, Siyuan; Van Inwegen, Eric G.; Beck, Joseph E. – International Educational Data Mining Society, 2016
Over the last couple of decades, there have been a large variety of approaches towards modeling student knowledge within intelligent tutoring systems. With the booming development of deep learning and large-scale artificial neural networks, there have been empirical successes in a number of machine learning and data mining applications, including…
Descriptors: Intelligent Tutoring Systems, Computer Software, Bayesian Statistics, Knowledge Level
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Galyardt, April; Goldin, Ilya – Journal of Educational Data Mining, 2015
In educational technology and learning sciences, there are multiple uses for a predictive model of whether a student will perform a task correctly or not. For example, an intelligent tutoring system may use such a model to estimate whether or not a student has mastered a skill. We analyze the significance of data recency in making such…
Descriptors: Achievement Rating, Performance Based Assessment, Bayesian Statistics, Data Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Zhu, Xiaowen; Stone, Clement A. – Educational and Psychological Measurement, 2012
This study examined the relative effectiveness of Bayesian model comparison methods in selecting an appropriate graded response (GR) model for performance assessment applications. Three popular methods were considered: deviance information criterion (DIC), conditional predictive ordinate (CPO), and posterior predictive model checking (PPMC). Using…
Descriptors: Bayesian Statistics, Item Response Theory, Comparative Analysis, Models
Iseli, Markus R.; Koenig, Alan D.; Lee, John J.; Wainess, Richard – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
Assessment of complex task performance is crucial to evaluating personnel in critical job functions such as Navy damage control operations aboard ships. Games and simulations can be instrumental in this process, as they can present a broad range of complex scenarios without involving harm to people or property. However, "automatic"…
Descriptors: Performance Tests, Performance Based Assessment, Decision Making Skills, Military Training