Badrinath, Anirudhan; Pardos, Zachary – Journal of Educational Data Mining, 2025
Bayesian Knowledge Tracing (BKT) is a well-established model for formative assessment, with optimization typically using expectation maximization, conjugate gradient descent, or brute force search. However, one of the flaws of existing optimization techniques for BKT models is convergence to undesirable local minima that negatively impact…
Descriptors: Bayesian Statistics, Intelligent Tutoring Systems, Problem Solving, Audience Response Systems
Qi, Hongchao; Rizopoulos, Dimitris; Rosmalen, Joost – Research Synthesis Methods, 2023
The meta-analytic-predictive (MAP) approach is a Bayesian method to incorporate historical controls in new trials that aims to increase the statistical power and reduce the required sample size. Here we investigate how to calculate the sample size of the new trial when historical data is available, and the MAP approach is used in the analysis. In…
Descriptors: Sample Size, Computation, Meta Analysis, Bayesian Statistics
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Roman, Zachary J.; Schmidt, Patrick; Miller, Jason M.; Brandt, Holger – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Careless and insufficient effort responding (C/IER) is a situation where participants respond to survey instruments without considering the item content. This phenomenon adds noise to data, leading to erroneous inference. There are multiple approaches to identifying and accounting for C/IER in survey settings; of these approaches, the best performing…
Descriptors: Structural Equation Models, Bayesian Statistics, Response Style (Tests), Robustness (Statistics)
Liu, Tingting; Aryadoust, Vahid; Foo, Stacy – Language Testing, 2022
This study evaluated the validity of the Michigan English Test (MET) Listening Section by investigating its underlying factor structure and the replicability of its factor structure across multiple test forms. Data from 3255 test takers across four forms of the MET Listening Section were used. To investigate the factor structure, each form was…
Descriptors: Factor Structure, Language Tests, Second Language Learning, Second Language Instruction
Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023
When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…
Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation
Rodríguez-Vásquez, Flor Monserrat; Ariza-Hernandez, Francisco J. – EURASIA Journal of Mathematics, Science and Technology Education, 2021
The evaluation of learning in mathematics is a worldwide problem; therefore, new methods are required to assess the understanding of mathematical concepts. In this paper, we propose to use Item Response Theory to analyze undergraduate students' level of understanding of the mathematical concept of a real function. The Bayesian approach was…
Descriptors: Bayesian Statistics, Mathematics Education, Item Response Theory, Undergraduate Students
Lúcio, Patrícia Silva; Vandekerckhove, Joachim; Polanczyk, Guilherme V.; Cogo-Moreira, Hugo – Journal of Psychoeducational Assessment, 2021
The present study compares the fit of two- and three-parameter logistic (2PL and 3PL) models of item response theory to the performance of preschool children on Raven's Colored Progressive Matrices. Raven's test is widely used to evaluate nonverbal intelligence (factor g). Studies comparing models with real data are scarce on the…
Descriptors: Guessing (Tests), Item Response Theory, Test Validity, Preschool Children
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022
Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…
Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills
Wilkin, John P. – College & Research Libraries, 2017
The 1961 Copyright Office study on renewals, authored by Barbara Ringer, has cast an outsized influence on discussions of the U.S. 1923-1963 public domain. As more concrete data emerge from initiatives such as the large-scale determination process in the Copyright Review Management System (CRMS) project, questions are raised about the reliability…
Descriptors: Comparative Analysis, Copyrights, Misconceptions, Test Reliability

