ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	7
Since 2017 (last 10 years)	11
Since 2007 (last 20 years)	20

Descriptor

Bayesian Statistics	44
Test Validity	44
Test Reliability	18
Item Response Theory	10
Comparative Analysis	8
Foreign Countries	8
Item Analysis	8
Statistical Analysis	7
Test Items	7
Adaptive Testing	6
Correlation	6
Criterion Referenced Tests	6
Higher Education	6
Mathematical Models	6
Psychometrics	6
Test Construction	6
College Entrance Examinations	5
Evaluation Methods	5
Predictive Validity	5
Response Style (Tests)	5
Scores	5
Decision Making	4
Factor Analysis	4
Guessing (Tests)	4
Item Banks	4
More ▼

Publication Type

Journal Articles	25
Reports - Research	25
Reports - Evaluative	8
Reports - Descriptive	3
Speeches/Meeting Papers	3
Tests/Questionnaires	3
Information Analyses	2
Collected Works - Proceedings	1
Collected Works - Serials	1

Education Level

Higher Education	5
Postsecondary Education	5
Preschool Education	3
Early Childhood Education	2
Elementary Secondary Education	2
High Schools	2
Elementary Education	1
Grade 1	1
Kindergarten	1
Secondary Education	1

Audience

Researchers

Location

Australia	1
Brazil	1
Germany (Berlin)	1
Mexico	1
Netherlands	1
South Africa	1
Spain	1
Turkey	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	2
Graduate Management Admission…	1
Raven Progressive Matrices	1

What Works Clearinghouse Rating

Showing 1 to 15 of 44 results Save | Export

Optimizing Bayesian Knowledge Tracing with Neural Network Parameter Generation

Peer reviewed
PDF on ERIC

Download full text

Anirudhan Badrinath; Zachary Pardos – Journal of Educational Data Mining, 2025

Bayesian Knowledge Tracing (BKT) is a well-established model for formative assessment, with optimization typically using expectation maximization, conjugate gradient descent, or brute force search. However, one of the flaws of existing optimization techniques for BKT models is convergence to undesirable local minima that negatively impact…

Descriptors: Bayesian Statistics, Intelligent Tutoring Systems, Problem Solving, Audience Response Systems

Sample Size Calculation for Clinical Trials Analyzed with the Meta-Analytic-Predictive Approach

Peer reviewed

Direct link

Qi, Hongchao; Rizopoulos, Dimitris; Rosmalen, Joost – Research Synthesis Methods, 2023

The meta-analytic-predictive (MAP) approach is a Bayesian method to incorporate historical controls in new trials that aims to increase the statistical power and reduce the required sample size. Here we investigate how to calculate the sample size of the new trial when historical data is available, and the MAP approach is used in the analysis. In…

Descriptors: Sample Size, Computation, Meta Analysis, Bayesian Statistics

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Identifying Dynamic Shifts to Careless and Insufficient Effort Behavior in Questionnaire Responses; a Novel Approach and Experimental Validation

Peer reviewed

Direct link

Zachary J. Roman; Patrick Schmidt; Jason M. Miller; Holger Brandt – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Careless and insufficient effort responding (C/IER) is a situation where participants respond to survey instruments without considering the item content. This phenomena adds noise to data leading to erroneous inference. There are multiple approaches to identifying and accounting for C/IER in survey settings, of these approaches the best performing…

Descriptors: Structural Equation Models, Bayesian Statistics, Response Style (Tests), Robustness (Statistics)

Examining the Factor Structure and Its Replicability across Multiple Listening Test Forms: Validity Evidence for the Michigan English Test

Peer reviewed

Direct link

Liu, Tingting; Aryadoust, Vahid; Foo, Stacy – Language Testing, 2022

This study evaluated the validity of the Michigan English Test (MET) Listening Section by investigating its underlying factor structure and the replicability of its factor structure across multiple test forms. Data from 3255 test takers across four forms of the MET Listening Section were used. To investigate the factor structure, each form was…

Descriptors: Factor Structure, Language Tests, Second Language Learning, Second Language Instruction

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Direct link

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

Bayesian Assessment of Undergraduate Students about the Real Function Mathematical Concept

Peer reviewed
PDF on ERIC

Download full text

Rodríguez-Vásquez, Flor Monserrat; Ariza-Hernandez, Francisco J. – EURASIA Journal of Mathematics, Science and Technology Education, 2021

The evaluation of learning in mathematics is a worldwide problem, therefore, new methods are required to assess the understanding of mathematical concepts. In this paper, we propose to use the Item Response Theory to analyze the understanding level of undergraduate students about the real function mathematical concept. The Bayesian approach was…

Descriptors: Bayesian Statistics, Mathematics Education, Item Response Theory, Undergraduate Students

Is It Worthy to Take Account of the "Guessing" in the Performance of the Raven Test? Calling for the Principle of Parsimony for Test Validation

Peer reviewed

Direct link

Lúcio, Patrícia Silva; Vandekerckhove, Joachim; Polanczyk, Guilherme V.; Cogo-Moreira, Hugo – Journal of Psychoeducational Assessment, 2021

The present study compares the fit of two- and three-parameter logistic (2PL and 3PL) models of item response theory in the performance of preschool children on the Raven's Colored Progressive Matrices. The test of Raven is widely used for evaluating nonverbal intelligence of factor g. Studies comparing models with real data are scarce on the…

Descriptors: Guessing (Tests), Item Response Theory, Test Validity, Preschool Children

Comparison of Confirmatory Factor Analysis Estimation Methods on Mixed-Format Data

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021

Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…

Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics

Theoretical Model and Quantitative Assessment of Scientific Thinking and Reasoning

Peer reviewed

Direct link

Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022

Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…

Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills

How Large Is the "Public Domain"? A Comparative Analysis of Ringer's 1961 Copyright Renewal Study and HathiTrust CRMS Data

Peer reviewed

Direct link

Wilkin, John P. – College & Research Libraries, 2017

The 1961 Copyright Office study on renewals, authored by Barbara Ringer, has cast an outsized influence on discussions of the U.S. 1923-1963 public domain. As more concrete data emerge from initiatives such as the large-scale determination process in the Copyright Review Management System (CRMS) project, questions are raised about the reliability…

Descriptors: Comparative Analysis, Copyrights, Misconceptions, Test Reliability

Variance Difference between Maximum Likelihood Estimation Method and Expected A Posteriori Estimation Method Viewed from Number of Test Items

Peer reviewed
PDF on ERIC

Download full text

Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016

The aim of this study is to determine variance difference between maximum likelihood and expected A posteriori estimation methods viewed from number of test items of aptitude test. The variance presents an accuracy generated by both maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…

Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items

Developing a Computer-Based Assessment of Complex Problem Solving in Chemistry

Peer reviewed

Direct link

Scherer, Ronny; Meßinger-Koppelt, Jenny; Tiemann, Rüdiger – International Journal of STEM Education, 2014

Background: Complex problem-solving competence is regarded as a key construct in science education. But due to the necessity of using interactive and intransparent assessment procedures, appropriate measures of the construct are rare. This paper consequently presents the development and validation of a computer-based problem-solving environment,…

Descriptors: Computer Assisted Testing, Problem Solving, Chemistry, Science Tests

Student Wellbeing at a University in Post-Apartheid South Africa: A Comparison with a British University Sample Using the GP-CORE Measure

Peer reviewed

Direct link

Young, Charles; Campbell, Megan – British Journal of Guidance & Counselling, 2014

This article provides GP-CORE norms for a South African university sample, which are compared to published data obtained from a United Kingdom university sample. The measure appears to be both reliable and valid for this multilingual and multicultural South African sample. The profiles of the psychological distress reported by white South African…

Descriptors: Foreign Countries, Well Being, Comparative Analysis, Psychological Needs

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed
PDF on ERIC

Download full text

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3

Educational and Psychological…	3
Applied Psychological…	2
Journal of Educational…	2
Journal of Psychoeducational…	2
British Journal of Guidance &…	1
Bulletin of Science,…	1
College & Research Libraries	1
ETS Research Report Series	1
EURASIA Journal of…	1
Early Child Development and…	1
Educational Research and…	1
Health Education (Washington…	1
International Journal of…	1
International Journal of STEM…	1
Journal of Educational Data…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Experimental…	1
Journal of School Psychology	1
Language Testing	1
Personnel Psychology	1
Physical Review Physics…	1
Psychometrika	1
Research Synthesis Methods	1
Structural Equation Modeling:…	1
More ▼

Jensema, Carl J.	2
Kingston, Neal M.	2
Zwick, Rebecca	2
Anirudhan Badrinath	1
Ariza-Hernandez, Francisco J.	1
Aryadoust, Vahid	1
Aye, Lu	1
Bao, Lei	1
Boldt, Robert F.	1
Braun, Henry I.	1
Campbell, Janell	1
Campbell, Megan	1
Campbell, Richard	1
Carvajal, Jorge	1
Chen, Cheng	1
Chen, Yunxiao	1
Cogo-Moreira, Hugo	1
Dogan, Nuri	1
Eddy, Colleen	1
Faggen, Jane	1
Flore, Paulette C.	1
Foo, Stacy	1
Fritchman, Joseph	1
Haladyna, Tom	1
More ▼