ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	8
Since 2017 (last 10 years)	13
Since 2007 (last 20 years)	22

Descriptor

Bayesian Statistics	31
Test Reliability	31
Item Response Theory	12
Test Validity	10
Foreign Countries	8
Test Items	7
Adaptive Testing	6
Comparative Analysis	6
Computer Assisted Testing	6
Error of Measurement	6
Statistical Analysis	6
Test Construction	6
Measures (Individuals)	5
Undergraduate Students	5
Accuracy	4
Evaluation Methods	4
Mathematical Models	4
Maximum Likelihood Statistics	4
Psychometrics	4
Scores	4
Computation	3
Correlation	3
Criterion Referenced Tests	3
Cutting Scores	3
Elementary Secondary Education	3
More ▼

Source

Educational and Psychological…	4
Applied Measurement in…	2
Journal of Speech, Language,…	2
Regional Educational…	2
British Journal of Guidance &…	1
ETS Research Report Series	1
EURASIA Journal of…	1
Early Child Development and…	1
Education and Information…	1
Educational Research and…	1
International Journal of…	1
Journal of Counseling…	1
Journal of Educational Data…	1
Journal of Educational and…	1
Psychological Methods	1
Scandinavian Journal of…	1
Structural Equation Modeling:…	1
More ▼

Publication Type

Reports - Research	31
Journal Articles	21
Speeches/Meeting Papers	2
Tests/Questionnaires	2

Education Level

Higher Education	8
Postsecondary Education	6
Elementary Secondary Education	3
Secondary Education	3
Early Childhood Education	2
Elementary Education	2
High Schools	2
Grade 4	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Location

New Jersey	2
Taiwan	2
Mexico	1
Netherlands	1
South Africa	1
Spain	1
Sweden	1
Trinidad and Tobago	1
United Kingdom	1

Laws, Policies, & Programs

Every Student Succeeds Act…

Assessments and Surveys

Graduate Record Examinations	1
MacArthur Communicative…	1
National Merit Scholarship…	1
Preliminary Scholastic…	1
Preschool and Kindergarten…	1
School and College Ability…	1
Students Evaluation of…	1
Trends in International…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 31 results Save | Export

Bayesian Maximal Reliability Evaluation Using Latent Variable Modeling

Peer reviewed

Direct link

Tenko Raykov; George A. Marcoulides; Natalja Menold – Applied Measurement in Education, 2024

We discuss an application of Bayesian factor analysis for estimation of the optimal linear combination and associated maximal reliability of a multi-component measuring instrument. The described procedure yields point and credibility interval estimates of this reliability coefficient, which are readily obtained in educational and behavioral…

Descriptors: Bayesian Statistics, Test Reliability, Error of Measurement, Measurement Equipment

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Optimizing Bayesian Knowledge Tracing with Neural Network Parameter Generation

Peer reviewed
PDF on ERIC

Download full text

Anirudhan Badrinath; Zachary Pardos – Journal of Educational Data Mining, 2025

Bayesian Knowledge Tracing (BKT) is a well-established model for formative assessment, with optimization typically using expectation maximization, conjugate gradient descent, or brute force search. However, one of the flaws of existing optimization techniques for BKT models is convergence to undesirable local minima that negatively impact…

Descriptors: Bayesian Statistics, Intelligent Tutoring Systems, Problem Solving, Audience Response Systems

Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error. Appendixes. REL 2025-009

Peer reviewed
PDF on ERIC

Download full text

Regional Educational Laboratory Mid-Atlantic, 2024

These are the appendixes for the report, "Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error." This study applied a stabilization model called Bayesian hierarchical modeling to group-level data (with groups assigned according to demographic designations) within schools in New Jersey with the aim…

Descriptors: Institutional Evaluation, Elementary Secondary Education, Bayesian Statistics, Test Reliability

Identifying Dynamic Shifts to Careless and Insufficient Effort Behavior in Questionnaire Responses; a Novel Approach and Experimental Validation

Peer reviewed

Direct link

Zachary J. Roman; Patrick Schmidt; Jason M. Miller; Holger Brandt – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Careless and insufficient effort responding (C/IER) is a situation where participants respond to survey instruments without considering the item content. This phenomena adds noise to data leading to erroneous inference. There are multiple approaches to identifying and accounting for C/IER in survey settings, of these approaches the best performing…

Descriptors: Structural Equation Models, Bayesian Statistics, Response Style (Tests), Robustness (Statistics)

Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error. REL 2025-009

Peer reviewed
PDF on ERIC

Download full text

Morgan Rosendahl; Brian Gill; Jennifer E. Starling – Regional Educational Laboratory Mid-Atlantic, 2024

The Every Student Succeeds Act of 2015 requires states to use a variety of indicators, including standardized tests and attendance records, to designate schools for support and improvement based on schoolwide performance and the performance of groups of students within schools. Schoolwide and group-level performance indicators are also…

Descriptors: Institutional Evaluation, Elementary Secondary Education, Bayesian Statistics, Test Reliability

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Direct link

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Psychometric Properties of a Swedish Translation of the Preschool and Kindergarten Behavior Scales (PKBS): A Bayesian Structural Equation Modeling Analysis

Peer reviewed

Direct link

Thomas, Sarah; Eichas, Kyle; Eninger, Lilianne; Ferrer-Wreder, Laura – Scandinavian Journal of Educational Research, 2021

This cross-sectional study established the psychometric properties and factor structure of the Preschool and Kindergarten Behavior Scales (PKBS) and an index of empathy in a sample of Swedish four to six year olds (N = 115). Using Bayesian structural equation modeling, we found that a five-factor PKBS and one-factor empathy model provided good fit…

Descriptors: Psychometrics, Swedish, Foreign Countries, Test Construction

Toward Routine Assessments of Auditory Filter Shape

Peer reviewed

Direct link

Shen, Yi; Kern, Allison B.; Richards, Virginia M. – Journal of Speech, Language, and Hearing Research, 2019

Purpose: A Bayesian adaptive procedure, that is, the quick auditory filter (qAF) procedure, has been shown to improve the efficiency for estimating auditory filter shapes of listeners with normal hearing. The current study evaluates the accuracy and test-retest reliability of the qAF procedure for naïve listeners with a variety of ages and hearing…

Descriptors: Auditory Discrimination, Bayesian Statistics, Hearing (Physiology), Hearing Impairments

Bayesian Assessment of Undergraduate Students about the Real Function Mathematical Concept

Peer reviewed
PDF on ERIC

Download full text

Rodríguez-Vásquez, Flor Monserrat; Ariza-Hernandez, Francisco J. – EURASIA Journal of Mathematics, Science and Technology Education, 2021

The evaluation of learning in mathematics is a worldwide problem, therefore, new methods are required to assess the understanding of mathematical concepts. In this paper, we propose to use the Item Response Theory to analyze the understanding level of undergraduate students about the real function mathematical concept. The Bayesian approach was…

Descriptors: Bayesian Statistics, Mathematics Education, Item Response Theory, Undergraduate Students

A Bayesian-Inspired Item Response Theory-Based Framework to Produce Very Short Versions of MacArthur-Bates Communicative Development Inventories

Peer reviewed

Direct link

Chai, Jun Ho; Lo, Chang Huan; Mayor, Julien – Journal of Speech, Language, and Hearing Research, 2020

Purpose: This study introduces a framework to produce very short versions of the MacArthur-Bates Communicative Development Inventories (CDIs) by combining the Bayesian-inspired approach introduced by Mayor and Mani (2019) with an item response theory-based computerized adaptive testing that adapts to the ability of each child, in line with…

Descriptors: Bayesian Statistics, Item Response Theory, Measures (Individuals), Language Skills

The Learning Behaviors Scale: National Standardization in Trinidad and Tobago

Peer reviewed

Direct link

Chao, Jessica L.; McDermott, Paul A.; Watkins, Marley W.; Drogalis, Anna Rhoad; Worrell, Frank C.; Hall, Tracey E. – International Journal of School & Educational Psychology, 2018

This study reports on the national standardization and validation of the Learning Behaviors Scale (LBS) for use in Trinidad and Tobago. The LBS is a teacher rating scale centering on observable behaviors relevant to identifying childhood approaches to classroom learning. Teachers observed a stratified sample of 900 students across the islands'…

Descriptors: Foreign Countries, Program Validation, Behavior Rating Scales, National Standards

Using Testlet Response Theory to Examine Local Dependence in C-Tests

Peer reviewed

Direct link

Eckes, Thomas; Baghaei, Purya – Applied Measurement in Education, 2015

C-tests are gap-filling tests widely used to assess general language proficiency for purposes of placement, screening, or provision of feedback to language learners. C-tests consist of several short texts in which parts of words are missing. We addressed the issue of local dependence in C-tests using an explicit modeling approach based on testlet…

Descriptors: Language Proficiency, Language Tests, Item Response Theory, Test Reliability

Variance Difference between Maximum Likelihood Estimation Method and Expected A Posteriori Estimation Method Viewed from Number of Test Items

Peer reviewed
PDF on ERIC

Download full text

Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016

The aim of this study is to determine variance difference between maximum likelihood and expected A posteriori estimation methods viewed from number of test items of aptitude test. The variance presents an accuracy generated by both maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…

Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3

Huang, Hung-Yu	2
Reckase, Mark D.	2
Wang, Wen-Chung	2
Anirudhan Badrinath	1
Ariza-Hernandez, Francisco J.	1
Baghaei, Purya	1
Brennan, Robert L.	1
Brian Gill	1
Campbell, Megan	1
Chai, Jun Ho	1
Chao, Jessica L.	1
Drogalis, Anna Rhoad	1
Eckes, Thomas	1
Eichas, Kyle	1
Eninger, Lilianne	1
Faggen, Jane	1
Ferrer-Wreder, Laura	1
Flore, Paulette C.	1
Gelbal, Selahattin	1
George A. Marcoulides	1
Guo, Hongwen	1
Hall, Tracey E.	1
Holger Brandt	1
Huynh, Huynh	1
More ▼