ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	16

Source

Applied Measurement in…

Publication Type

Journal Articles	18
Reports - Research	15
Reports - Evaluative	2
Reports - Descriptive	1

Education Level

Higher Education	2
Middle Schools	2
Postsecondary Education	2
Elementary Education	1
Grade 10	1
Grade 11	1
Grade 4	1
Grade 9	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Secondary Education	1
More ▼

Audience

Practitioners

Location

Canada	1
New York	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Applied Measurement in Education X

Showing 1 to 15 of 18 results Save | Export

Bayesian Maximal Reliability Evaluation Using Latent Variable Modeling

Peer reviewed

Direct link

Tenko Raykov; George A. Marcoulides; Natalja Menold – Applied Measurement in Education, 2024

We discuss an application of Bayesian factor analysis for estimation of the optimal linear combination and associated maximal reliability of a multi-component measuring instrument. The described procedure yields point and credibility interval estimates of this reliability coefficient, which are readily obtained in educational and behavioral…

Descriptors: Bayesian Statistics, Test Reliability, Error of Measurement, Measurement Equipment

Bayesian Logistic Regression: A New Method to Calibrate Pretest Items in Multistage Adaptive Testing

Peer reviewed

Direct link

TsungHan Ho – Applied Measurement in Education, 2023

An operational multistage adaptive test (MST) requires the development of a large item bank and the effort to continuously replenish the item bank due to concerns about test security and validity over the long term. New items should be pretested and linked to the item bank before being used operationally. The linking item volume fluctuations in…

Descriptors: Bayesian Statistics, Regression (Statistics), Test Items, Pretesting

Using Bayesian Networks for Cognitive Assessment of Student Understanding of Buoyancy: A Granular Hierarchy Model

Peer reviewed

Direct link

Wang, Ling Ling; Jian, Sun Xiao; Liu, Yan Lou; Xin, Tao – Applied Measurement in Education, 2023

Cognitive diagnostic assessment based on Bayesian networks (BN) is developed in this paper to evaluate student understanding of the physical concept of buoyancy. we propose a three-order granular-hierarchy BN model which accounts for both fine-grained attributes and high-level proficiencies. Conditional independence in the BN structure is tested…

Descriptors: Bayesian Statistics, Networks, Cognitive Measurement, Diagnostic Tests

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Bayesian Estimation and Testing of a Linear Logistic Test Model for Learning during the Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021

The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…

Descriptors: Bayesian Statistics, Computation, Learning, Testing

Dissecting Knowledge, Guessing, and Blunder in Multiple Choice Assessments

Peer reviewed

Direct link

Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023

Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…

Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models

Using Bayesian Networks to Characterize Student Performance across Multiple Assessments of Individual Standards

Peer reviewed

Direct link

Xu, Jiajun; Dadey, Nathan – Applied Measurement in Education, 2022

This paper explores how student performance across the full set of multiple modular assessments of individual standards, which we refer to as mini-assessments, from a large scale, operational program of interim assessment can be summarized using Bayesian networks. We follow a completely data-driven approach in which no constraints are imposed to…

Descriptors: Bayesian Statistics, Learning Analytics, Scores, Academic Achievement

Dynamic Bayesian Networks in Educational Measurement: Reviewing and Advancing the State of the Field

Peer reviewed

Direct link

Reichenberg, Ray – Applied Measurement in Education, 2018

As the popularity of rich assessment scenarios increases so must the availability of psychometric models capable of handling the resulting data. Dynamic Bayesian networks (DBNs) offer a fast, flexible option for characterizing student ability across time under psychometrically complex conditions. In this article, a brief introduction to DBNs is…

Descriptors: Bayesian Statistics, Measurement, Student Evaluation, Psychometrics

Using the Bayes Factors to Evaluate Person Fit in the Item Response Theory

Peer reviewed

Direct link

Pan, Tianshu; Yin, Yue – Applied Measurement in Education, 2017

In this article, we propose using the Bayes factors (BF) to evaluate person fit in item response theory models under the framework of Bayesian evaluation of an informative diagnostic hypothesis. We first discuss the theoretical foundation for this application and how to analyze person fit using BF. To demonstrate the feasibility of this approach,…

Descriptors: Bayesian Statistics, Goodness of Fit, Item Response Theory, Monte Carlo Methods

Sensitivity of School-Performance Ratings to Scaling Decisions

Peer reviewed

Direct link

Ng, Hui Leng; Koretz, Daniel – Applied Measurement in Education, 2015

Policymakers usually leave decisions about scaling the scores used for accountability to their appointed technical advisory committees and the testing contractors. However, scaling decisions can have an appreciable impact on school ratings. Using middle-school data from New York State, we examined the consistency of school ratings based on two…

Descriptors: School Effectiveness, Scaling, Middle Schools, Accountability

Using Testlet Response Theory to Examine Local Dependence in C-Tests

Peer reviewed

Direct link

Eckes, Thomas; Baghaei, Purya – Applied Measurement in Education, 2015

C-tests are gap-filling tests widely used to assess general language proficiency for purposes of placement, screening, or provision of feedback to language learners. C-tests consist of several short texts in which parts of words are missing. We addressed the issue of local dependence in C-tests using an explicit modeling approach based on testlet…

Descriptors: Language Proficiency, Language Tests, Item Response Theory, Test Reliability

A Bayesian Hierarchical Selection Model for Academic Growth with Missing Data

Peer reviewed

Direct link

Allen, Jeff – Applied Measurement in Education, 2017

Using a sample of schools testing annually in grades 9-11 with a vertically linked series of assessments, a latent growth curve model is used to model test scores with student intercepts and slopes nested within school. Missed assessments can occur because of student mobility, student dropout, absenteeism, and other reasons. Missing data…

Descriptors: Achievement Gains, Academic Achievement, Growth Models, Scores

Parameter Recovery and Classification Accuracy under Conditions of Testlet Dependency: A Comparison of the Traditional 2PL, Testlet, and Bi-Factor Models

Peer reviewed

Direct link

Koziol, Natalie A. – Applied Measurement in Education, 2016

Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…

Descriptors: Classification, Accuracy, Comparative Analysis, Models

Collateral Information for Equating in Small Samples: A Preliminary Investigation

Peer reviewed

Direct link

Kim, Sooyeon; Livingston, Samuel A.; Lewis, Charles – Applied Measurement in Education, 2011

This article describes a preliminary investigation of an empirical Bayes (EB) procedure for using collateral information to improve equating of scores on test forms taken by small numbers of examinees. Resampling studies were done on two different forms of the same test. In each study, EB and non-EB versions of two equating methods--chained linear…

Descriptors: Sample Size, Equated Scores, Bayesian Statistics, Accuracy

Item Selection and Ability Estimation Procedures for a Mixed-Format Adaptive Test

Peer reviewed

Direct link

Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012

In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Previous Page | Next Page »

Pages: 1 | 2

Bayesian Statistics	18
Test Items	7
Comparative Analysis	6
Item Response Theory	6
Computation	5
Models	4
Scores	4
Adaptive Testing	3
Computer Assisted Testing	3
Evaluation Methods	3
Maximum Likelihood Statistics	3
Probability	3
Simulation	3
Academic Achievement	2
Accuracy	2
Classification	2
Correlation	2
Difficulty Level	2
Error of Measurement	2
Goodness of Fit	2
Item Analysis	2
Knowledge Level	2
Monte Carlo Methods	2
Regression (Statistics)	2
Responses	2
More ▼

Abu-Ghazalah, Rashid M.	1
Allen, Jeff	1
Baghaei, Purya	1
Chen, Lisue	1
Dadey, Nathan	1
Dodd, Barbara G.	1
Du, Yi	1
Dubins, David N.	1
Eckes, Thomas	1
Gao, Furong	1
George A. Marcoulides	1
Ho, Tsung-Han	1
Jian, Sun Xiao	1
Kim, Sooyeon	1
Kim, Stella Yun	1
Koretz, Daniel	1
Koziol, Natalie A.	1
Lee, Won-Chan	1
Lewis, Charles	1
Liu, Yan Lou	1
Livingston, Samuel A.	1
Lozano, José H.	1
Natalja Menold	1
Ng, Hui Leng	1
More ▼