ERIC - Search Results

Showing all 8 results

Influence of Intersectional Routing Modules between Dimensions on Measurement Precision in Multidimensional Multistage Testing

Peer reviewed

Yi-Ling Wu; Yao-Hsuan Huang; Chia-Wen Chen; Po-Hsi Chen – Journal of Educational Measurement, 2025

Multistage testing (MST), a variant of computerized adaptive testing (CAT), differs from conventional CAT in that it is adapted at the module level rather than at the individual item level. Typically, all examinees begin the MST with a linear test form in the first stage, commonly known as the routing stage. In 2020, Han introduced an innovative…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Measurement

A Generalized Objective Function for Computer Adaptive Item Selection

Peer reviewed

Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025

Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

A Highly Adaptive Testing Design for PISA

Peer reviewed

Andreas Frey; Christoph König; Aron Fink – Journal of Educational Measurement, 2025

The highly adaptive testing (HAT) design is introduced as an alternative test design for the Programme for International Student Assessment (PISA). The principle of HAT is to be as adaptive as possible when selecting items while accounting for PISA's nonstatistical constraints and addressing issues concerning PISA such as item position effects.…

Descriptors: Adaptive Testing, Test Construction, Alternative Assessment, Achievement Tests

Utilizing Response Time for Item Selection in On-the-Fly Multistage Adaptive Testing for PISA Assessment

Peer reviewed

Xiuxiu Tang; Yi Zheng; Tong Wu; Kit-Tai Hau; Hua-Hua Chang – Journal of Educational Measurement, 2025

Multistage adaptive testing (MST) has recently been adopted for international large-scale assessments such as the Programme for International Student Assessment (PISA). MST offers improved measurement efficiency over traditional nonadaptive tests and improved practical convenience over single-item-adaptive computerized adaptive testing (CAT). As a…

Descriptors: Reaction Time, Test Items, Achievement Tests, Foreign Countries
