ERIC - Search Results

Publication Date

In 2026	0
Since 2025	8
Since 2022 (last 5 years)	17
Since 2017 (last 10 years)	29
Since 2007 (last 20 years)	47

Descriptor

Computer Assisted Testing	100
Test Items	57
Adaptive Testing	53
Test Construction	36
Item Response Theory	28
Simulation	19
Item Banks	18
Comparative Analysis	17
Scores	14
Automation	12
Psychometrics	12
Scoring	12
Test Format	12
Higher Education	10
Item Analysis	10
College Students	9
Difficulty Level	9
Evaluation Methods	8
Models	8
Test Length	8
Comparative Testing	7
Mathematics Tests	7
Accuracy	6
Error Patterns	6
Measurement Techniques	6

Source

Journal of Educational…	100

Publication Type

Journal Articles	100
Reports - Research	59
Reports - Evaluative	28
Reports - Descriptive	8
Speeches/Meeting Papers	6
Book/Product Reviews	3
Information Analyses	2
Opinion Papers	1

Education Level

Higher Education	2
Secondary Education	2
Elementary Education	1
Postsecondary Education	1

Audience

Researchers

Location

United Kingdom

Assessments and Surveys

Graduate Record Examinations	4
Indiana Statewide Testing for…	2
Program for International…	2
Advanced Placement…	1

Showing 1 to 15 of 100 results

Two-Phase Content-Balancing CD-CAT Online Item Calibration

Peer reviewed

Jing Huang; Yuxiao Zhang; Jason W. Morphew; Jayson M. Nissen; Ben Van Dusen; Hua-Hua Chang – Journal of Educational Measurement, 2025

Online calibration estimates new item parameters alongside previously calibrated items, supporting efficient item replenishment. However, most existing online calibration procedures for Cognitive Diagnostic Computerized Adaptive Testing (CD-CAT) lack mechanisms to ensure content balance during live testing. This limitation can lead to uneven…

Descriptors: Adaptive Testing, Computer Assisted Testing, Cognitive Measurement, Test Items

A Highly Adaptive Testing Design for PISA

Peer reviewed

Andreas Frey; Christoph König; Aron Fink – Journal of Educational Measurement, 2025

The highly adaptive testing (HAT) design is introduced as an alternative test design for the Programme for International Student Assessment (PISA). The principle of HAT is to be as adaptive as possible when selecting items while accounting for PISA's nonstatistical constraints and addressing issues concerning PISA such as item position effects.…

Descriptors: Adaptive Testing, Test Construction, Alternative Assessment, Achievement Tests

Utilizing Response Time for Item Selection in On-the-Fly Multistage Adaptive Testing for PISA Assessment

Peer reviewed

Xiuxiu Tang; Yi Zheng; Tong Wu; Kit-Tai Hau; Hua-Hua Chang – Journal of Educational Measurement, 2025

Multistage adaptive testing (MST) has been recently adopted for international large-scale assessments such as Programme for International Student Assessment (PISA). MST offers improved measurement efficiency over traditional nonadaptive tests and improved practical convenience over single-item-adaptive computerized adaptive testing (CAT). As a…

Descriptors: Reaction Time, Test Items, Achievement Tests, Foreign Countries

Influence of Intersectional Routing Modules between Dimensions on Measurement Precision in Multidimensional Multistage Testing

Peer reviewed

Yi-Ling Wu; Yao-Hsuan Huang; Chia-Wen Chen; Po-Hsi Chen – Journal of Educational Measurement, 2025

Multistage testing (MST), a variant of computerized adaptive testing (CAT), differs from conventional CAT in that it is adapted at the module level rather than at the individual item level. Typically, all examinees begin the MST with a linear test form in the first stage, commonly known as the routing stage. In 2020, Han introduced an innovative…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Measurement

A Generalized Objective Function for Computer Adaptive Item Selection

Peer reviewed

Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025

Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Score Comparability between Online Proctored and In-Person Credentialing Exams

Peer reviewed

Jones, Paul; Tong, Ye; Liu, Jinghua; Borglum, Joshua; Primoli, Vince – Journal of Educational Measurement, 2022

This article studied two methods to detect mode effects in two credentialing exams. In Study 1, we used a "modal scale comparison approach," where the same pool of items was calibrated separately, without transformation, within two TC cohorts (TC1 and TC2) and one OP cohort (OP1) matched on their pool-based scale score distributions. The…

Descriptors: Scores, Credentials, Licensing Examinations (Professions), Computer Assisted Testing

Toward Argument-Based Fairness with an Application to AI-Enhanced Educational Assessments

Peer reviewed

A. Corinne Huggins-Manley; Brandon M. Booth; Sidney K. D'Mello – Journal of Educational Measurement, 2022

The field of educational measurement places validity and fairness as central concepts of assessment quality. Prior research has proposed embedding fairness arguments within argument-based validity processes, particularly when fairness is conceived as comparability in assessment properties across groups. However, we argue that a more flexible…

Descriptors: Educational Assessment, Persuasive Discourse, Validity, Artificial Intelligence

Using Response Time in Multidimensional Computerized Adaptive Testing

Peer reviewed

He, Yinhong; Qi, Yuanyuan – Journal of Educational Measurement, 2023

In multidimensional computerized adaptive testing (MCAT), item selection strategies are generally constructed based on responses, and they do not consider the response times required by items. This study constructed two new criteria (referred to as DT-inc and DT) for MCAT item selection by utilizing information from response times. The new designs…

Descriptors: Reaction Time, Adaptive Testing, Computer Assisted Testing, Test Items

Online Monitoring of Test-Taking Behavior Based on Item Responses and Response Times

Peer reviewed

Han, Suhwa; Kang, Hyeon-Ah – Journal of Educational Measurement, 2023

The study presents multivariate sequential monitoring procedures for examining test-taking behaviors online. The procedures monitor examinee's responses and response times and signal aberrancy as soon as significant change is detected in the test-taking behavior. The study in particular proposes three schemes to track different…

Descriptors: Test Wiseness, Student Behavior, Item Response Theory, Computer Assisted Testing

Pretest Item Calibration in Computerized Multistage Adaptive Testing

Peer reviewed

Ersen, Rabia Karatoprak; Lee, Won-Chan – Journal of Educational Measurement, 2023

The purpose of this study was to compare calibration and linking methods for placing pretest item parameter estimates on the item pool scale in a 1-3 computerized multistage adaptive testing design in terms of item parameter recovery. Two models were used: embedded-section, in which pretest items were administered within a separate module, and…

Descriptors: Pretesting, Test Items, Computer Assisted Testing, Adaptive Testing

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

Efficiency of Targeted Multistage Calibration Designs under Practical Constraints: A Simulation Study

Peer reviewed

Berger, Stéphanie; Verschoor, Angela J.; Eggen, Theo J. H. M.; Moser, Urs – Journal of Educational Measurement, 2019

Calibration of an item bank for computer adaptive testing requires substantial resources. In this study, we investigated whether the efficiency of calibration under the Rasch model could be enhanced by improving the match between item difficulty and student ability. We introduced targeted multistage calibration designs, a design type that…

Descriptors: Simulation, Computer Assisted Testing, Test Items, Difficulty Level


Author

Vispoel, Walter P.	5
Bennett, Randy Elliot	4
Bridgeman, Brent	4
Chang, Hua-Hua	4
Wainer, Howard	4
van der Linden, Wim J.	4
Rock, Donald A.	3
Stocking, Martha L.	3
Tatsuoka, Kikumi K.	3
Veldkamp, Bernard P.	3
Bejar, Isaac I.	2
Bleiler, Timothy	2
Cai, Yan	2
Chen, Shu-Ying	2
Choi, Seung W.	2
Finkelman, Matthew	2
Kang, Hyeon-Ah	2
Kim, Dong-In	2
Lewis, Charles	2
Li, Jie	2
Luecht, Richard M.	2
Morley, Mary	2
Potenza, Maria T.	2
Roussos, Louis A.	2