ERIC - Search Results

Publication Date

In 2025	9
Since 2024	31

Publication Type

Journal Articles	29
Reports - Research	23
Information Analyses	3
Dissertations/Theses -…	2
Reports - Descriptive	2
Reports - Evaluative	2

Education Level

Higher Education	11
Postsecondary Education	11
Secondary Education	4
Elementary Education	2
Grade 8	2
Junior High Schools	2
Middle Schools	2
Early Childhood Education	1
Elementary Secondary Education	1
Preschool Education	1

Audience

Location

Turkey	2
United Kingdom	2
China	1
Italy	1
Japan (Tokyo)	1
New Zealand	1
Oman	1
Poland	1
Portugal	1
Thailand	1
United Kingdom (England)	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Head Start

Assessments and Surveys

California Critical Thinking…	1
Cornell Critical Thinking Test	1
International English…	1
Phonological Awareness…	1
Program for International…	1
Trends in International…	1
Watson Glaser Critical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 31 results Save | Export

Simultaneous Linear Equating for Scenarios with Optional Test Versions or across Multiple Alternative Anchors

Peer reviewed
PDF on ERIC

Download full text

Tom Benton – Practical Assessment, Research & Evaluation, 2025

This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…

Descriptors: Equated Scores, Test Format, Test Items, Computation

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

An Experimental Comparison of Multiple-Choice and Short-Answer Questions on a High-Stakes Test for Medical Students

Peer reviewed

Direct link

Janet Mee; Ravi Pandian; Justin Wolczynski; Amy Morales; Miguel Paniagua; Polina Harik; Peter Baldwin; Brian E. Clauser – Advances in Health Sciences Education, 2024

Recent advances in automated scoring technology have made it practical to replace multiple-choice questions (MCQs) with short-answer questions (SAQs) in large-scale, high-stakes assessments. However, most previous research comparing these formats has used small examinee samples testing under low-stakes conditions. Additionally, previous studies…

Descriptors: Multiple Choice Tests, High Stakes Tests, Test Format, Test Items

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Direct link

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed
PDF on ERIC

Download full text

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

From Likert to Forced Choice: Statement Parameter Invariance and Context Effects in Personality Assessment

Peer reviewed

Direct link

Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024

Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…

Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Direct link

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

The Effect of Question Positioning on Data Quality in Web Surveys

Peer reviewed

Direct link

Cornelia Eva Neuert – Sociological Methods & Research, 2024

The quality of data in surveys is affected by response burden and questionnaire length. With an increasing number of questions, respondents can become bored, tired, and annoyed and may take shortcuts to reduce the effort needed to complete the survey. In this article, direct evidence is presented on how the position of items within a web…

Descriptors: Online Surveys, Test Items, Test Format, Test Construction

Eye Movements and Reading Comprehension Performance: Examining the Relationships among Test Format, Working Memory Capacity and Reading Comprehension

Peer reviewed

Direct link

Corrin Moss; Sharon Kwabi; Scott P. Ardoin; Katherine S. Binder – Reading and Writing: An Interdisciplinary Journal, 2024

The ability to form a mental model of a text is an essential component of successful reading comprehension (RC), and purpose for reading can influence mental model construction. Participants were assigned to one of two conditions during an RC test to alter their purpose for reading: concurrent (texts and questions were presented simultaneously)…

Descriptors: Eye Movements, Reading Comprehension, Test Format, Short Term Memory

Impact of Differential Item Functioning on Item Model Fit Using Concurrent Equating Method

Peer reviewed

Direct link

Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025

This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyzes were performed using the Dichotomous Rasch Model in the WINSTEPS…

Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests

Improvement of the Quality of Question Papers for Online Examinations toward Simultaneous Enhancement of Students' Learning

Peer reviewed

Direct link

Srikanth Allamsetty; M. V. S. S. Chandra; Neelima Madugula; Byamakesh Nayak – IEEE Transactions on Learning Technologies, 2024

The present study is related to the problem associated with student assessment with online examinations at higher educational institutes (HEIs). With the current COVID-19 outbreak, the majority of educational institutes are conducting online examinations to assess their students, where there would always be a chance that the students go for…

Descriptors: Computer Assisted Testing, Accountability, Higher Education, Comparative Analysis

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2 | 3

Applied Measurement in…	2
Measurement:…	2
Practical Assessment,…	2
ProQuest LLC	2
AERA Open	1
Advances in Health Sciences…	1
Education and Information…	1
Educational Psychology Review	1
Educational and Psychological…	1
IEEE Transactions on Learning…	1
Innovations in Education and…	1
Interactive Learning…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Education and…	1
Journal of Educational…	1
Language Assessment Quarterly	1
Language Testing	1
Language Testing in Asia	1
Large-scale Assessments in…	1
Reading and Writing: An…	1
Reading in a Foreign Language	1
Research Matters	1
Sociological Methods &…	1
More ▼

Test Format	31
Test Items	31
Foreign Countries	13
Item Response Theory	11
Item Analysis	7
Scores	7
Test Construction	7
Computer Assisted Testing	6
Comparative Analysis	5
Difficulty Level	5
Language Tests	5
Mathematics Tests	5
Second Language Learning	5
Achievement Tests	4
College Students	4
English (Second Language)	4
Models	4
Multiple Choice Tests	4
Accuracy	3
Artificial Intelligence	3
Equated Scores	3
Eye Movements	3
Measurement Techniques	3
Test Validity	3
Adaptive Testing	2
More ▼

Jianbin Fu	2
Patrick C. Kyllonen	2
Tim Stoeckel	2
Xuan Tan	2
Agnieszka Slezak-Swiat	1
Ahmed Al - Badri	1
Allan S. Cohen	1
Amy Morales	1
Bianca A. Simonsmeier	1
Brian E. Clauser	1
Byamakesh Nayak	1
Chandima Daskon	1
Christina M. Cassano	1
Cornelia Eva Neuert	1
Corrin Moss	1
Daniela S.M. Pereira	1
Duyen Thi Bich Nguyen	1
Emma Walland	1
Filipe Manuel Vidal Falcão	1
Fu-Yun Yu	1
George Engelhard	1
Gülsen Tasdelen Teker	1
Hung Tan Ha	1
Inga Laukaityte	1
Janet Mee	1
More ▼