ERIC - Search Results

Publication Date

In 2025	7
Since 2024	23
Since 2021 (last 5 years)	112
Since 2016 (last 10 years)	223
Since 2006 (last 20 years)	325

Descriptor

Test Format	587
Test Items	587
Multiple Choice Tests	196
Test Construction	175
Foreign Countries	173
Difficulty Level	147
Higher Education	124
Item Analysis	114
Scores	113
Item Response Theory	105
Comparative Analysis	104
Computer Assisted Testing	94
Test Reliability	80
Test Validity	70
Language Tests	68
English (Second Language)	67
College Students	66
Statistical Analysis	61
Mathematics Tests	60
Second Language Learning	55
Reading Tests	53
Responses	49
Achievement Tests	48
Correlation	48
Undergraduate Students	46
More ▼

Publication Type

Reports - Research	587
Journal Articles	410
Speeches/Meeting Papers	114
Tests/Questionnaires	32
Numerical/Quantitative Data	10
Information Analyses	4
Reports - Evaluative	4
Collected Works - General	1
Collected Works - Serials	1
Opinion Papers	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	118
Postsecondary Education	106
Secondary Education	73
Elementary Education	43
Middle Schools	35
Junior High Schools	27
Grade 8	23
High Schools	22
Intermediate Grades	15
Elementary Secondary Education	14
Grade 4	13
Early Childhood Education	9
Grade 7	9
Grade 6	8
Grade 3	7
Primary Education	7
Grade 5	6
Grade 9	6
Grade 12	3
Kindergarten	3
Preschool Education	2
Grade 1	1
Grade 10	1
Grade 11	1
Grade 2	1
More ▼

Audience

Researchers	22
Practitioners	7
Teachers	5
Administrators	1

Location

Turkey	26
Germany	14
Japan	11
Israel	10
Canada	9
Netherlands	8
Australia	7
United Kingdom	7
Iran	6
Malaysia	5
Sweden	5
United States	5
New Jersey	4
United Kingdom (England)	4
China	3
Hong Kong	3
Indonesia	3
Pennsylvania	3
Philippines	3
South Korea	3
Taiwan	3
Vietnam	3
Belgium	2
Illinois	2
Japan (Tokyo)	2
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
Head Start	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Perkins Loan Program	1

What Works Clearinghouse Rating

Showing 1 to 15 of 587 results Save | Export

Simultaneous Linear Equating for Scenarios with Optional Test Versions or across Multiple Alternative Anchors

Peer reviewed
PDF on ERIC

Download full text

Tom Benton – Practical Assessment, Research & Evaluation, 2025

This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…

Descriptors: Equated Scores, Test Format, Test Items, Computation

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

An Experimental Comparison of Multiple-Choice and Short-Answer Questions on a High-Stakes Test for Medical Students

Peer reviewed

Direct link

Janet Mee; Ravi Pandian; Justin Wolczynski; Amy Morales; Miguel Paniagua; Polina Harik; Peter Baldwin; Brian E. Clauser – Advances in Health Sciences Education, 2024

Recent advances in automated scoring technology have made it practical to replace multiple-choice questions (MCQs) with short-answer questions (SAQs) in large-scale, high-stakes assessments. However, most previous research comparing these formats has used small examinee samples testing under low-stakes conditions. Additionally, previous studies…

Descriptors: Multiple Choice Tests, High Stakes Tests, Test Format, Test Items

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Direct link

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Do Subject Matter Experts' Judgments of Multiple-Choice Format Suitability Predict Item Quality?

Peer reviewed

Direct link

Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023

To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…

Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level

Constructing a Robust Score Scale from IRT Scores with Informed Boundaries

Peer reviewed

Direct link

Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…

Descriptors: Scores, Item Response Theory, Test Items, Test Format

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed
PDF on ERIC

Download full text

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

From Likert to Forced Choice: Statement Parameter Invariance and Context Effects in Personality Assessment

Peer reviewed

Direct link

Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024

Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…

Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence

Application of Two-Parameter Item Response Theory for Determining Form-Dependent Items on Exams Using Different Item Orders

Peer reviewed
PDF on ERIC

Download full text

Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023

Using multiple versions of an assessment has the potential to introduce item environment effects. These types of effects result in version dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and resulting implications are important for all levels of assessment where multiple forms of an assessment…

Descriptors: Item Response Theory, Test Items, Test Format, Science Tests

Effect of Missing Data on Test Equating Methods Under NEAT Design

Peer reviewed
PDF on ERIC

Download full text

Semih Asiret; Seçil Ömür Sünbül – International Journal of Psychology and Educational Studies, 2023

In this study, it was aimed to examine the effect of missing data in different patterns and sizes on test equating methods under the NEAT design for different factors. For this purpose, as part of this study, factors such as sample size, average difficulty level difference between the test forms, difference between the ability distribution,…

Descriptors: Research Problems, Data, Test Items, Equated Scores

On the Relationship between Item Stem Formulation and Criterion Validity of Multiple-Component Measuring Instruments

Peer reviewed

Direct link

Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022

The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…

Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level

The Effect of Question Positioning on Data Quality in Web Surveys

Peer reviewed

Direct link

Cornelia Eva Neuert – Sociological Methods & Research, 2024

The quality of data in surveys is affected by response burden and questionnaire length. With an increasing number of questions, respondents can become bored, tired, and annoyed and may take shortcuts to reduce the effort needed to complete the survey. In this article, direct evidence is presented on how the position of items within a web…

Descriptors: Online Surveys, Test Items, Test Format, Test Construction

Eye Movements and Reading Comprehension Performance: Examining the Relationships among Test Format, Working Memory Capacity and Reading Comprehension

Peer reviewed

Direct link

Corrin Moss; Sharon Kwabi; Scott P. Ardoin; Katherine S. Binder – Reading and Writing: An Interdisciplinary Journal, 2024

The ability to form a mental model of a text is an essential component of successful reading comprehension (RC), and purpose for reading can influence mental model construction. Participants were assigned to one of two conditions during an RC test to alter their purpose for reading: concurrent (texts and questions were presented simultaneously)…

Descriptors: Eye Movements, Reading Comprehension, Test Format, Short Term Memory

Impact of Differential Item Functioning on Item Model Fit Using Concurrent Equating Method

Peer reviewed

Direct link

Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025

This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyzes were performed using the Dichotomous Rasch Model in the WINSTEPS…

Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 40

Educational and Psychological…	36
Applied Measurement in…	24
Journal of Educational…	24
ETS Research Report Series	16
Language Testing	13
International Journal of…	11
Journal of Experimental…	11
Educational Assessment	8
Language Assessment Quarterly	8
Practical Assessment,…	8
Applied Psychological…	7
Grantee Submission	7
International Journal of…	7
Educational Measurement:…	6
Online Submission	6
Journal of Psychoeducational…	5
Journal of Technology,…	5
Teaching of Psychology	5
College Board	4
Field Methods	4
Language Testing in Asia	4
Advances in Health Sciences…	3
Assessment & Evaluation in…	3
Education and Information…	3
Educational Research Quarterly	3
More ▼

Plake, Barbara S.	11
Huntley, Renee M.	7
Kim, Sooyeon	7
Katz, Irvin R.	6
Allalouf, Avi	5
Bulut, Okan	4
DeMars, Christine E.	4
Goldhammer, Frank	4
Keehner, Madeleine	4
Sykes, Robert C.	4
Tollefson, Nona	4
Ackerman, Terry	3
Anderson, Paul S.	3
Baghaei, Purya	3
Downing, Steven M.	3
Han, Kyung T.	3
Kim, Doyoung	3
Lee, Won-Chan	3
McLean, Stuart	3
Moon, Jung Aa	3
O'Grady, Stefan	3
Pommerich, Mary	3
Powers, Donald E.	3
Sireci, Stephen G.	3
More ▼

Program for International…	13
National Assessment of…	11
Graduate Record Examinations	9
ACT Assessment	8
SAT (College Admission Test)	8
Test of English as a Foreign…	8
Trends in International…	8
Advanced Placement…	4
Peabody Picture Vocabulary…	4
International English…	3
Stanford Achievement Tests	3
State Trait Anxiety Inventory	3
Test of English for…	3
College Level Examination…	2
Embedded Figures Test	2
Graduate Management Admission…	2
Iowa Tests of Basic Skills	2
Mathematics Anxiety Rating…	2
Preliminary Scholastic…	2
Academic Motivation Scale	1
Armed Services Vocational…	1
Beck Depression Inventory	1
Clinical Evaluation of…	1
Computer Attitude Scale	1
Cornell Critical Thinking Test	1
More ▼