Publication Date
| Range | Records |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 90 |
| Since 2017 (last 10 years) | 210 |
| Since 2007 (last 20 years) | 324 |
Descriptor
| Descriptor | Records |
| --- | --- |
| Test Format | 593 |
| Test Items | 593 |
| Multiple Choice Tests | 196 |
| Test Construction | 176 |
| Foreign Countries | 175 |
| Difficulty Level | 150 |
| Higher Education | 124 |
| Item Analysis | 115 |
| Scores | 115 |
| Comparative Analysis | 106 |
| Item Response Theory | 105 |
Author
| Author | Records |
| --- | --- |
| Plake, Barbara S. | 11 |
| Huntley, Renee M. | 7 |
| Kim, Sooyeon | 7 |
| Katz, Irvin R. | 6 |
| Allalouf, Avi | 5 |
| Bulut, Okan | 4 |
| DeMars, Christine E. | 4 |
| Goldhammer, Frank | 4 |
| Keehner, Madeleine | 4 |
| Sykes, Robert C. | 4 |
| Tollefson, Nona | 4 |
Audience
| Audience | Records |
| --- | --- |
| Researchers | 22 |
| Practitioners | 7 |
| Teachers | 5 |
| Administrators | 1 |
Location
| Location | Records |
| --- | --- |
| Turkey | 26 |
| Germany | 14 |
| Japan | 11 |
| Israel | 10 |
| Canada | 9 |
| Netherlands | 8 |
| Australia | 7 |
| United Kingdom | 7 |
| Iran | 6 |
| Malaysia | 6 |
| Sweden | 5 |
Laws, Policies, & Programs
| Law / Program | Records |
| --- | --- |
| Elementary and Secondary… | 1 |
| Head Start | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Perkins Loan Program | 1 |
Tom Benton – Practical Assessment, Research & Evaluation, 2025
This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…
Descriptors: Equated Scores, Test Format, Test Items, Computation
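The proposed extension sits behind the truncated abstract, so it is not reproduced here. As background, textbook linear equating places a score from form X on the scale of form Y by matching the two forms' means and standard deviations; the sketch below is that baseline only, with illustrative function names, not Benton's extension:

```python
import numpy as np

def linear_equate(x, scores_x, scores_y):
    """Textbook linear equating: place score x from form X on the
    scale of form Y by matching the forms' means and SDs."""
    mu_x, sd_x = np.mean(scores_x), np.std(scores_x, ddof=1)
    mu_y, sd_y = np.mean(scores_y), np.std(scores_y, ddof=1)
    return mu_y + (sd_y / sd_x) * (x - mu_x)
```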
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
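The four multidimensional linking approaches are cut off in the snippet. As a point of reference only, the simplest unidimensional relative is mean/sigma linking, which estimates a scale transformation from the common items' difficulty parameters; this sketch (the names and the unidimensional simplification are mine) is background, not one of the study's bifactor methods:

```python
import numpy as np

def mean_sigma_constants(b_new, b_old):
    """Mean/sigma linking: find A, B with theta_old = A*theta_new + B
    from common-item difficulties estimated in each calibration."""
    b_new, b_old = np.asarray(b_new), np.asarray(b_old)
    A = np.std(b_old, ddof=1) / np.std(b_new, ddof=1)
    B = np.mean(b_old) - A * np.mean(b_new)
    return A, B

def rescale_items(a, b, A, B):
    """Put new-calibration item parameters on the old scale."""
    return np.asarray(a) / A, A * np.asarray(b) + B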
Janet Mee; Ravi Pandian; Justin Wolczynski; Amy Morales; Miguel Paniagua; Polina Harik; Peter Baldwin; Brian E. Clauser – Advances in Health Sciences Education, 2024
Recent advances in automated scoring technology have made it practical to replace multiple-choice questions (MCQs) with short-answer questions (SAQs) in large-scale, high-stakes assessments. However, most previous research comparing these formats has used small examinee samples testing under low-stakes conditions. Additionally, previous studies…
Descriptors: Multiple Choice Tests, High Stakes Tests, Test Format, Test Items
Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025
It is common to find mixed-format data resulting from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring and using suitable measurement models to estimate latent abilities. Past research in educational…
Descriptors: Responses, Test Items, Test Format, Grade 8
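A common way to model such mixed MC/CR data is a joint likelihood that pairs a dichotomous model for the MC items with a polytomous one for the CR items. The 2PL/GPCM pairing below is one standard choice, shown for orientation; the snippet does not say which models the authors used:

```latex
% MC items: two-parameter logistic (2PL)
P(u_{ij} = 1 \mid \theta_j) = \frac{1}{1 + e^{-a_i(\theta_j - b_i)}}
% CR items: generalized partial credit model (GPCM),
% with the convention \sum_{v=1}^{0}(\cdot) = 0
P(x_{ij} = k \mid \theta_j) =
  \frac{\exp \sum_{v=1}^{k} a_i(\theta_j - b_{iv})}
       {\sum_{c=0}^{m_i} \exp \sum_{v=1}^{c} a_i(\theta_j - b_{iv})}
```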
Hasibe Yahsi Sari; Hulya Kelecioglu – International Journal of Assessment Tools in Education, 2025
This simulation-based study examines the effect of the ratio of polytomous items on ability estimation under different conditions in mixed-format multistage tests (MST). Drawing on the PISA 2018 administration, the individuals' ability parameters and the item pool were created using the item parameters estimated from…
Descriptors: Test Items, Test Format, Accuracy, Test Length
Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023
To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…
Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level
Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022
In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct θ, such as cognitive ability in a content domain. Estimates of θ, also called IRT scores or θ̂, can be computed using estimators based on the likelihood function, such as maximum likelihood…
Descriptors: Scores, Item Response Theory, Test Items, Test Format
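As a concrete illustration of the likelihood-based scoring the abstract refers to, the sketch below computes the maximum likelihood estimate of θ for one examinee under a 2PL model by grid search. The model choice and implementation are assumptions for illustration, not the paper's setup:

```python
import numpy as np

def mle_theta(responses, a, b, grid=np.linspace(-4, 4, 801)):
    """ML estimate of theta for one examinee under a 2PL model.
    responses: 0/1 vector; a, b: item discriminations/difficulties.
    Grid search keeps the sketch dependency-free; note that all-correct
    or all-incorrect patterns pile up at the grid boundary."""
    a, b, u = map(np.asarray, (a, b, responses))
    p = 1.0 / (1.0 + np.exp(-a * (grid[:, None] - b)))  # grid x items
    loglik = (u * np.log(p) + (1 - u) * np.log(1 - p)).sum(axis=1)
    return grid[np.argmax(loglik)]
```

For example, `mle_theta([1, 0, 1], a=[1.2, 0.8, 1.5], b=[-0.5, 0.3, 1.0])` returns the grid point maximizing the response-pattern likelihood.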
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
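Of the two methods compared, the residual-correlation check is straightforward to sketch: fit an IRT model, standardize the residuals, and inspect item-pair correlations that survive conditioning on ability. This generic Rasch-based version is illustrative, not the authors' exact procedure:

```python
import numpy as np

def residual_correlations(U, theta, b):
    """Standardized Rasch residuals and their item-pair correlations.
    U: persons x items 0/1 matrix; theta: abilities; b: difficulties.
    Large positive off-diagonal entries hint at local item dependence."""
    theta, b = np.asarray(theta), np.asarray(b)
    P = 1.0 / (1.0 + np.exp(-(theta[:, None] - b)))
    Z = (np.asarray(U) - P) / np.sqrt(P * (1 - P))
    return np.corrcoef(Z, rowvar=False)  # items x items
```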
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale were developed: 10 positive items were included in the first form (Form-P), and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)
Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024
Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…
Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence
Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023
Using multiple versions of an assessment has the potential to introduce item environment effects. These types of effects result in version dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and resulting implications are important for all levels of assessment where multiple forms of an assessment…
Descriptors: Item Response Theory, Test Items, Test Format, Science Tests
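One simple screen for the version-dependent item characteristics described here is to calibrate each version separately, place the calibrations on a common scale, and flag common items whose difficulty estimates disagree beyond sampling error. The z-style check below is a generic illustration, not the paper's detection method:

```python
import numpy as np

def flag_version_effects(b_v1, se_v1, b_v2, se_v2, crit=1.96):
    """Flag items whose difficulties differ across two test versions.
    Assumes both calibrations are already on a common scale.
    b_*: difficulty estimates; se_*: their standard errors."""
    b_v1, se_v1, b_v2, se_v2 = map(np.asarray, (b_v1, se_v1, b_v2, se_v2))
    z = (b_v1 - b_v2) / np.sqrt(se_v1**2 + se_v2**2)
    return np.abs(z) > crit  # True where an item looks version-dependent
```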
Necati Taskin – International Journal of Technology in Education, 2025
This study examines the effect of item order (random, increasingly difficult, and decreasingly difficult) on student performance, test parameters, and student perceptions in multiple-choice tests administered in a paper-and-pencil format after online learning. In the research conducted using an explanatory sequential mixed methods design,…
Descriptors: Test Items, Difficulty Level, Online Courses, College Freshmen
Semih Asiret; Seçil Ömür Sünbül – International Journal of Psychology and Educational Studies, 2023
This study examines how missing data of different patterns and sizes affect test equating methods under the NEAT design. To this end, factors such as sample size, the average difficulty-level difference between the test forms, the difference between the ability distributions,…
Descriptors: Research Problems, Data, Test Items, Equated Scores
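For orientation, the NEAT design links nonequivalent groups through a common anchor test. One standard baseline in such settings is chained linear equating, which composes two linear functions through the anchor; the sketch below is that generic baseline, not necessarily among the methods this study compares:

```python
import numpy as np

def _linear(x, mu_from, sd_from, mu_to, sd_to):
    """Linear function matching means and SDs of two score scales."""
    return mu_to + (sd_to / sd_from) * (x - mu_from)

def chained_linear_neat(x, X1, V1, V2, Y2):
    """Chained linear equating through anchor V under the NEAT design.
    X1, V1: form-X and anchor scores observed in group 1;
    V2, Y2: anchor and form-Y scores observed in group 2."""
    m = lambda s: (np.mean(s), np.std(s, ddof=1))
    v = _linear(x, *m(X1), *m(V1))      # form X -> anchor scale (group 1)
    return _linear(v, *m(V2), *m(Y2))   # anchor -> form Y scale (group 2)
```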
Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022
The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…
Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level
Cornelia Eva Neuert – Sociological Methods & Research, 2024
The quality of data in surveys is affected by response burden and questionnaire length. With an increasing number of questions, respondents can become bored, tired, and annoyed and may take shortcuts to reduce the effort needed to complete the survey. In this article, direct evidence is presented on how the position of items within a web…
Descriptors: Online Surveys, Test Items, Test Format, Test Construction
