ERIC - Search Results

Publication Date

In 2025	7
Since 2024	29
Since 2021 (last 5 years)	140
Since 2016 (last 10 years)	271
Since 2006 (last 20 years)	435

Descriptor

Test Format	943
Test Items	943
Test Construction	360
Multiple Choice Tests	260
Foreign Countries	222
Difficulty Level	195
Higher Education	179
Computer Assisted Testing	156
Item Response Theory	148
Item Analysis	146
Scores	143
Comparative Analysis	141
Test Validity	122
Test Reliability	116
Language Tests	108
Mathematics Tests	93
English (Second Language)	88
Scoring	86
Achievement Tests	82
Testing	77
Reading Tests	76
Elementary Secondary Education	72
Student Evaluation	72
Science Tests	71
Statistical Analysis	71
More ▼

Education Level

Higher Education	146
Postsecondary Education	123
Secondary Education	94
Elementary Education	60
Middle Schools	45
Elementary Secondary Education	36
Junior High Schools	35
Grade 8	34
High Schools	31
Intermediate Grades	24
Grade 4	22
Grade 6	10
Grade 7	10
Early Childhood Education	9
Grade 3	9
Grade 5	9
Primary Education	8
Grade 12	7
Grade 9	6
Kindergarten	3
Preschool Education	3
Grade 1	2
Grade 10	2
Grade 2	2
Grade 11	1
More ▼

Audience

Practitioners	62
Teachers	46
Researchers	31
Students	15
Administrators	12
Parents	6
Policymakers	4
Community	1
Counselors	1

Location

Turkey	26
Canada	15
Germany	15
Australia	13
Israel	13
Japan	12
Netherlands	10
United Kingdom	9
United States	9
Arizona	6
Iran	6
Sweden	6
China	5
Malaysia	5
New Jersey	5
South Korea	5
United Kingdom (England)	5
Louisiana	4
Belgium	3
Florida	3
Hong Kong	3
Indonesia	3
Nigeria	3
Ohio	3
Oregon	3
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	2
Elementary and Secondary…	1
Head Start	1
Job Training Partnership Act…	1
Perkins Loan Program	1

What Works Clearinghouse Rating

Showing 1 to 15 of 943 results Save | Export

Simultaneous Linear Equating for Scenarios with Optional Test Versions or across Multiple Alternative Anchors

Peer reviewed
PDF on ERIC

Download full text

Tom Benton – Practical Assessment, Research & Evaluation, 2025

This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…

Descriptors: Equated Scores, Test Format, Test Items, Computation

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

An Experimental Comparison of Multiple-Choice and Short-Answer Questions on a High-Stakes Test for Medical Students

Peer reviewed

Direct link

Janet Mee; Ravi Pandian; Justin Wolczynski; Amy Morales; Miguel Paniagua; Polina Harik; Peter Baldwin; Brian E. Clauser – Advances in Health Sciences Education, 2024

Recent advances in automated scoring technology have made it practical to replace multiple-choice questions (MCQs) with short-answer questions (SAQs) in large-scale, high-stakes assessments. However, most previous research comparing these formats has used small examinee samples testing under low-stakes conditions. Additionally, previous studies…

Descriptors: Multiple Choice Tests, High Stakes Tests, Test Format, Test Items

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Direct link

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Do Subject Matter Experts' Judgments of Multiple-Choice Format Suitability Predict Item Quality?

Peer reviewed

Direct link

Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023

To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…

Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level

Constructing a Robust Score Scale from IRT Scores with Informed Boundaries

Peer reviewed

Direct link

Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…

Descriptors: Scores, Item Response Theory, Test Items, Test Format

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed
PDF on ERIC

Download full text

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

From Likert to Forced Choice: Statement Parameter Invariance and Context Effects in Personality Assessment

Peer reviewed

Direct link

Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024

Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…

Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence

Application of Two-Parameter Item Response Theory for Determining Form-Dependent Items on Exams Using Different Item Orders

Peer reviewed
PDF on ERIC

Download full text

Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023

Using multiple versions of an assessment has the potential to introduce item environment effects. These types of effects result in version dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and resulting implications are important for all levels of assessment where multiple forms of an assessment…

Descriptors: Item Response Theory, Test Items, Test Format, Science Tests

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Direct link

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Effect of Missing Data on Test Equating Methods Under NEAT Design

Peer reviewed
PDF on ERIC

Download full text

Semih Asiret; Seçil Ömür Sünbül – International Journal of Psychology and Educational Studies, 2023

In this study, it was aimed to examine the effect of missing data in different patterns and sizes on test equating methods under the NEAT design for different factors. For this purpose, as part of this study, factors such as sample size, average difficulty level difference between the test forms, difference between the ability distribution,…

Descriptors: Research Problems, Data, Test Items, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 63

Educational and Psychological…	48
Journal of Educational…	33
Applied Measurement in…	31
ProQuest LLC	28
ETS Research Report Series	16
Language Testing	16
International Journal of…	15
Educational Measurement:…	14
Practical Assessment,…	13
Journal of Experimental…	11
Applied Psychological…	10
Educational Assessment	10
International Journal of…	8
Language Assessment Quarterly	8
College Board	7
Grantee Submission	7
Online Submission	7
Journal of Psychoeducational…	6
Journal of Technology,…	6
Assessment & Evaluation in…	5
Teaching of Psychology	5
Advances in Health Sciences…	4
Field Methods	4
Journal of Economic Education	4
Language Testing in Asia	4
More ▼

Plake, Barbara S.	12
Kim, Sooyeon	9
Huntley, Renee M.	8
Wainer, Howard	8
Haladyna, Thomas M.	7
Katz, Irvin R.	7
van der Linden, Wim J.	7
Allalouf, Avi	6
DeMars, Christine E.	5
Downing, Steven M.	5
Hambleton, Ronald K.	5
Sykes, Robert C.	5
Walker, Michael E.	5
Anderson, Paul S.	4
Bulut, Okan	4
Goldhammer, Frank	4
Herman, Joan	4
Keehner, Madeleine	4
Lawrence, Ida M.	4
Martinez, Michael E.	4
Pommerich, Mary	4
Sireci, Stephen G.	4
Stansfield, Charles W.	4
Stocking, Martha L.	4
More ▼

Reports - Research	585
Journal Articles	552
Speeches/Meeting Papers	172
Reports - Evaluative	153
Reports - Descriptive	86
Tests/Questionnaires	54
Guides - Non-Classroom	46
Dissertations/Theses -…	29
Information Analyses	29
Opinion Papers	19
Guides - Classroom - Teacher	18
Numerical/Quantitative Data	13
Guides - Classroom - Learner	4
Reference Materials - General	4
Books	3
Collected Works - General	3
ERIC Publications	3
Guides - General	3
Non-Print Media	3
Reference Materials -…	3
Computer Programs	2
ERIC Digests in Full Text	2
Multilingual/Bilingual…	2
Book/Product Reviews	1
Collected Works - Proceedings	1
More ▼

National Assessment of…	21
SAT (College Admission Test)	18
Trends in International…	15
Program for International…	14
Test of English as a Foreign…	13
ACT Assessment	12
Graduate Record Examinations	12
Advanced Placement…	8
Peabody Picture Vocabulary…	5
General Educational…	3
Graduate Management Admission…	3
International English…	3
Preliminary Scholastic…	3
Stanford Achievement Tests	3
State Trait Anxiety Inventory	3
Test of English for…	3
Texas Educational Assessment…	3
Armed Services Vocational…	2
College Level Examination…	2
Cornell Critical Thinking Test	2
Embedded Figures Test	2
Gates MacGinitie Reading Tests	2
Iowa Tests of Basic Skills	2
Law School Admission Test	2
Mathematics Anxiety Rating…	2
More ▼