Publication Date
In 2025: 17
Since 2024: 73
Since 2021 (last 5 years): 278
Since 2016 (last 10 years): 509
Since 2006 (last 20 years): 827

Descriptor
Item Analysis: 1478
Test Items: 1478
Test Construction: 477
Foreign Countries: 378
Difficulty Level: 369
Test Validity: 295
Item Response Theory: 264
Test Reliability: 243
Multiple Choice Tests: 236
Comparative Analysis: 227
Scores: 202

Author
Reckase, Mark D.: 16
Tindal, Gerald: 13
Hambleton, Ronald K.: 12
Alonzo, Julie: 10
Plake, Barbara S.: 9
Dorans, Neil J.: 8
Weiss, David J.: 8
Gierl, Mark J.: 7
Lai, Cheng Fei: 7
Lord, Frederic M.: 6
McKinley, Robert L.: 6

Location
Turkey: 42
Canada: 24
Australia: 21
Iran: 20
Japan: 18
Germany: 16
China: 15
United States: 14
Taiwan: 11
Indonesia: 10
Oregon: 10

Laws, Policies, & Programs
Individuals with Disabilities…: 9
No Child Left Behind Act 2001: 6
Elementary and Secondary…: 2
Every Student Succeeds Act…: 2
Education Consolidation…: 1
National Defense Education Act: 1

What Works Clearinghouse Rating
Does not meet standards: 1
Miguel A. García-Pérez – Educational and Psychological Measurement, 2024
A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…
Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
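The multidimensional bifactor linking methods compared in that study go beyond a snippet, but the underlying task, placing item parameters estimated on two forms onto a common scale via common items, can be illustrated with classical unidimensional mean/sigma linking. All parameter values below are hypothetical:

```python
# Mean/sigma linking for unidimensional IRT item parameters.
# Given difficulty estimates for the common items on the base form (b_base)
# and on the new form (b_new), find A, B such that b_base ≈ A*b_new + B.
from statistics import mean, stdev

b_base = [-1.2, -0.4, 0.3, 1.1, 1.8]   # hypothetical common-item difficulties, base scale
b_new  = [-1.0, -0.2, 0.5, 1.3, 2.0]   # same items as estimated on the new form's scale

A = stdev(b_base) / stdev(b_new)        # slope of the scale transformation
B = mean(b_base) - A * mean(b_new)      # intercept

# Transform every new-form parameter onto the base scale:
# difficulties become A*b + B, discriminations become a / A.
b_transformed = [A * b + B for b in b_new]
```

With these made-up values the new form is a pure shift of the base form, so the transformation recovers the base-scale difficulties exactly; real calibrations differ by sampling error, which is why more robust criteria such as Stocking-Lord are often preferred.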
Chan Zhang; Shuaiying Cao; Minglei Wang; Jiangyan Wang; Lirui He – Field Methods, 2025
Previous research on grid questions has mostly focused on their comparability with the item-by-item method and the use of shading to help respondents navigate through a grid. This study extends prior work by examining whether lexical similarity among grid items affects how respondents answer the questions in an experiment where we manipulated…
Descriptors: Foreign Countries, Surveys, Test Construction, Design
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
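Of the three DIF methods the abstract names, Mantel-Haenszel is the simplest to sketch: examinees are stratified by matched total score, and a common odds ratio compares the reference and focal groups' odds of answering the studied item correctly. The counts below are hypothetical:

```python
# Mantel-Haenszel DIF sketch: common odds ratio across score strata.
# Each stratum: (ref_correct, ref_total, focal_correct, focal_total),
# with examinees matched on total test score.
import math

strata = [
    (30, 40, 20, 35),   # low scorers
    (60, 70, 45, 60),   # middle scorers
    (45, 48, 38, 44),   # high scorers
]

num = den = 0.0
for rc, rn, fc, fn in strata:
    ri, fi = rn - rc, fn - fc        # incorrect counts per group
    n = rn + fn                      # stratum size N_k
    num += rc * fi / n               # A_k * D_k / N_k
    den += ri * fc / n               # B_k * C_k / N_k

alpha_mh = num / den                 # common odds ratio (1.0 = no DIF)
delta_mh = -2.35 * math.log(alpha_mh)  # ETS delta scale; |delta| >= 1.5 suggests at least moderate DIF
```

Here `alpha_mh` comes out above 1, meaning the reference group has higher odds of success at matched ability, the pattern MH flags as potential DIF.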
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
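The model-based IRT corrections compared in that paper are too involved for a snippet, but the construct itself, extreme response style, can be described with a raw indicator: the share of a respondent's answers that fall in the endpoint categories of the scale. A minimal sketch for a 5-point scale:

```python
# Descriptive extreme-response-style (ERS) indicator: the fraction of a
# respondent's answers in the endpoint categories of a Likert scale.
# This is only a raw index, not the model-based correction the study examines.

def ers_index(responses, low=1, high=5):
    """Fraction of responses equal to the lowest or highest category."""
    extreme = sum(1 for r in responses if r in (low, high))
    return extreme / len(responses)

moderate_respondent = [2, 3, 3, 4, 2, 3, 4, 3]
extreme_respondent  = [1, 5, 5, 1, 5, 1, 5, 5]
```

Two respondents with the same underlying trait level can thus get very different observed scores if one of them habitually picks the endpoints, which is exactly the validity threat ERS models try to correct.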
Hauke Hermann; Annemieke Witte; Gloria Kempelmann; Brian F. Barrett; Sandra Zaal; Jolanda Vonk; Filip Morisse; Anna Pöhlmann; Paula S. Sterkenburg; Tanja Sappok – Journal of Applied Research in Intellectual Disabilities, 2024
Background: Valid and reliable instruments for measuring emotional development are critical for a proper diagnostic assignment in individuals with intellectual disabilities. This exploratory study examined the psychometric properties of the items on the Scale of Emotional Development--Short (SED-S). Method: The sample included 612 adults with…
Descriptors: Measures (Individuals), Emotional Development, Intellectual Disability, Psychometrics
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows students' ability to be estimated from any set of items. IRT also allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
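The two item characteristics the abstract mentions, difficulty and discrimination, enter the standard two-parameter logistic (2PL) response function as follows; this is the generic textbook form, not necessarily the exact model used in the study:

```python
# Two-parameter logistic (2PL) IRT model: the probability of a correct
# response depends on ability theta, item difficulty b, and discrimination a.
import math

def p_correct(theta, a, b):
    """P(X=1 | theta) = 1 / (1 + exp(-a * (theta - b)))."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# At theta == b every item is answered correctly with probability 0.5;
# a larger discrimination `a` makes the curve steeper around b, so the
# item separates nearby ability levels more sharply.
```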
Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023
In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…
Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items
Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024
Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…
Descriptors: Influences, Models, Measurement Techniques, Reliability
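For a single-factor model, a coefficient omega has a closed form in the factor loadings and residual variances, which makes the abstract's point concrete: the estimate is only as good as the fitted model behind it. A sketch with hypothetical standardized estimates:

```python
# Coefficient omega for a unidimensional factor model:
# omega = (sum of loadings)^2 / ((sum of loadings)^2 + sum of residual variances).
# The loadings below are hypothetical standardized estimates.

loadings  = [0.7, 0.6, 0.8, 0.5]           # factor loadings lambda_i
residuals = [1 - l**2 for l in loadings]   # residual variances under standardization

s = sum(loadings)
omega = s**2 / (s**2 + sum(residuals))     # reliability of the composite score
```

If the one-factor model is misspecified (say, the items are really bifactor), the loadings are biased and so is omega, which is the dependence on correct model specification that the article investigates.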
Hwanggyu Lim; Kyung T. Han – Educational Measurement: Issues and Practice, 2024
Computerized adaptive testing (CAT) has gained deserved popularity in the administration of educational and professional assessments, but continues to face test security challenges. To ensure sustained quality assurance and testing integrity, it is imperative to establish and maintain multiple stable item pools that are consistent in terms of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
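The pool-maintenance problem the abstract describes sits on top of the basic CAT loop, in which each next item is typically the unused one with maximum Fisher information at the current ability estimate. A minimal sketch for a 2PL pool with made-up item parameters:

```python
# Maximum-information item selection, the classic CAT rule.
# For the 2PL model, Fisher information is I(theta) = a^2 * P * (1 - P).
import math

def info_2pl(theta, a, b):
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

# Hypothetical pool: item -> (discrimination a, difficulty b)
pool = {"item1": (1.8, -1.0), "item2": (1.2, 0.0),
        "item3": (2.0, 0.1), "item4": (0.9, 1.5)}
theta_hat = 0.0            # current ability estimate
administered = {"item1"}   # items already given

best = max((i for i in pool if i not in administered),
           key=lambda i: info_2pl(theta_hat, *pool[i]))
```

Because this rule greedily favors highly discriminating items near the current estimate, a handful of items get overexposed, which is one reason programs need the multiple, statistically parallel pools the article discusses.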
Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024
Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…
Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods
Youmi Suk; Kyung T. Han – Journal of Educational and Behavioral Statistics, 2024
As algorithmic decision making is increasingly deployed in every walk of life, many researchers have raised concerns about fairness-related bias from such algorithms. But there is little research on harnessing psychometric methods to uncover potential discriminatory bias inside decision-making algorithms. The main goal of this article is to…
Descriptors: Psychometrics, Ethics, Decision Making, Algorithms
Justin L. Kern – Journal of Educational and Behavioral Statistics, 2024
Given the frequent presence of slipping and guessing in item responses, models for the inclusion of their effects are highly important. Unfortunately, the most common model for their inclusion, the four-parameter item response theory model, potentially has severe deficiencies related to its possible unidentifiability. With this issue in mind, the…
Descriptors: Item Response Theory, Models, Bayesian Statistics, Generalization
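The four-parameter model the abstract refers to extends the 2PL curve with a guessing floor c and a slipping ceiling d; its standard form (generic textbook parameterization, not the authors' specific variant) is:

```python
# Four-parameter logistic (4PL) IRT model: guessing (c) lifts the lower
# asymptote and slipping (1 - d) lowers the upper asymptote.
import math

def p_4pl(theta, a, b, c, d):
    """P(X=1 | theta) = c + (d - c) / (1 + exp(-a * (theta - b)))."""
    return c + (d - c) / (1.0 + math.exp(-a * (theta - b)))

# With c = 0.2 and d = 0.95, probabilities stay inside (0.2, 0.95):
# very low-ability examinees can still guess correctly, and very
# high-ability examinees can still slip.
```

Because several combinations of (a, b, c, d) can produce nearly the same curve, the parameters are weakly identified from response data alone, the deficiency that motivates the Bayesian treatment the abstract mentions.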