ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	17
Since 2016 (last 10 years)	33
Since 2006 (last 20 years)	72

Descriptor

Evaluation Methods	104
Item Analysis	104
Test Items	104
Test Construction	40
Item Response Theory	31
Test Bias	19
Computer Assisted Testing	18
Difficulty Level	17
Comparative Analysis	16
Test Validity	16
Foreign Countries	14
Simulation	14
Data Analysis	13
Mathematics Tests	13
Models	13
Student Evaluation	13
Psychometrics	12
Statistical Analysis	12
Internet	11
Measures (Individuals)	11
Academic Standards	10
Computer System Design	10
Evaluation Criteria	10
Item Banks	10
Multiple Choice Tests	10
More ▼

Publication Type

Journal Articles	67
Reports - Research	61
Reports - Evaluative	25
Numerical/Quantitative Data	9
Speeches/Meeting Papers	8
Reports - Descriptive	7
Tests/Questionnaires	7
Dissertations/Theses -…	3
Information Analyses	3
Opinion Papers	2
Guides - General	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Education Level

Elementary Secondary Education	15
Elementary Education	14
Secondary Education	12
Higher Education	10
High Schools	6
Postsecondary Education	5
Grade 4	3
Grade 5	3
Grade 6	3
Grade 8	3
Intermediate Grades	3
Early Childhood Education	2
Kindergarten	2
Middle Schools	2
Grade 10	1
Grade 11	1
Grade 2	1
Grade 3	1
Grade 7	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Researchers	3
Teachers	2

Location

Oregon	8
Australia	2
Taiwan	2
California	1
China	1
Florida	1
Germany	1
India	1
Italy	1
Malaysia	1
North Carolina (Charlotte)	1
Turkey	1
United Kingdom	1
United Kingdom (England)	1
United States	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…

Assessments and Surveys

National Assessment of…	2
Program for International…	2
Flesch Kincaid Grade Level…	1
Florida Comprehensive…	1
Graduate Record Examinations	1
International English…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 104 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items

Peer reviewed

Direct link

Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024

Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…

Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Setting and Validating Multiple Standards on a Multistage-Adaptive Test

Peer reviewed

Direct link

Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022

Setting cut scores on (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…

Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis

Using Cumulative Sum Control Chart to Detect Aberrant Responses in Educational Assessments

Peer reviewed
PDF on ERIC

Download full text

Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023

Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…

Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Direct link

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Dimension-Corrected Somers' D for the Item Analysis Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

A new index of item discrimination power (IDP), dimension-corrected Somers' D (D2) is proposed. Somers' D is one of the superior alternatives for item-total- (Rit) and item-rest correlation (Rir) in reflecting the real IDP with items with scales 0/1 and 0/1/2, that is, up to three categories. D also reaches the extreme value +1 and -1 correctly…

Descriptors: Item Analysis, Correlation, Test Items, Simulation

Impacts of Scoring Methods on Multiple-Select Multiple-Choice Item Statistics

Direct link

Alicia A. Stoltenberg – ProQuest LLC, 2024

Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…

Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

Motivations for Using the Item Response Theory Nominal Response Model to Rank Responses to Multiple-Choice Items

Peer reviewed

Direct link

Smith, Trevor I.; Bendjilali, Nasrine – Physical Review Physics Education Research, 2022

Several recent studies have employed item response theory (IRT) to rank incorrect responses to commonly used research-based multiple-choice assessments. These studies use Bock's nominal response model (NRM) for applying IRT to categorical (nondichotomous) data, but the response rankings only utilize half of the parameters estimated by the model.…

Descriptors: Item Response Theory, Test Items, Multiple Choice Tests, Science Tests

Modeling NAEP Test-Taking Behavior Using Educational Process Analysis

Peer reviewed
PDF on ERIC

Download full text

Patel, Nirmal; Sharma, Aditya; Shah, Tirth; Lomas, Derek – Journal of Educational Data Mining, 2021

Process Analysis is an emerging approach to discover meaningful knowledge from temporal educational data. The study presented in this paper shows how we used Process Analysis methods on the National Assessment of Educational Progress (NAEP) test data for modeling and predicting student test-taking behavior. Our process-oriented data exploration…

Descriptors: Learning Analytics, National Competency Tests, Evaluation Methods, Prediction

Ensuring Fairness in Difficulty and Content among Parallel Assessments Generated from a Test-Item Database

Download full text

Parry, James R. – Online Submission, 2020

This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…

Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity

An Intersectional Approach to DIF: Comparing Outcomes across Methods

Peer reviewed

Direct link

Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022

Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…

Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction

Comparison of DIF Methods for the Student Experience in the Research University Survey: A Validity and Methodological Study

Direct link

Thapelo Ncube Whitfield – ProQuest LLC, 2021

Student Experience surveys are used to measure student attitudes towards their campus as well as to initiate conversations for institutional change. Validity evidence to support the interpretations of these surveys' results, however, is lacking. The first purpose of this study was to compare three Differential Item Functioning (DIF) methods on…

Descriptors: College Students, Student Surveys, Student Experience, Student Attitudes

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational and Psychological…	11
Behavioral Research and…	8
Journal of Educational…	5
Journal of Educational and…	3
ProQuest LLC	3
Applied Measurement in…	2
Applied Psychological…	2
Educational Assessment	2
Educational Research and…	2
Grantee Submission	2
International Journal of…	2
Measurement and Evaluation in…	2
Measurement:…	2
Practical Assessment,…	2
Achieve, Inc.	1
Assessment and Evaluation in…	1
Australian Educational…	1
British Journal of…	1
Cognition and Instruction	1
College Board	1
ETS Research Report Series	1
Early Education and…	1
Educational Measurement:…	1
Educational Technology &…	1
European Journal of Physics…	1
More ▼

Alonzo, Julie	8
Tindal, Gerald	8
Lai, Cheng Fei	7
Hambleton, Ronald K.	3
Johanson, George A.	2
McKinley, Robert L.	2
Merz, William R.	2
Reckase, Mark D.	2
Rogers, H. Jane	2
Wang, Wen-Chung	2
Ahn, Soyeon	1
Akarsu, Bayram	1
Albano, Anthony D.	1
Alghazali, Tawfeeq	1
Alicia A. Stoltenberg	1
Ann M. Aviles	1
Ariffin, Siti Rahayah	1
Baker, Eva L.	1
Bakla, Arif	1
Barry, Carol	1
Bartolucci, F.	1
Beauchamp, David	1
Bendjilali, Nasrine	1
Bennett, Randy	1
More ▼