Showing 1 to 15 of 36 results
Peer reviewed
Direct link
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple-choice (MC) and mixed-format tests within the common-item nonequivalent groups design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
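The study above compares multidimensional linking methods; as a much simpler point of reference, the sketch below applies the classic unidimensional mean/sigma transformation to hypothetical common-item difficulties under the same common-item nonequivalent groups design. All parameter values are invented.

```python
import numpy as np

# Hypothetical anchor-item difficulties estimated separately in two calibrations.
b_base = np.array([-1.2, -0.4, 0.3, 0.9, 1.6])   # base-form scale
b_new  = np.array([-0.9, -0.1, 0.5, 1.2, 1.9])   # new-form scale

# Mean/sigma linking constants: theta_base = A * theta_new + B
A = b_base.std(ddof=1) / b_new.std(ddof=1)
B = b_base.mean() - A * b_new.mean()

# Place new-form parameters on the base scale (2PL conventions).
b_linked = A * b_new + B          # difficulties transform like theta
# a_linked = a_new / A            # discriminations divide by A

print(f"A = {A:.3f}, B = {B:.3f}")
print("linked difficulties:", np.round(b_linked, 3))
```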
Peer reviewed
Direct link
Martin Bäckström; Fredrik Björklund – Educational and Psychological Measurement, 2024
The forced-choice response format is often considered superior to the standard Likert-type format for controlling social desirability in personality inventories. We performed simulations and found that the trait information based on the two formats converges when the number of items is high and forced-choice items are mixed with regard to…
Descriptors: Likert Scales, Item Analysis, Personality Traits, Personality Measures
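A minimal simulation of the mechanism at issue: two hypothetical items share a social-desirability component that inflates a Likert-type sum but cancels in a forced-choice comparison between them. Loadings and variances below are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000
trait = rng.normal(size=n)          # target personality trait
desir = rng.normal(size=n)          # social-desirability tendency

# Two hypothetical items measuring the trait, both pulled upward by desirability.
item1 = 1.0 * trait + 0.8 * desir + rng.normal(scale=0.5, size=n)
item2 = 0.6 * trait + 0.8 * desir + rng.normal(scale=0.5, size=n)

# Likert-style score: sum of raw item responses (the bias accumulates).
likert = item1 + item2

# Forced choice: which of the two items is endorsed more strongly?
# The shared desirability component cancels in the difference.
fc = (item1 - item2 > 0).astype(float)

print("corr(Likert, desirability):", np.corrcoef(likert, desir)[0, 1].round(2))
print("corr(FC,     desirability):", np.corrcoef(fc, desir)[0, 1].round(2))
```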
Peer reviewed
Direct link
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) in a partially known Q-matrix setting that lies between exploratory and confirmatory DCMs. This setting is practical and useful because test experts have prior knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
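For readers unfamiliar with DCMs, the sketch below computes response probabilities under the DINA model, one common DCM, with a hypothetical Q-matrix. In the study's setting some Q-matrix entries would be uncertain and given priors rather than fixed, as here.

```python
import numpy as np

# Hypothetical 3-attribute, 4-item Q-matrix (rows: items, cols: attributes).
Q = np.array([[1, 0, 0],
              [0, 1, 0],
              [1, 1, 0],
              [0, 1, 1]])
slip  = np.array([0.10, 0.15, 0.20, 0.10])   # P(incorrect | all required attributes)
guess = np.array([0.20, 0.10, 0.15, 0.25])   # P(correct | missing an attribute)

def p_correct(alpha):
    """P(X_j = 1) for one examinee with attribute pattern alpha (0/1 vector)."""
    eta = np.all(alpha >= Q, axis=1).astype(float)   # has every required attribute?
    return eta * (1 - slip) + (1 - eta) * guess

# Examinee who masters attributes 1 and 2 but not 3: item 4 drops to its guess rate.
print(p_correct(np.array([1, 1, 0])))
```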
Peer reviewed
Direct link
Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023
In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…
Descriptors: Goodness of Fit, Responses, Likert Scales, Models
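One way to picture a response style: in the sketch below, a person-specific scaling of hypothetical rating-scale thresholds shifts mass toward the extreme or middle categories even though the latent trait distribution is unchanged. Threshold values are invented.

```python
import numpy as np

rng = np.random.default_rng(1)

def likert_response(theta, scale):
    """Map latent values onto 1-5 ratings via thresholds. 'scale' < 1 squeezes
    the thresholds toward 0, pushing responses into the extreme categories
    (an extreme response style); 'scale' > 1 spreads them out (a midpoint style)."""
    thresholds = scale * np.array([-1.5, -0.5, 0.5, 1.5])
    return np.searchsorted(thresholds, theta) + 1

theta = rng.normal(size=10000)
for s, label in [(1.0, "no style"), (0.4, "extreme style"), (2.0, "midpoint style")]:
    counts = np.bincount(likert_response(theta, s), minlength=6)[1:]
    print(f"{label:14s}", np.round(counts / counts.sum(), 2))
```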
Peer reviewed
Direct link
Liang, Xinya; Cao, Chunhua – Journal of Experimental Education, 2023
To evaluate multidimensional factor structure, a popular method that combines features of confirmatory and exploratory factor analysis is Bayesian structural equation modeling with small-variance normal priors (BSEM-N). This simulation study evaluated BSEM-N as a variable selection and parameter estimation tool in factor analysis with sparse…
Descriptors: Factor Analysis, Bayesian Statistics, Structural Equation Models, Simulation
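The "small-variance normal prior" idea can be seen in a one-parameter sketch: under a normal likelihood with known sampling variance, the posterior mean of a cross-loading shrinks toward zero in proportion to the prior variance. All numbers below are hypothetical.

```python
# Why a small-variance normal prior acts as a "soft zero" on a cross-loading:
# if the data alone would estimate lambda_hat with sampling variance s2, then
# under a N(0, tau2) prior the (conjugate) posterior mean is a shrunken estimate.
def posterior_mean(lambda_hat, s2, tau2):
    return lambda_hat * tau2 / (tau2 + s2)

lambda_hat, s2 = 0.30, 0.02
for tau2 in (0.01, 0.05, 1.0):
    print(f"prior variance {tau2:4.2f} -> shrunken loading "
          f"{posterior_mean(lambda_hat, s2, tau2):.3f}")
```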
Peer reviewed
Direct link
Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023
Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…
Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines
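As background, traditional parallel analysis retains dimensions whose observed eigenvalues exceed those of random data of the same size. The sketch below implements that baseline on Pearson correlations; the study itself examines IRT-framework variants not shown here.

```python
import numpy as np

def parallel_analysis(data, n_sims=200, quantile=0.95, seed=0):
    """Keep factors whose observed correlation-matrix eigenvalues exceed the
    chosen quantile of eigenvalues from random normal data of the same size."""
    rng = np.random.default_rng(seed)
    n, p = data.shape
    obs = np.linalg.eigvalsh(np.corrcoef(data, rowvar=False))[::-1]
    sim = np.empty((n_sims, p))
    for s in range(n_sims):
        r = rng.normal(size=(n, p))
        sim[s] = np.linalg.eigvalsh(np.corrcoef(r, rowvar=False))[::-1]
    threshold = np.quantile(sim, quantile, axis=0)
    return int(np.sum(obs > threshold))

# Toy check: two correlated blocks of variables should suggest two dimensions.
rng = np.random.default_rng(1)
f = rng.normal(size=(500, 2))
data = np.hstack([f[:, [0]] + rng.normal(scale=0.7, size=(500, 4)),
                  f[:, [1]] + rng.normal(scale=0.7, size=(500, 4))])
print("retained dimensions:", parallel_analysis(data))
```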
Peer reviewed
Direct link
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of the α, λ2, λ4, ωT, GLB-MRFA, and GLB-Algebraic coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
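Two of the compared coefficients have simple closed forms from the item covariance matrix. The sketch below computes coefficient α and Guttman's λ2 on simulated data (item scores invented for illustration).

```python
import numpy as np

def cronbach_alpha(cov):
    k = cov.shape[0]
    return k / (k - 1) * (1 - np.trace(cov) / cov.sum())

def guttman_lambda2(cov):
    k = cov.shape[0]
    total = cov.sum()
    off = cov - np.diag(np.diag(cov))          # off-diagonal covariances
    lam1 = 1 - np.trace(cov) / total
    return lam1 + np.sqrt(k / (k - 1) * (off ** 2).sum()) / total

# Hypothetical item scores: 6 items, 300 examinees, one common factor.
rng = np.random.default_rng(0)
true = rng.normal(size=(300, 1))
items = true + rng.normal(scale=1.0, size=(300, 6))
cov = np.cov(items, rowvar=False)
print(f"alpha   = {cronbach_alpha(cov):.3f}")
print(f"lambda2 = {guttman_lambda2(cov):.3f}")   # lambda2 >= alpha by construction
```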
Peer reviewed
PDF on ERIC (full text)
Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023
Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method for detecting aberrant responses in educational assessments. Many studies have investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…
Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory
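A bare-bones person-fit CUSUM, in one common formulation (not necessarily the paper's exact variant): accumulate item-score residuals so that sustained runs of unexpected responses drive one-sided statistics away from zero. The response pattern below is invented.

```python
import numpy as np

def person_cusum(responses, probs):
    """One-sided CUSUM charts on item-score residuals x_j - P_j(theta_hat):
    runs of unexpected correct answers grow C+, unexpected errors grow C-."""
    resid = responses - probs
    c_plus, c_minus = [0.0], [0.0]
    for t in resid:
        c_plus.append(max(0.0, c_plus[-1] + t))
        c_minus.append(min(0.0, c_minus[-1] + t))
    return np.array(c_plus[1:]), np.array(c_minus[1:])

# Hypothetical examinee: the model expects 50% success throughout, but the
# last 5 answers are all correct (e.g., item preknowledge at the test's end).
probs = np.full(15, 0.5)
responses = np.array([1, 0, 0, 1, 0, 1, 0, 0, 1, 0, 1, 1, 1, 1, 1], dtype=float)
c_plus, c_minus = person_cusum(responses, probs)
print("max C+ :", c_plus.max().round(2))   # a large value flags upward drift
```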
Peer reviewed
Direct link
Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025
The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…
Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies
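The Lasso-type shrinkage at the heart of such methods behaves like soft-thresholding: small estimated group differences in item parameters collapse exactly to zero (no DIF), while large ones survive. A schematic sketch with invented numbers; in the adaptive-Lasso setting the penalty would be item-specific.

```python
import numpy as np

def soft_threshold(delta_hat, penalty):
    """Lasso-style MAP shrinkage of estimated focal-minus-reference differences
    in item parameters: |differences| below 'penalty' are set exactly to 0."""
    return np.sign(delta_hat) * np.maximum(np.abs(delta_hat) - penalty, 0.0)

# Hypothetical group differences in item locations for five items.
delta_hat = np.array([0.05, -0.10, 0.60, 0.02, -0.45])
print(soft_threshold(delta_hat, penalty=0.15))   # nonzero entries: DIF suspects
```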
Peer reviewed
PDF on ERIC (full text)
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
A new index of item discrimination power (IDP), dimension-corrected Somers' D (D2), is proposed. Somers' D is one of the superior alternatives to the item-total (Rit) and item-rest (Rir) correlations in reflecting real IDP for items scored 0/1 or 0/1/2, that is, with up to three categories. D also reaches the extreme values +1 and -1 correctly…
Descriptors: Item Analysis, Correlation, Test Items, Simulation
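For reference, plain Somers' D of an item given the total score can be computed by brute-force pair counting, as below; the proposed D2 adds a dimension correction not shown here. Data values are invented.

```python
import numpy as np

def somers_d(item, total):
    """Somers' D of the item given the total score: (concordant - discordant)
    pairs divided by all pairs that differ on the total score."""
    item, total = np.asarray(item), np.asarray(total)
    conc = disc = n_diff = 0
    n = len(item)
    for i in range(n):
        for j in range(i + 1, n):
            dt = total[i] - total[j]
            if dt == 0:
                continue                      # ties on the total are excluded
            n_diff += 1
            di = item[i] - item[j]
            if di * dt > 0:
                conc += 1
            elif di * dt < 0:
                disc += 1
    return (conc - disc) / n_diff

# Toy data: a 0/1/2 polytomous item against the test total.
item  = [0, 0, 1, 1, 2, 2, 2, 0]
total = [3, 5, 6, 8, 9, 11, 12, 4]
print(round(somers_d(item, total), 3))
```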
Peer reviewed
Direct link
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences
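A Thurstonian pairwise-comparison model is one standard way to score such forced choices: assuming each statement has a latent utility plus independent standard normal noise, the choice probability has a closed form. The utilities below are invented.

```python
from math import erf, sqrt

def p_prefer(mu_i, mu_j):
    """P(statement i is chosen over j) when utilities are mu + N(0, 1) noise;
    the utility difference is then N(mu_i - mu_j, 2)."""
    z = (mu_i - mu_j) / sqrt(2.0)
    return 0.5 * (1 + erf(z / sqrt(2.0)))     # standard normal CDF at z

# Hypothetical utilities a rater assigns to two performance descriptors.
print(round(p_prefer(1.0, 0.2), 3))   # the stronger descriptor wins most pairs
```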
Hosseinzadeh, Mostafa – ProQuest LLC, 2021
In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple-structure multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of the magnitude of structure complexity of…
Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods
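The issue can be made concrete with a compensatory multidimensional 2PL item: ignoring a true cross-loading changes the modeled success probability. Parameter values below are hypothetical.

```python
import numpy as np

def p_correct(theta, a, d):
    """Compensatory multidimensional 2PL: P(X = 1) = logistic(a . theta + d)."""
    return 1.0 / (1.0 + np.exp(-(np.dot(a, theta) + d)))

theta = np.array([0.5, 1.0])              # examinee location on two dimensions
a_true   = np.array([1.2, 0.4])           # item truly cross-loads on dimension 2
a_simple = np.array([1.2, 0.0])           # simple-structure model ignores it
print(p_correct(theta, a_true,   -0.3))   # probability under the true model
print(p_correct(theta, a_simple, -0.3))   # distorted when the cross-loading is dropped
```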
Peer reviewed
PDF on ERIC (full text)
Koyuncu, Ilhan; Kilic, Abdullah Faruk – International Journal of Assessment Tools in Education, 2021
In exploratory factor analysis, researchers decide which items belong to which factors by considering statistical results, but those decisions can sometimes be subjective when items have similar factor loadings or complex factor structures. The aim of this study was to examine the validity of classifying items into…
Descriptors: Classification, Graphs, Factor Analysis, Decision Making
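A minimal version of the classification rule in question, with an explicit flag for the ambiguous cases the abstract mentions; the 0.10 margin and the loading matrix are arbitrary illustrations.

```python
import numpy as np

def classify_items(loadings, margin=0.10):
    """Assign each item to its highest-|loading| factor; flag items whose top
    two |loadings| fall within 'margin' of each other as ambiguous (the cases
    where subjective judgment would otherwise creep in)."""
    L = np.abs(np.asarray(loadings))
    order = np.argsort(-L, axis=1)                       # factors by |loading|
    top, second = np.take_along_axis(L, order[:, :2], axis=1).T
    return order[:, 0], (top - second) < margin

loadings = np.array([[0.72, 0.10],
                     [0.65, 0.20],
                     [0.45, 0.41],    # near-equal loadings: ambiguous
                     [0.08, 0.70]])
factor, ambiguous = classify_items(loadings)
print(factor, ambiguous)
```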
Peer reviewed
Direct link
Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019
Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…
Descriptors: Item Response Theory, Models, Scores, Comparative Analysis
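As a generic illustration only (not the paper's DDA or LRA recipes), one simple form of alignment applies a linear map per dimension so that its item difficulties share a target mean and SD, transforming the person estimates the same way. All values below are invented.

```python
import numpy as np

def align_dimension(theta, b_dim, target_mean=0.0, target_sd=1.0):
    """Rescale one dimension so its item difficulties have the target mean/SD,
    applying the identical linear map to the person estimates."""
    A = target_sd / b_dim.std(ddof=1)
    B = target_mean - A * b_dim.mean()
    return A * theta + B, A * b_dim + B

theta_dim2 = np.array([-0.8, 0.1, 0.9])        # person scores on dimension 2
b_dim2 = np.array([-1.5, -0.2, 0.4, 1.3])      # its item difficulties
theta_aligned, b_aligned = align_dimension(theta_dim2, b_dim2)
print(np.round(b_aligned, 3))                  # now mean 0, SD 1
```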
Peer reviewed
Direct link
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
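A stripped-down version of the log-linear logic: "no DIF" corresponds to conditional independence of group and item response given the matching-score level, which has closed-form expected counts and a G² test. The paper's models are richer; the count table below is simulated.

```python
import numpy as np

def conditional_independence_g2(table):
    """G^2 test of conditional independence of group and item response given
    the matching-score level, for a K x G x R count table (score level x
    group x response category)."""
    K, G, R = table.shape
    n_gs = table.sum(axis=2, keepdims=True)      # group totals per score level
    n_rs = table.sum(axis=1, keepdims=True)      # response totals per score level
    n_s  = table.sum(axis=(1, 2), keepdims=True)
    expected = n_gs * n_rs / n_s                 # closed-form MLE fitted counts
    mask = table > 0
    g2 = 2 * (table[mask] * np.log(table[mask] / expected[mask])).sum()
    df = K * (G - 1) * (R - 1)
    return g2, df

# Hypothetical 3 score levels x 2 groups x 3 response categories.
rng = np.random.default_rng(0)
table = rng.integers(5, 40, size=(3, 2, 3)).astype(float)
print(conditional_independence_g2(table))
```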