ERIC - Search Results

Publication Date

In 2025	4
Since 2024	7
Since 2021 (last 5 years)	16
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	38

Descriptor

Evaluation Methods	55
Item Analysis	55
Models	55
Item Response Theory	21
Comparative Analysis	15
Measurement Techniques	13
Test Items	13
Simulation	12
Error of Measurement	8
Test Construction	8
Test Reliability	8
Test Validity	8
Evaluation Criteria	7
Scores	7
Statistical Analysis	7
Computer Assisted Testing	6
Correlation	6
Criterion Referenced Tests	6
Foreign Countries	6
Testing	6
Accountability	5
Data Analysis	5
Decision Making	5
Goodness of Fit	5
Sample Size	5
More ▼

Publication Type

Journal Articles	36
Reports - Research	32
Reports - Evaluative	7
Reports - Descriptive	5
Dissertations/Theses -…	3
Books	1
Collected Works - General	1
Information Analyses	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Higher Education	7
Adult Education	4
Elementary Secondary Education	4
Postsecondary Education	4
Secondary Education	3
Two Year Colleges	1

Audience

Researchers	2
Practitioners	1

Location

Australia	2
California	1
Denmark	1
Italy	1
New Mexico	1
New York	1
New Zealand	1
South Africa	1
Texas	1
United Kingdom	1
Virginia	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Child Behavior Checklist	1
Graduate Record Examinations	1
National Assessment of…	1
National Longitudinal Study…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 55 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Direct link

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Direct link

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Identifying Response Styles Using Person Fit Analysis and Response-Styles Models

Peer reviewed

Direct link

Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023

In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…

Descriptors: Goodness of Fit, Responses, Likert Scales, Models

A Validation Study of the Extended Relevance Scale Using the D3mirt Package for R

Peer reviewed

Direct link

Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025

This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…

Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism

Linear Factor Analytic Thurstonian Forced-Choice Models: Current Status and Issues

Peer reviewed

Direct link

Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024

Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…

Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Direct link

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

Forced-Choice Ranking Models for Raters' Ranking Data

Peer reviewed

Direct link

Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022

To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…

Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

Using Lasso and Adaptive Lasso to Identify DIF in Multidimensional 2PL Models

Peer reviewed
PDF on ERIC

Download full text

Direct link

Chun Wang; Ruoyi Zhu; Gongjun Xu – Grantee Submission, 2022

Differential item functioning (DIF) analysis refers to procedures that evaluate whether an item's characteristic differs for different groups of persons after controlling for overall differences in performance. DIF is routinely evaluated as a screening step to ensure items behavior the same across groups. Currently, the majority DIF studies focus…

Descriptors: Models, Item Response Theory, Item Analysis, Comparative Analysis

Evaluation of Structure Complexity Magnitude, Degree of Cross-Loading on Secondary Dimension and Model Specification on MIRT Parameter Estimation

Direct link

Hosseinzadeh, Mostafa – ProQuest LLC, 2021

In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple structure, multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…

Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods

Modeling NAEP Test-Taking Behavior Using Educational Process Analysis

Peer reviewed
PDF on ERIC

Download full text

Patel, Nirmal; Sharma, Aditya; Shah, Tirth; Lomas, Derek – Journal of Educational Data Mining, 2021

Process Analysis is an emerging approach to discover meaningful knowledge from temporal educational data. The study presented in this paper shows how we used Process Analysis methods on the National Assessment of Educational Progress (NAEP) test data for modeling and predicting student test-taking behavior. Our process-oriented data exploration…

Descriptors: Learning Analytics, National Competency Tests, Evaluation Methods, Prediction

Scale Alignment in Between-Item Multidimensional Rasch Models

Peer reviewed

Direct link

Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019

Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…

Descriptors: Item Response Theory, Models, Scores, Comparative Analysis

Investigating Concept Definition and Skill Modeling for Cognitive Diagnosis in Language Learning

Peer reviewed
PDF on ERIC

Download full text

Boxuan Ma; Sora Fukui; Yuji Ando; Shinichi Konomi – Journal of Educational Data Mining, 2024

Language proficiency diagnosis is essential to extract fine-grained information about the linguistic knowledge states and skill mastery levels of test takers based on their performance on language tests. Different from comprehensive standardized tests, many language learning apps often revolve around word-level questions. Therefore, knowledge…

Descriptors: Language Proficiency, Brain Hemisphere Functions, Language Processing, Task Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	5
Journal of Educational…	4
Journal of Educational and…	4
Measurement:…	3
ProQuest LLC	3
Grantee Submission	2
International Journal of…	2
Journal of Educational Data…	2
Psychometrika	2
American Journal of Evaluation	1
Applied Psychological…	1
Asia Pacific Journal of…	1
Australian Journal of…	1
Center for American Progress	1
Decision Sciences Journal of…	1
Educational Research and…	1
Instructional Science	1
International Journal of…	1
Journal of Abnormal Child…	1
Journal of Computer-Based…	1
Journal of Consulting and…	1
Journal of Teacher Education	1
Learning Disability Quarterly	1
Perspectives in Education	1
Practical Assessment,…	1
More ▼

Chun Wang	2
Gongjun Xu	2
Wilson, Mark	2
Albano, Anthony D.	1
Anders Sjöberg	1
Andrich, David	1
Bartolucci, F.	1
Berger, Martijn P. F.	1
Bernhardt, Amery E.	1
Bhaskar, R.	1
Boxuan Ma	1
Brock, Donna-Jean P.	1
Burstein, Leigh	1
Calvitto, Leanne	1
Choi, Youn-Jeng	1
Constable, Elizabeth	1
Dancer, L. Suzanne	1
Denison, D. Brian, Ed.	1
Dillard, Jesse F.	1
Dillon, Amanda	1
Dirkzwager, Arie	1
Douglas, Jeff	1
Dunne, Tim	1
Edmonston, Leon P.	1
More ▼