ERIC - Search Results

Publication Date

In 2025	3
Since 2024	5
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	18
Since 2006 (last 20 years)	40

Descriptor

Comparative Analysis	52
Evaluation Methods	52
Item Analysis	52
Item Response Theory	17
Test Items	16
Models	15
Test Construction	10
Data Analysis	8
Evaluation Research	8
Foreign Countries	8
Simulation	8
Measures (Individuals)	7
Scores	7
Classification	6
Evaluation Criteria	6
Measurement Techniques	6
Statistical Analysis	6
Student Evaluation	6
Validity	6
Correlation	5
Difficulty Level	5
Factor Analysis	5
Higher Education	5
Questionnaires	5
Computer Assisted Testing	4
More ▼

Publication Type

Journal Articles	38
Reports - Research	29
Reports - Evaluative	13
Reports - Descriptive	4
Speeches/Meeting Papers	3
Dissertations/Theses -…	2
Numerical/Quantitative Data	2
Guides - Non-Classroom	1
Information Analyses	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	11
Postsecondary Education	5
Elementary Secondary Education	3
Secondary Education	3
Early Childhood Education	2
Elementary Education	2
High Schools	2
Kindergarten	2
Primary Education	2
Grade 5	1
Grade 6	1
Intermediate Grades	1
Middle Schools	1
More ▼

Audience

Practitioners

Location

Australia	2
China	2
Colorado	1
Germany	1
Germany (Berlin)	1
India	1
New Zealand	1
Texas	1
United Kingdom	1
United States	1
Wisconsin	1
Wisconsin (Milwaukee)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Autism Diagnostic Observation…	1
Child Behavior Checklist	1
Early Childhood Environment…	1
Eysenck Personality Inventory	1
Group Assessment of Logical…	1
National Longitudinal Study…	1
Program for International…	1
Self Perception Profile for…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 52 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Direct link

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Direct link

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Evaluating the Performance of Estimators in SEM and IRT with Ordinal Variables

Direct link

Klauth, Bo – ProQuest LLC, 2023

In conducting confirmatory factor analysis with ordered response items, the literature suggests that when the number of responses is five and item skewness (IS) is approximately normal, researchers can employ maximum likelihood with robust standard errors (MLR). However, MLR can yield biased factor loadings (FL) and FL standard errors (FLSE) when…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Error of Measurement

Linear Factor Analytic Thurstonian Forced-Choice Models: Current Status and Issues

Peer reviewed

Direct link

Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024

Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…

Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods

Comparison of Two Exam Evaluation Methods for Objectivity

Peer reviewed
PDF on ERIC

Download full text

Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022

The object of research is a statistics exam that contains problem tasks. One examiner performed two exam evaluation methods to repeatedly evaluate the exam. The goal was to compare the methods for objectivity. One of the two exam evaluation methods we call a serial evaluation method. The serial evaluation method assumes evaluation of all exam…

Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Direct link

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

Using Lasso and Adaptive Lasso to Identify DIF in Multidimensional 2PL Models

Peer reviewed
PDF on ERIC

Download full text

Direct link

Chun Wang; Ruoyi Zhu; Gongjun Xu – Grantee Submission, 2022

Differential item functioning (DIF) analysis refers to procedures that evaluate whether an item's characteristic differs for different groups of persons after controlling for overall differences in performance. DIF is routinely evaluated as a screening step to ensure items behavior the same across groups. Currently, the majority DIF studies focus…

Descriptors: Models, Item Response Theory, Item Analysis, Comparative Analysis

Scale Alignment in Between-Item Multidimensional Rasch Models

Peer reviewed

Direct link

Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019

Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…

Descriptors: Item Response Theory, Models, Scores, Comparative Analysis

An Intersectional Approach to DIF: Comparing Outcomes across Methods

Peer reviewed

Direct link

Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022

Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…

Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

An Introduction to the Analysis of Ranked Response Data

Peer reviewed
PDF on ERIC

Download full text

Finch, Holmes – Practical Assessment, Research & Evaluation, 2022

Researchers in many disciplines work with ranking data. This data type is unique in that it is often deterministic in nature (the ranks of items "k"-1 determine the rank of item "k"), and the difference in a pair of rank scores separated by "k" units is equivalent regardless of the actual values of the two ranks in…

Descriptors: Data Analysis, Statistical Inference, Models, College Faculty

Teacher-Designed Mathematical Portfolio Assessments: Motivations, Potential Benefits, and Lessons Learned

Peer reviewed

Direct link

Krall, Geoff – Canadian Journal of Science, Mathematics and Technology Education, 2023

In order to identify the potential benefits and challenges of implementing student portfolios as quality mathematics assessment, a pilot study was conducted with teachers in various secondary school settings. The multi-case study consisted of five teacher participants from geographically and demographically differing contexts, four in the USA and…

Descriptors: Portfolio Assessment, Mathematics Instruction, Evaluation Methods, Pilot Projects

Brief Report: Checklist for Autism Spectrum Disorder--Most Discriminating Items for Diagnosing Autism

Peer reviewed

Direct link

Mayes, Susan D. – Journal of Autism and Developmental Disorders, 2018

The smallest subset of items from the 30-item Checklist for Autism Spectrum Disorder (CASD) that differentiated 607 referred children (3-17 years) with and without autism with 100% accuracy was identified. This 6-item subset (CASD-Short Form) was cross-validated on an independent sample of 397 referred children (1-18 years) with and without autism…

Descriptors: Autism, Pervasive Developmental Disorders, Clinical Diagnosis, Accuracy

A Practical Comparison of Selected Methods of Evaluating Multiple-Choice Options through Classical Item Analysis

Peer reviewed
PDF on ERIC

Download full text

Malec, Wojciech; Krzeminska-Adamek, Malgorzata – Practical Assessment, Research & Evaluation, 2020

The main objective of the article is to compare several methods of evaluating multiple-choice options through classical item analysis. The methods subjected to examination include the tabulation of choice distribution, the interpretation of trace lines, the point-biserial correlation, the categorical analysis of trace lines, and the investigation…

Descriptors: Comparative Analysis, Evaluation Methods, Multiple Choice Tests, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Applied Psychological…	3
Journal of Educational and…	3
Educational and Psychological…	2
Grantee Submission	2
Journal of Educational…	2
Practical Assessment,…	2
ProQuest LLC	2
ACT, Inc.	1
Assessment	1
Assessment & Evaluation in…	1
Canadian Journal of Science,…	1
Child & Family Behavior…	1
College Board	1
Compare: A Journal of…	1
Education Policy Analysis…	1
Educational Assessment	1
Educational Forum	1
Educational Practice and…	1
Educational Research and…	1
Higher Education Review	1
Higher Education in Europe	1
International Educational…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
More ▼

Chun Wang	2
Gongjun Xu	2
Barry, Carol	1
Beech, Anthony	1
Berger, Martijn P. F.	1
Bhola, Dennison S.	1
Borowski, Andreas	1
Bowers, John J.	1
Browne, Kevin D.	1
Buckendahl, Chad W.	1
Cantrell, Pamela	1
Chen, Hanwei	1
Chen, Junjun	1
Clark, Teresa P.	1
Cowie, Bronwen	1
Craig, Leam A.	1
Crisp, Victoria	1
Cui, Zhongmin	1
Davies, Martin	1
Dillon, Amanda	1
Donovan, Jenny	1
Egberink, Iris J. L.	1
Feuerstahler, Leah	1
Finch, Holmes	1
More ▼