Publication Date

| Publication date | Records |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 24 |
| Since 2007 (last 20 years) | 45 |
Descriptor

| Descriptor | Records |
| --- | --- |
| Correlation | 49 |
| Error of Measurement | 49 |
| Item Response Theory | 42 |
| Test Items | 18 |
| Comparative Analysis | 15 |
| Scores | 13 |
| Item Analysis | 12 |
| Models | 12 |
| Simulation | 10 |
| Sample Size | 9 |
| Factor Analysis | 8 |
Author

| Author | Records |
| --- | --- |
| Ahn, Soyeon | 2 |
| Kelecioglu, Hülya | 2 |
| Ailin Yuan | 1 |
| Algina, James | 1 |
| Andrews, Benjamin James | 1 |
| Andrich, David | 1 |
| Ankenmann, Robert D. | 1 |
| Anwyll, Steve | 1 |
| Asil, Mustafa | 1 |
| Aydin, Burak | 1 |
| Beidel, Deborah C. | 1 |
Publication Type

| Publication type | Records |
| --- | --- |
| Journal Articles | 37 |
| Reports - Research | 34 |
| Reports - Evaluative | 7 |
| Dissertations/Theses -… | 6 |
| Speeches/Meeting Papers | 3 |
| Reports - Descriptive | 2 |
| Numerical/Quantitative Data | 1 |
Education Level

| Education level | Records |
| --- | --- |
| Elementary Education | 4 |
| Elementary Secondary Education | 3 |
| Grade 4 | 2 |
| Grade 7 | 2 |
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Secondary Education | 2 |
| Early Childhood Education | 1 |
| Grade 2 | 1 |
| Grade 3 | 1 |
| Grade 5 | 1 |
Audience

| Audience | Records |
| --- | --- |
| Administrators | 1 |
Assessments and Surveys

| Assessment or survey | Records |
| --- | --- |
| Program for International… | 1 |
| Trends in International… | 1 |
Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024
Item response tree (IRTree) models are a flexible framework for controlling self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…
Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity
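As a concrete illustration of the decomposition described above, the sketch below maps 4-point ratings onto two binary pseudo-items, a direction node and an extremity node, which is one common IRTree parameterization; the two-node tree and the function name are illustrative assumptions, not the authors' specification.

```python
import numpy as np

def irtree_pseudo_items(responses):
    """Decompose 4-point ratings (1-4) into two binary sub-decisions:
    node 1 = direction (0: disagree side, 1: agree side),
    node 2 = extremity (0: mild response, 1: extreme response)."""
    responses = np.asarray(responses)
    direction = (responses >= 3).astype(float)
    extremity = np.isin(responses, [1, 4]).astype(float)
    return np.column_stack([direction, extremity])

print(irtree_pseudo_items([1, 2, 3, 4]))
# [[0. 1.]
#  [0. 0.]
#  [1. 0.]
#  [1. 1.]]
```

Each pseudo-item column can then be modeled with its own IRT parameters, letting the target trait drive the direction node while a response-style dimension drives the extremity node.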
Seyma Erbay Mermer – Pegem Journal of Education and Instruction, 2024
This study aims to compare item and student parameters of dichotomously scored multidimensional constructs estimated with unidimensional and multidimensional Item Response Theory (IRT) models under different conditions of sample size, interdimensional correlation, and number of dimensions. This research, conducted with simulations, is of a basic…
Descriptors: Item Response Theory, Correlation, Error of Measurement, Comparative Analysis
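A minimal sketch of the data-generating side of such a simulation, assuming a between-item (simple-structure) multidimensional 2PL model with correlated traits; all parameter values and condition settings below are hypothetical, not those of the study.

```python
import numpy as np

rng = np.random.default_rng(1)
n_persons, n_items, n_dims = 500, 20, 2
rho = 0.5  # interdimensional correlation (one simulated condition)

# Correlated latent traits
cov = np.array([[1.0, rho], [rho, 1.0]])
theta = rng.multivariate_normal(np.zeros(n_dims), cov, size=n_persons)

# Simple structure: each item loads on exactly one dimension
a = np.zeros((n_items, n_dims))
a[: n_items // 2, 0] = rng.uniform(0.8, 2.0, n_items // 2)
a[n_items // 2 :, 1] = rng.uniform(0.8, 2.0, n_items // 2)
d = rng.normal(0.0, 1.0, n_items)  # item intercepts

# Multidimensional 2PL probabilities and dichotomous responses
p = 1.0 / (1.0 + np.exp(-(theta @ a.T + d)))
x = rng.binomial(1, p)
```

Fitting the same data once with a unidimensional model and once with the true multidimensional model is what produces the parameter-recovery comparison the abstract describes.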
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
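Of the three methods named, Mantel-Haenszel is the most compact to sketch: examinees are stratified by total score, and a common odds ratio is pooled across strata. The minimal implementation below omits refinements such as scale purification and the continuity correction.

```python
import numpy as np

def mantel_haenszel_dif(x_item, total, group):
    """Mantel-Haenszel common odds ratio for one dichotomous item,
    stratified on total score. group: 0 = reference, 1 = focal."""
    x_item, total, group = map(np.asarray, (x_item, total, group))
    num = den = 0.0
    for k in np.unique(total):
        s = total == k
        a = np.sum(s & (group == 0) & (x_item == 1))  # reference correct
        b = np.sum(s & (group == 0) & (x_item == 0))  # reference incorrect
        c = np.sum(s & (group == 1) & (x_item == 1))  # focal correct
        d = np.sum(s & (group == 1) & (x_item == 0))  # focal incorrect
        n = a + b + c + d
        if n > 0:
            num += a * d / n
            den += b * c / n
    alpha = num / den
    return alpha, -2.35 * np.log(alpha)  # odds ratio, ETS delta (MH D-DIF)
```

An odds ratio near 1 (MH D-DIF near 0) indicates no DIF; values further from 1 indicate an advantage for the reference or focal group at matched ability.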
Hoang V. Nguyen; Niels G. Waller – Educational and Psychological Measurement, 2024
We conducted an extensive Monte Carlo study of factor-rotation local solutions (LS) in multidimensional, two-parameter logistic (M2PL) item response models. In this study, we simulated more than 19,200 data sets that were drawn from 96 model conditions and performed more than 7.6 million rotations to examine the influence of (a) slope parameter…
Descriptors: Monte Carlo Methods, Item Response Theory, Correlation, Error of Measurement
Rebekka Kupffer; Susanne Frick; Eunike Wetzel – Educational and Psychological Measurement, 2024
The multidimensional forced-choice (MFC) format is an alternative to rating scales in which participants rank items according to how well the items describe them. Currently, little is known about how to detect careless responding in MFC data. The aim of this study was to adapt a number of indices used for rating scales to the MFC format and…
Descriptors: Measurement Techniques, Alternative Assessment, Rating Scales, Questionnaires
Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
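A small sketch of the rater-agreement ingredient: exact and adjacent agreement rates between two raters, which a resolution rule could use to flag discrepant responses for re-scoring. The one-point flagging threshold here is a hypothetical choice, not the authors' design.

```python
import numpy as np

def agreement_rates(r1, r2):
    """Exact and adjacent (within one point) agreement between two raters."""
    diff = np.abs(np.asarray(r1) - np.asarray(r2))
    return {"exact": float(np.mean(diff == 0)),
            "adjacent": float(np.mean(diff <= 1))}

r1 = [3, 2, 4, 1, 3]
r2 = [3, 3, 1, 1, 4]
print(agreement_rates(r1, r2))  # {'exact': 0.4, 'adjacent': 0.8}
# Responses with |r1 - r2| > 1 would be routed to a third rater.
```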
Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023
Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…
Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)
Yuanfang Liu; Mark H. C. Lai; Ben Kelcey – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Measurement invariance holds when a latent construct is measured in the same way across different levels of background variables (continuous or categorical) while controlling for the true value of that construct. Using Monte Carlo simulation, this paper compares the multiple indicators, multiple causes (MIMIC) model and MIMIC-interaction to a…
Descriptors: Classification, Accuracy, Error of Measurement, Correlation
An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure
Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022
This study aims to analyze differential bundle functioning in multidimensional tests, with the specific purpose of detecting this effect by varying the location of the DIF item in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…
Descriptors: Correlation, Sample Size, Test Items, Item Analysis
Hosseinzadeh, Mostafa – ProQuest LLC, 2021
In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple-structure multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…
Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods
Factor Structure and Psychometric Properties of the Digital Stress Scale in a Chinese College Sample
Chunlei Gao; Mingqing Jian; Ailin Yuan – SAGE Open, 2024
The Digital Stress Scale (DSS) is used to measure digital stress, which is the perceived stress and anxiety associated with social media use. In this study, the Chinese version of the DSS was validated using a sample of 721 Chinese college students, 321 males and 400 females (KMO = 0.923; Bartlett = 5,058.492, p < 0.001). Confirmatory factor…
Descriptors: Factor Structure, Factor Analysis, Psychometrics, Anxiety
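The KMO and Bartlett values reported above are standard sampling-adequacy checks run before factor analysis; in Python they are available through the `factor_analyzer` package. The file name below is a placeholder for a respondents-by-items data set, not a file from the study.

```python
import pandas as pd
from factor_analyzer.factor_analyzer import (
    calculate_bartlett_sphericity,
    calculate_kmo,
)

# Hypothetical respondents x items matrix of DSS responses
df = pd.read_csv("dss_items.csv")

chi_square, p_value = calculate_bartlett_sphericity(df)
kmo_per_item, kmo_overall = calculate_kmo(df)
print(f"Bartlett chi-square = {chi_square:.3f}, p = {p_value:.4g}")
print(f"Overall KMO = {kmo_overall:.3f}")  # values above ~0.9 are typically deemed excellent
```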
Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
Gorgun, Guher; Bulut, Okan – Educational and Psychological Measurement, 2021
In low-stakes assessments, some students may not reach the end of the test and leave some items unanswered due to various reasons (e.g., lack of test-taking motivation, poor time management, and test speededness). Not-reached items are often treated as incorrect or not-administered in the scoring process. However, when the proportion of…
Descriptors: Scoring, Test Items, Response Style (Tests), Mathematics Tests
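A toy example of the two conventional treatments the abstract contrasts, scoring not-reached items as incorrect versus as not administered; the response vector is fabricated for illustration.

```python
import numpy as np

# Ten-item test; np.nan marks items never reached before time ran out
resp = np.array([1, 0, 1, 1, 0, 1, np.nan, np.nan, np.nan, np.nan])
reached = ~np.isnan(resp)

# Treatment 1: not-reached scored as incorrect (penalizes slow examinees)
p_incorrect = np.nansum(resp) / resp.size        # 4/10 = 0.40

# Treatment 2: not-reached treated as not administered
p_not_admin = np.nansum(resp) / reached.sum()    # 4/6 ≈ 0.67

print(p_incorrect, round(p_not_admin, 2))
```

The gap between the two proportions grows with the share of not-reached items, which is why the choice of treatment matters when scoring low-stakes assessments.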
Sinharay, Sandip – Journal of Educational Measurement, 2018
The value-added method of Haberman is arguably one of the most popular methods to evaluate the quality of subscores. The method is based on the classical test theory and deems a subscore to be of added value if the subscore predicts the corresponding true subscore better than does the total score. Sinharay provided an interpretation of the added…
Descriptors: Scores, Value Added Models, Raw Scores, Item Response Theory
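Haberman's criterion compares proportional reductions in mean squared error (PRMSE): a subscore has added value if it predicts its own true subscore better than the total score does. Below is a minimal numerical sketch under classical-test-theory assumptions (independent errors across test parts); the moment values are hypothetical, and this is a simplification rather than Haberman's full estimator.

```python
def subscore_added_value(var_s, rel_s, var_x, cov_sx):
    """Compare PRMSEs for predicting the true subscore from the
    observed subscore s versus the total score x."""
    var_true_s = rel_s * var_s
    prmse_s = rel_s  # PRMSE of s equals its reliability
    # The error in s is part of x, so subtract its variance from cov(s, x)
    cov_x_true_s = cov_sx - (1.0 - rel_s) * var_s
    prmse_x = cov_x_true_s**2 / (var_x * var_true_s)
    return prmse_s, prmse_x, prmse_s > prmse_x

# Hypothetical moments: subscore variance 16, reliability 0.75,
# total-score variance 100, subscore-total covariance 30
print(subscore_added_value(16.0, 0.75, 100.0, 30.0))
# (0.75, 0.5633..., True) -> the subscore has added value
```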
Keusch, Florian; Leonard, Mariel M.; Sajons, Christoph; Steiner, Susan – Sociological Methods & Research, 2021
Researchers attempting to survey refugees over time face methodological issues because of the transient nature of the target population. In this article, we examine whether applying smartphone technology could alleviate these issues. We interviewed 529 refugees and afterward invited them to four follow-up mobile web surveys and to install a…
Descriptors: Handheld Devices, Telecommunications, Ownership, Computer Software

