ERIC - Search Results

Showing 1 to 15 of 41 results

Effects of the Quantity and Magnitude of Cross-Loading and Model Specification on MIRT Item Parameter Recovery

Peer reviewed

Mostafa Hosseinzadeh; Ki Lynn Matlock Cole – Educational and Psychological Measurement, 2024

In real-world situations, multidimensional data may appear on large-scale tests or psychological surveys. The purpose of this study was to investigate the effects of the quantity and magnitude of cross-loadings and model specification on item parameter recovery in multidimensional Item Response Theory (MIRT) models, especially when the model was…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Algorithms

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

There Are Many Greater Lower Bounds than Cronbach's α: A Monte Carlo Simulation Study

Peer reviewed

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of α, λ2, λ_4, λ_2, ω_T, GLB_MRFA, and GLB_Algebraic coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure

Peer reviewed

Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022

This study aims to analyze the differential bundle functioning in multidimensional tests with a specific purpose to detect this effect through differentiating the location of the item with DIF in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…

Descriptors: Correlation, Sample Size, Test Items, Item Analysis

Comparison of Cronbach's Alpha and McDonald's Omega for Ordinal Data: Are They Different?

Peer reviewed

Fatih Orcan – International Journal of Assessment Tools in Education, 2023

Among reliability coefficients, Cronbach's alpha and McDonald's omega are commonly used. Alpha is based on inter-item correlations, while omega is based on the results of a factor analysis. This study uses simulated ordinal data sets to test whether alpha and omega produce different estimates. Their performances were compared according to the…

Descriptors: Statistical Analysis, Monte Carlo Methods, Correlation, Factor Analysis

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

The Recovery of Correlation between Latent Abilities Using Compensatory and Noncompensatory Multidimensional IRT Models

Peer reviewed

Fu, Yanyan; Strachan, Tyler; Ip, Edward H.; Willse, John T.; Chen, Shyh-Huei; Ackerman, Terry – International Journal of Testing, 2020

This research examined correlation estimates between latent abilities when using the two-dimensional and three-dimensional compensatory and noncompensatory item response theory models. Simulation study results showed that the recovery of the latent correlation was best when the test contained 100% of simple structure items for all models and…

Descriptors: Item Response Theory, Models, Test Items, Simulation

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level

Investigation of the Effect of Parameter Estimation and Classification Accuracy in Mixture IRT Models under Different Conditions

Peer reviewed

Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022

This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…

Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages

Comparison of Different Forms of a Test with or without Items That Exhibit DIF

Peer reviewed

Tulek, Onder Kamil; Kose, Ibrahim Alper – Eurasian Journal of Educational Research, 2019

Purpose: This research investigates tests that include DIF items and tests that have been purified of DIF items. In doing so, the ability estimations and the purified DIF items are compared to understand whether there is a correlation between the estimations. Method: The researcher used R 3.4.1 to compare the items and after this situation;…

Descriptors: Test Items, Item Analysis, Item Response Theory, Test Length

The Performance of the Semigeneralized Partial Credit Model for Handling Item-Level Missingness

Peer reviewed

Zhou, Sherry; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2020

The semi-generalized partial credit model (Semi-GPCM) has been proposed as a unidimensional modeling method for handling not applicable scale responses and neutral scale responses, and it has been suggested that the model may be of use in handling missing data in scale items. The purpose of this study is to evaluate the ability of the…

Descriptors: Models, Statistical Analysis, Response Style (Tests), Test Items

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Differential Item Functioning Effect Size from the Multigroup Confirmatory Factor Analysis for a Meta-Analysis: A Simulation Study

Peer reviewed

Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021

This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…

Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods

A Multigroup Factor Analysis Approach to Analyzing Simple Matrix Sampling Planned Missing Data: (When) Does It Work?

Peer reviewed

Dai, Ting; Du, Yang; Cromley, Jennifer G.; Fechter, Tia M.; Nelson, Frank – AERA Online Paper Repository, 2019

Certain planned-missing designs (e.g., simple-matrix sampling) cause zero covariances between variables not jointly observed, making it impossible to do analyses beyond mean estimations without specialized analyses. We tested a multigroup confirmatory factor analysis (CFA) approach by Cudeck (2000), which obtains a model-estimated…

Descriptors: Factor Analysis, Educational Research, Research Design, Data Analysis
