Publication Date
| Period | Records |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 21 |
| Since 2017 (last 10 years) | 50 |
| Since 2007 (last 20 years) | 98 |
Descriptor
| Descriptor | Records |
| --- | --- |
| Sample Size | 139 |
| Test Length | 139 |
| Item Response Theory | 93 |
| Test Items | 66 |
| Simulation | 44 |
| Comparative Analysis | 31 |
| Error of Measurement | 30 |
| Statistical Analysis | 28 |
| Monte Carlo Methods | 27 |
| Computation | 26 |
| Models | 26 |
Publication Type
| Publication Type | Records |
| --- | --- |
| Journal Articles | 98 |
| Reports - Research | 97 |
| Reports - Evaluative | 29 |
| Speeches/Meeting Papers | 26 |
| Dissertations/Theses -… | 10 |
| Reports - Descriptive | 2 |
| Guides - Non-Classroom | 1 |
| Numerical/Quantitative Data | 1 |
Education Level
| Education Level | Records |
| --- | --- |
| Higher Education | 3 |
| Postsecondary Education | 3 |
| Secondary Education | 2 |
Audience
| Audience | Records |
| --- | --- |
| Researchers | 3 |
Assessments and Surveys
| Assessment / Survey | Records |
| --- | --- |
| Law School Admission Test | 3 |
| Program for International… | 2 |
| Comprehensive Tests of Basic… | 1 |
| Iowa Tests of Basic Skills | 1 |
| SAT (College Admission Test) | 1 |
| Trends in International… | 1 |
Lim, Euijin; Lee, Won-Chan – Applied Measurement in Education, 2020
The purpose of this study is to address the necessity of subscore equating and to evaluate the performance of various equating methods for subtests. Assuming the random groups design and number-correct scoring, this paper analyzed real data and simulated data with four study factors including test dimensionality, subtest length, form difference in…
Descriptors: Equated Scores, Test Length, Test Format, Difficulty Level
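For readers unfamiliar with observed-score equating under the random groups design mentioned above, mean equating is the simplest case: new-form scores are shifted by the difference between the two groups' mean scores. This is a minimal illustrative sketch with hypothetical subscores, not one of the specific methods the study evaluates.

```python
def mean_equate(new_form_scores, old_form_scores):
    """Mean equating under the random groups design: shift each
    new-form score by the difference in group mean scores."""
    shift = (sum(old_form_scores) / len(old_form_scores)
             - sum(new_form_scores) / len(new_form_scores))
    return [score + shift for score in new_form_scores]

# Hypothetical number-correct subscores from two randomly equivalent groups
old_form = [12, 15, 14, 18, 11, 16]
new_form = [10, 13, 12, 16, 9, 14]
equated = mean_equate(new_form, old_form)
print(equated)  # each new-form score shifted up by the mean difference
```

Because the groups are randomly equivalent, any difference in group means is attributed to form difficulty, which is why the simple shift is defensible in this design.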
Ziying Li; A. Corinne Huggins-Manley; Walter L. Leite; M. David Miller; Eric A. Wright – Educational and Psychological Measurement, 2022
The unstructured multiple-attempt (MA) item response data in virtual learning environments (VLEs) often come from student-selected assessment data sets, which include missing data, single-attempt responses, multiple-attempt responses, and unknown growth in ability across attempts, creating a complex scenario for using this kind of…
Descriptors: Sequential Approach, Item Response Theory, Data, Simulation
Tulek, Onder Kamil; Kose, Ibrahim Alper – Eurasian Journal of Educational Research, 2019
Purpose: This research compares tests that include DIF items with tests purified of DIF items. Ability estimates from the two versions are compared to determine whether the estimates are correlated. Method: The researcher used R 3.4.1 to compare the items, and…
Descriptors: Test Items, Item Analysis, Item Response Theory, Test Length
Köse, Alper; Dogan, C. Deha – International Journal of Evaluation and Research in Education, 2019
The aim of this study was to examine the precision of item parameter estimation across different sample sizes and test lengths under the three-parameter logistic (3PL) item response theory (IRT) model, where the trait measured by the test had a skewed, non-normal distribution. In the study, the number of categories (1-0), and item…
Descriptors: Statistical Bias, Item Response Theory, Simulation, Accuracy
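As background for the 3PL model referenced above, a minimal simulation sketch: the 3PL response probability with hypothetical item parameters, and abilities drawn from a shifted gamma distribution to stand in for a skewed trait. All values here are illustrative.

```python
import math
import random

def p_3pl(theta, a, b, c):
    """3PL probability of a correct response: guessing floor c plus
    a logistic curve scaled by discrimination a and difficulty b."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

random.seed(1)
# Hypothetical item: discrimination a, difficulty b, guessing c
a, b, c = 1.2, 0.0, 0.2
# Skewed ability distribution: gamma(shape=2) shifted to mean zero
thetas = [random.gammavariate(2.0, 1.0) - 2.0 for _ in range(5000)]
responses = [int(random.random() < p_3pl(t, a, b, c)) for t in thetas]
print(sum(responses) / len(responses))  # observed proportion correct
```

At theta equal to the difficulty b, the model gives c + (1 - c)/2, which is 0.6 for these parameters; the guessing parameter keeps the probability above 0.2 even for very low abilities.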
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
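The Mantel-Haenszel procedure named in the abstract pools a common odds ratio over total-score strata, comparing reference- and focal-group performance on the studied item. A minimal sketch of that pooled odds ratio, with hypothetical cell counts:

```python
def mh_odds_ratio(strata):
    """Mantel-Haenszel common odds ratio across score strata.
    Each stratum is a 2x2 table (a, b, c, d):
      a = reference correct, b = reference incorrect,
      c = focal correct,     d = focal incorrect."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return num / den

# Hypothetical counts for three total-score strata
strata = [(40, 10, 30, 20), (35, 15, 25, 25), (20, 30, 10, 40)]
print(mh_odds_ratio(strata))  # > 1 suggests the item favors the reference group
```

In operational DIF work the ratio is usually re-expressed on the ETS delta scale, but the pooled odds ratio above is the quantity everything else is built on.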
Zhou, Sherry; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2020
The semi-generalized partial credit model (Semi-GPCM) has been proposed as a unidimensional modeling method for handling not applicable scale responses and neutral scale responses, and it has been suggested that the model may be of use in handling missing data in scale items. The purpose of this study is to evaluate the ability of the…
Descriptors: Models, Statistical Analysis, Response Style (Tests), Test Items
Ames, Allison J.; Leventhal, Brian C.; Ezike, Nnamdi C. – Measurement: Interdisciplinary Research and Perspectives, 2020
Data simulation and Monte Carlo simulation studies are important skills for researchers and practitioners of educational and psychological measurement, but there are few resources on the topic specific to item response theory. Even fewer resources exist on the statistical software techniques to implement simulation studies. This article presents…
Descriptors: Monte Carlo Methods, Item Response Theory, Simulation, Computer Software
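A bare-bones skeleton of the kind of Monte Carlo study the article describes: repeatedly generate item response data under a model (here the Rasch model, for simplicity) and summarize an outcome across replications. All sample sizes and parameters are illustrative.

```python
import math
import random

def sim_rasch(thetas, difficulties, rng):
    """Simulate a persons-by-items 0/1 response matrix under the Rasch model."""
    return [[int(rng.random() < 1.0 / (1.0 + math.exp(-(t - b))))
             for b in difficulties]
            for t in thetas]

rng = random.Random(42)
n_reps, n_persons, n_items = 100, 200, 20
mean_scores = []
for _ in range(n_reps):
    thetas = [rng.gauss(0, 1) for _ in range(n_persons)]
    difficulties = [rng.gauss(0, 1) for _ in range(n_items)]
    data = sim_rasch(thetas, difficulties, rng)
    mean_scores.append(sum(map(sum, data)) / n_persons)
avg = sum(mean_scores) / n_reps
print(avg)  # average raw score across replications
```

A full simulation study would vary factors such as sample size and test length in a crossed design and track estimation bias rather than raw scores, but the replicate-generate-summarize loop is the same.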
Karadavut, Tugba; Cohen, Allan S.; Kim, Seock-Ho – Measurement: Interdisciplinary Research and Perspectives, 2020
Mixture Rasch (MixRasch) models conventionally assume normal distributions for latent ability. Previous research has shown that the assumption of normality is often unmet in educational and psychological measurement. When normality is assumed, asymmetry in the actual latent ability distribution has been shown to result in extraction of spurious…
Descriptors: Item Response Theory, Ability, Statistical Distributions, Sample Size
Yormaz, Seha; Sünbül, Önder – Educational Sciences: Theory and Practice, 2017
This study aims to determine the Type I error rates and power of the S1 and S2 indices and the kappa statistic at detecting copying on multiple-choice tests under various conditions. It also examines how the way copying groups are formed affects the Type I error rates and power of the kappa statistic. In this study,…
Descriptors: Statistical Analysis, Cheating, Multiple Choice Tests, Sample Size
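The kappa statistic used for copy detection above is Cohen's kappa applied to two examinees' answer strings: observed agreement corrected for the agreement expected by chance from each examinee's answer-choice frequencies. A minimal sketch with hypothetical answer strings:

```python
from collections import Counter

def cohens_kappa(x, y):
    """Cohen's kappa for two categorical sequences of equal length,
    e.g., the answer strings of a suspected copier and a source."""
    n = len(x)
    p_obs = sum(a == b for a, b in zip(x, y)) / n          # observed agreement
    cx, cy = Counter(x), Counter(y)
    p_exp = sum(cx[k] * cy[k] for k in cx) / (n * n)       # chance agreement
    return (p_obs - p_exp) / (1 - p_exp)

# Hypothetical answer strings: 7 of 8 responses match
print(cohens_kappa("ABCDABCD", "ABCDABCA"))  # ≈ 0.833
```

Values near 1 indicate agreement well beyond chance, which in an answer-copying context flags pairs worth closer scrutiny; the statistic alone does not establish who copied from whom.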
Kilic, Abdullah Faruk; Uysal, Ibrahim; Atar, Burcu – International Journal of Assessment Tools in Education, 2020
This Monte Carlo simulation study aimed to investigate confirmatory factor analysis (CFA) estimation methods under different conditions, such as sample size, distribution of indicators, test length, average factor loading, and factor structure. Binary data were generated to compare the performance of maximum likelihood (ML), mean and variance…
Descriptors: Factor Analysis, Computation, Methods, Sample Size
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Manna, Venessa F.; Gu, Lixiong – ETS Research Report Series, 2019
When using the Rasch model, equating with a nonequivalent groups anchor test design is commonly achieved by adjustment of new form item difficulty using an additive equating constant. Using simulated 5-year data, this report compares 4 approaches to calculating the equating constants and the subsequent impact on equating results. The 4 approaches…
Descriptors: Item Response Theory, Test Items, Test Construction, Sample Size
Qiu, Yuxi; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2019
This study aimed to assess the accuracy of the empirical item characteristic curve (EICC) preequating method given the presence of test speededness. The simulation design of this study considered the proportion of speededness, speededness point, speededness rate, proportion of missing on speeded items, sample size, and test length. After crossing…
Descriptors: Accuracy, Equated Scores, Test Items, Nonparametric Statistics
Lee, Soo; Bulut, Okan; Suh, Youngsuk – Educational and Psychological Measurement, 2017
A number of studies have found multiple indicators multiple causes (MIMIC) models to be an effective tool in detecting uniform differential item functioning (DIF) for individual items and item bundles. A recently developed MIMIC-interaction model is capable of detecting both uniform and nonuniform DIF in the unidimensional item response theory…
Descriptors: Test Bias, Test Items, Models, Item Response Theory
Yavuz, Guler; Hambleton, Ronald K. – Educational and Psychological Measurement, 2017
Application of MIRT modeling procedures is dependent on the quality of parameter estimates provided by the estimation software and techniques used. This study investigated model parameter recovery of two popular MIRT packages, BMIRT and flexMIRT, under some common measurement conditions. These packages were specifically selected to investigate the…
Descriptors: Item Response Theory, Models, Comparative Analysis, Computer Software
