Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019
Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…
Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures
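The idea behind plausible values can be sketched in a few lines: draw several values from each examinee's posterior over the latent trait, pool them, and compute the population statistic from the pooled draws. This is a minimal illustration with made-up numbers (posterior approximated as normal around the true trait with an assumed standard deviation), not the estimation machinery used in the study.

```python
import random

random.seed(1)

# Hypothetical sketch: each examinee's posterior over theta is approximated
# as Normal(theta, se). Drawing M plausible values per examinee and pooling
# them recovers the population mean of the latent trait.
N, M = 2000, 5                      # examinees, plausible values each
true_thetas = [random.gauss(0.0, 1.0) for _ in range(N)]
se = 0.5                            # assumed posterior standard deviation

plausible = [[random.gauss(t, se) for _ in range(M)] for t in true_thetas]

# Population-level statistic: mean over all pooled plausible values.
pooled = [pv for row in plausible for pv in row]
pop_mean = sum(pooled) / len(pooled)
print(round(pop_mean, 2))
```

With a large sample, the pooled mean lands close to the population mean of 0; the article's question is how few draws per examinee (M) suffice for statistics like this versus for individual point estimates.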
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
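The normality assumption the abstract refers to can be made concrete by simulating item responses from a deliberately skewed latent trait. The sketch below (all parameter values are illustrative, not from the article) generates a standardized chi-square trait and two-parameter logistic responses; estimators that assume a normal trait would be fit to data like these.

```python
import math
import random

random.seed(7)

def irf_2pl(theta, a, b):
    """Two-parameter logistic item response function."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical sketch: a right-skewed latent trait (standardized chi-square
# with 3 df, mean 0, variance 1) instead of the normal distribution assumed
# by standard IRT estimation routines.
df = 3
thetas = [(sum(random.gauss(0, 1) ** 2 for _ in range(df)) - df) / math.sqrt(2 * df)
          for _ in range(1000)]

a, b = 1.2, 0.0                     # assumed discrimination and difficulty
responses = [1 if random.random() < irf_2pl(t, a, b) else 0 for t in thetas]

# Under skew, the observed proportion correct need not match what the
# item response function predicts at the mean trait value.
p_obs = sum(responses) / len(responses)
print(round(p_obs, 2))
```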
Sun, Shan-Shan; Tao, Jian; Chang, Hua-Hua; Shi, Ning-Zhong – Applied Psychological Measurement, 2012
For mixed-type tests composed of dichotomous and polytomous items, polytomous items often yield more information than dichotomous items. To reflect the difference between the two types of items and to improve the precision of ability estimation, an adaptive weighted maximum-a-posteriori (WMAP) estimation is proposed. To evaluate the performance of…
Descriptors: Monte Carlo Methods, Computation, Item Response Theory, Weighted Scores
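A maximum-a-posteriori (MAP) ability estimate with per-item weights can be sketched by grid search over the log-posterior. The per-item `weights` here are a hypothetical stand-in for the adaptive weighting idea (weights of 1.0 reduce it to ordinary MAP); the item parameters and prior are assumed, not taken from the study.

```python
import math

def irf(theta, a, b):
    """Two-parameter logistic item response function."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def weighted_map(responses, items, weights, grid_step=0.01):
    """Grid-search MAP estimate of ability under a standard normal prior.

    `weights` multiplies each item's log-likelihood contribution; this is a
    hypothetical simplification of adaptive weighting, not the WMAP method
    itself.
    """
    best_theta, best_lp = 0.0, -math.inf
    theta = -4.0
    while theta <= 4.0:
        lp = -0.5 * theta ** 2          # N(0, 1) log-prior, up to a constant
        for u, (a, b), w in zip(responses, items, weights):
            p = irf(theta, a, b)
            lp += w * (math.log(p) if u == 1 else math.log(1.0 - p))
        if lp > best_lp:
            best_theta, best_lp = theta, lp
        theta += grid_step
    return best_theta

items = [(1.0, -1.0), (1.2, 0.0), (0.8, 1.0)]   # assumed (a, b) pairs
theta_hat = weighted_map([1, 1, 0], items, [1.0, 1.0, 1.0])
print(round(theta_hat, 2))
```

Getting two of three items right, missing the hardest, yields a modestly positive estimate pulled toward 0 by the prior.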
Wolkowitz, Amanda A.; Skorupski, William P. – Educational and Psychological Measurement, 2013
When missing values are present in item response data, there are a number of ways one might impute a correct or incorrect response to a multiple-choice item. There are significantly fewer methods for imputing the actual response option an examinee may have provided if he or she had not omitted the item either purposely or accidentally. This…
Descriptors: Multiple Choice Tests, Statistical Analysis, Models, Accuracy
Jiao, Hong; Kamata, Akihito; Wang, Shudong; Jin, Ying – Journal of Educational Measurement, 2012
The applications of item response theory (IRT) models assume local item independence and that examinees are independent of each other. When a representative sample for psychometric analysis is selected using a cluster sampling method in a testlet-based assessment, both local item dependence and local person dependence are likely to be induced.…
Descriptors: Item Response Theory, Test Items, Markov Processes, Monte Carlo Methods

Meijer, Rob R.; And Others – Applied Psychological Measurement, 1995
Three methods based on the nonparametric item response theory (IRT) of R. J. Mokken for the estimation of the reliability of single dichotomous test items are discussed. Analytical and Monte Carlo studies show that one method, designated "MS," is superior because of smaller bias and smaller sampling variance. (SLD)
Descriptors: Estimation (Mathematics), Item Response Theory, Monte Carlo Methods, Nonparametric Statistics
Meijer, Rob R.; And Others – 1994
Three methods for the estimation of the reliability of single dichotomous items are discussed. All methods are based on the assumptions of nondecreasing and nonintersecting item response functions and the Mokken model of double monotonicity. Based on analytical and Monte Carlo studies, it is concluded that one method is superior to the other two…
Descriptors: Estimation (Mathematics), Foreign Countries, Item Response Theory, Monte Carlo Methods
Kromrey, Jeffrey D.; Bacon, Tina P. – 1992
A Monte Carlo study was conducted to estimate the small sample standard errors and statistical bias of psychometric statistics commonly used in the analysis of achievement tests. The statistics examined in this research were: (1) the index of item difficulty; (2) the index of item discrimination; (3) the corrected item-total point-biserial…
Descriptors: Achievement Tests, Comparative Analysis, Difficulty Level, Estimation (Mathematics)
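The general recipe of such a study — repeatedly draw small samples from a known population, compute the statistic each time, and summarize bias and sampling variability — is easy to sketch. This minimal version (illustrative numbers, not the study's design) does it for the index of item difficulty, the proportion correct, and checks the empirical standard error against the analytic one.

```python
import math
import random

random.seed(42)

# Hypothetical sketch: Monte Carlo estimate of the small-sample bias and
# standard error of the classical item-difficulty index (proportion correct).
p_true = 0.7        # assumed population difficulty
n = 25              # small sample size
reps = 5000

estimates = []
for _ in range(reps):
    sample = [1 if random.random() < p_true else 0 for _ in range(n)]
    estimates.append(sum(sample) / n)

mean_est = sum(estimates) / reps
bias = mean_est - p_true
se = math.sqrt(sum((e - mean_est) ** 2 for e in estimates) / (reps - 1))

# The analytic standard error sqrt(p(1-p)/n) serves as a simulation check.
se_analytic = math.sqrt(p_true * (1 - p_true) / n)
print(round(bias, 3), round(se, 3), round(se_analytic, 3))
```

The proportion correct is unbiased even at n = 25; the interest in a study like this lies in statistics (discrimination, corrected point-biserial) whose small-sample behavior is less benign.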
Samejima, Fumiko – 1986
Item analysis data fitting the normal ogive model were simulated in order to investigate the problems encountered when applying the three-parameter logistic model. Binary item tests containing 10 and 35 items were created, and Monte Carlo methods simulated the responses of 2,000 and 500 examinees. Item parameters were obtained using Logist 5.…
Descriptors: Computer Simulation, Difficulty Level, Guessing (Tests), Item Analysis
Du Bose, Pansy; Kromrey, Jeffrey D. – 1993
Empirical evidence is presented of the relative efficiency of two potential linkage plans to be used when equivalent test forms are being administered. Equating is a process by which scores on one form of a test are converted to scores on another form of the same test. A Monte Carlo study was conducted to examine equating stability and statistical…
Descriptors: Art Education, Comparative Testing, Computer Simulation, Equated Scores
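The conversion step that equating performs can be illustrated with the simplest case, linear equating, which maps a Form X score to the Form Y scale so that standardized scores agree. The moments below are hypothetical; the study's linkage plans and equating method are not specified in the abstract.

```python
def linear_equate(x, mean_x, sd_x, mean_y, sd_y):
    """Linear equating: map a Form X score to the Form Y scale so that
    standardized scores match, y = mean_y + (sd_y / sd_x) * (x - mean_x)."""
    return mean_y + (sd_y / sd_x) * (x - mean_x)

# Hypothetical moments for two forms.
y = linear_equate(30, mean_x=28.0, sd_x=6.0, mean_y=31.0, sd_y=5.0)
print(y)  # 30 is one third of an SD above the Form X mean, so maps to 31 + 5/3
```

A Monte Carlo study of equating stability would repeat such a conversion over many simulated samples and examine how much the estimated transformation varies.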
Patience, Wayne M.; Reckase, Mark D. – 1979
An experiment was performed with computer-generated data to investigate some of the operational characteristics of tailored testing as they are related to various provisions of the computer program and item pool. With respect to the computer program, two characteristics were varied: the size of the step of increase or decrease in item difficulty…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Error of Measurement
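The "size of the step" characteristic the abstract mentions can be illustrated with the simplest tailored-testing rule: raise the difficulty of the next item after a correct answer, lower it after an incorrect one, by a fixed step. The Rasch response model, step size, and test length below are assumed for illustration, not taken from the study.

```python
import math
import random

random.seed(3)

def prob_correct(theta, b):
    """Rasch (1PL) probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def tailored_test(theta, step=0.5, n_items=30):
    """Fixed-step up-and-down tailored testing: increase the next item's
    difficulty after a correct answer, decrease it after an incorrect one.
    The difficulty sequence drifts toward the examinee's ability."""
    b = 0.0
    for _ in range(n_items):
        if random.random() < prob_correct(theta, b):
            b += step
        else:
            b -= step
    return b

final_b = tailored_test(theta=1.0)
print(round(final_b, 1))
```

Averaged over many simulated tests, the final difficulty settles near the examinee's true ability; a smaller step converges more slowly but oscillates less, which is exactly the kind of trade-off a simulation study of step size examines.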