Publication Date
| Filter | Records |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 22 |
| Since 2007 (last 20 years) | 35 |
Descriptor
| Filter | Records |
| --- | --- |
| Monte Carlo Methods | 52 |
| Test Length | 52 |
| Item Response Theory | 37 |
| Test Items | 28 |
| Sample Size | 27 |
| Error of Measurement | 15 |
| Accuracy | 13 |
| Comparative Analysis | 13 |
| Computation | 12 |
| Item Analysis | 11 |
| Models | 11 |
Publication Type
| Filter | Records |
| --- | --- |
| Journal Articles | 40 |
| Reports - Research | 33 |
| Reports - Evaluative | 14 |
| Speeches/Meeting Papers | 5 |
| Dissertations/Theses -… | 4 |
| Numerical/Quantitative Data | 1 |
| Reports - Descriptive | 1 |
Education Level
| Filter | Records |
| --- | --- |
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Elementary Education | 1 |
Audience
| Filter | Records |
| --- | --- |
| Researchers | 1 |
Location
| Filter | Records |
| --- | --- |
| Japan | 1 |
de la Torre, Jimmy; Song, Hao – Applied Psychological Measurement, 2009
Assessments consisting of different domains (e.g., content areas, objectives) are typically multidimensional in nature but are commonly assumed to be unidimensional for estimation purposes. The different domains of these assessments are further treated as multi-unidimensional tests for the purpose of obtaining diagnostic information. However, when…
Descriptors: Ability, Tests, Item Response Theory, Data Analysis
Klockars, Alan J.; Lee, Yoonsun – Journal of Educational Measurement, 2008
Monte Carlo simulations with 20,000 replications are reported to estimate the probability of rejecting the null hypothesis regarding DIF using SIBTEST when there is DIF present and/or when impact is present due to differences on the primary dimension to be measured. Sample sizes are varied from 250 to 2000 and test lengths from 10 to 40 items.…
Descriptors: Test Bias, Test Length, Reference Groups, Probability
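The Monte Carlo design in this abstract, simulate data under a known condition many times, run the hypothesis test each time, and report the rejection proportion, can be sketched generically. A minimal stdlib-only sketch in which a two-group z-test on normal samples stands in for SIBTEST (it is not SIBTEST; the function name and all parameter values are illustrative):

```python
import random
import statistics

def rejection_rate(delta, n=250, reps=1000, z_crit=1.96, seed=1):
    """Monte Carlo estimate of the probability of rejecting H0 (no group
    difference) with a two-sided z-test on unit-variance normal samples.
    `delta` is the true mean difference: under delta=0 the rate estimates
    the Type I error; otherwise it estimates power."""
    rng = random.Random(seed)
    se = (2.0 / n) ** 0.5  # standard error of the mean difference
    rejections = 0
    for _ in range(reps):
        ref = statistics.mean(rng.gauss(0.0, 1.0) for _ in range(n))
        foc = statistics.mean(rng.gauss(delta, 1.0) for _ in range(n))
        if abs(foc - ref) / se > z_crit:
            rejections += 1
    return rejections / reps

print(rejection_rate(0.0))  # hovers near the nominal 0.05 level
print(rejection_rate(0.2))  # estimated power against a true difference
```

Varying `n` and `delta` over a grid, as the study varies sample size and test length, turns this loop into a power surface.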
Finch, Holmes – Applied Psychological Measurement, 2010
The accuracy of item parameter estimates in the multidimensional item response theory (MIRT) model context has not been researched in great detail. This study examines the ability of two confirmatory factor analysis models specifically for dichotomous data to properly estimate item parameters using common formulae for converting factor…
Descriptors: Item Response Theory, Computation, Factor Analysis, Models
Wells, Craig S.; Bolt, Daniel M. – Applied Measurement in Education, 2008
Tests of model misfit are often performed to validate the use of a particular model in item response theory. Douglas and Cohen (2001) introduced a general nonparametric approach for detecting misfit under the two-parameter logistic model. However, the statistical properties of their approach, and empirical comparisons to other methods, have not…
Descriptors: Test Length, Test Items, Monte Carlo Methods, Nonparametric Statistics
Monahan, Patrick O.; Stump, Timothy E.; Finch, Holmes; Hambleton, Ronald K. – Applied Psychological Measurement, 2007
DETECT is a nonparametric "full" dimensionality assessment procedure that clusters dichotomously scored items into dimensions and provides a DETECT index of magnitude of multidimensionality. Four factors (test length, sample size, item response theory [IRT] model, and DETECT index) were manipulated in a Monte Carlo study of bias, standard error,…
Descriptors: Test Length, Sample Size, Monte Carlo Methods, Geometric Concepts
Abdel-fattah, Abdel-fattah A. – 1994
The accuracy of estimation procedures in item response theory was studied using Monte Carlo methods and varying sample size, number of subjects, and distribution of ability parameters for: (1) joint maximum likelihood as implemented in the computer program LOGIST; (2) marginal maximum likelihood; and (3) marginal Bayesian procedures as implemented…
Descriptors: Ability, Bayesian Statistics, Estimation (Mathematics), Maximum Likelihood Statistics
Peer reviewed
Cudeck, Robert; And Others – Applied Psychological Measurement, 1979
TAILOR, a computer program which implements an approach to tailored testing, was examined by Monte Carlo methods. The evaluation showed the procedure to be highly reliable and capable of reducing the required number of test items by about one half. (Author/JKS)
Descriptors: Adaptive Testing, Computer Programs, Feasibility Studies, Item Analysis
Peer reviewed
Stark, Stephen; Drasgow, Fritz – Applied Psychological Measurement, 2002
Describes item response and information functions for the Zinnes and Griggs paired comparison item response theory (IRT) model (1974) and presents procedures for estimating stimulus and person parameters. Monte Carlo simulations show that at least 400 ratings are required to obtain reasonably accurate estimates of the stimulus parameters and their…
Descriptors: Comparative Analysis, Computer Simulation, Error of Measurement, Item Response Theory
Peer reviewed
Noonan, Brian W.; And Others – Applied Psychological Measurement, 1992
Studied the extent to which three appropriateness indexes, Z₃, ECIZ4, and W, are well standardized in a Monte Carlo study. The ECIZ4 most closely approximated a normal distribution, and its skewness and kurtosis were more stable and less affected by test length and item response theory model than the others. (SLD)
Descriptors: Comparative Analysis, Item Response Theory, Mathematical Models, Maximum Likelihood Statistics
Peer reviewed
Stone, Clement A. – Applied Psychological Measurement, 1992
Monte Carlo methods are used to evaluate marginal maximum likelihood estimation of item parameters and maximum likelihood estimates of theta in the two-parameter logistic model for varying test lengths, sample sizes, and assumed theta distributions. Results with 100 datasets demonstrate the methods' general precision and stability. Exceptions are…
Descriptors: Computer Software Evaluation, Estimation (Mathematics), Mathematical Models, Maximum Likelihood Statistics
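The two-parameter logistic model named in the Stone abstract has a simple closed form, P(correct) = 1 / (1 + exp(-a(θ - b))). A minimal sketch of simulating responses under it and recovering θ, assuming a grid-search maximum-likelihood estimator in place of the marginal-maximum-likelihood routines the study actually evaluates (all parameter values are made up):

```python
import math
import random

def p2pl(theta, a, b):
    """2PL item response function: probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def theta_mle(responses, a_params, b_params):
    """Maximum-likelihood theta by grid search over [-4, 4]
    (an illustrative stand-in for Newton-type estimators)."""
    grid = [g / 100.0 for g in range(-400, 401)]
    def loglik(t):
        ll = 0.0
        for u, a, b in zip(responses, a_params, b_params):
            p = p2pl(t, a, b)
            ll += math.log(p) if u == 1 else math.log(1.0 - p)
        return ll
    return max(grid, key=loglik)

# Simulate a 40-item test for one examinee with known true theta.
random.seed(7)
a = [1.0 + 0.5 * random.random() for _ in range(40)]  # discriminations
b = [random.gauss(0.0, 1.0) for _ in range(40)]       # difficulties
true_theta = 0.5
u = [1 if random.random() < p2pl(true_theta, ai, bi) else 0
     for ai, bi in zip(a, b)]
print(round(theta_mle(u, a, b), 2))  # should land near true_theta
```

Repeating the simulation across many examinees and datasets, as these studies do, gives the bias and variability of the estimates.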
Peer reviewed
Reise, Steven P.; Due, Allan M. – Applied Psychological Measurement, 1991
Previous person-fit research is extended through explication of an unexplored model for generating aberrant response patterns. The proposed model is then implemented to investigate the influence of test properties on the aberrancy detection power of a person-fit statistic. Difficulties of aberrancy detection are discussed. (SLD)
Descriptors: Algorithms, Computer Simulation, Item Response Theory, Mathematical Models
Peer reviewed
Van Der Linden, Wim J. – Educational and Psychological Measurement, 1983
This paper focuses on mixtures of two binomials with one known success parameter. It is shown how moment estimators can be obtained for the remaining unknown parameters of such mixtures, and results are presented from a Monte Carlo study carried out to explore the statistical properties of these estimators. (PN)
Descriptors: Educational Testing, Error of Measurement, Estimation (Mathematics), Guessing (Tests)
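Moment estimators for a two-component binomial mixture with one known success parameter, as in the Van Der Linden abstract, can be obtained in closed form from the first two factorial moments. A sketch under an assumed parameterization π·Bin(n, c) + (1-π)·Bin(n, p) with c known (the symbols and values are illustrative, not taken from the paper):

```python
import random

def moment_fit(xs, n, c):
    """Method-of-moments estimates (pi, p) for the mixture
    pi*Bin(n, c) + (1 - pi)*Bin(n, p) with c known, using
        m1 = E[X] / n              = pi*c   + (1-pi)*p
        m2 = E[X(X-1)] / (n(n-1))  = pi*c^2 + (1-pi)*p^2
    which solve in closed form: 1 - pi = (m1-c)^2 / (m2 - 2*c*m1 + c^2)."""
    m1 = sum(xs) / (len(xs) * n)
    m2 = sum(x * (x - 1) for x in xs) / (len(xs) * n * (n - 1))
    q = (m1 - c) ** 2 / (m2 - 2 * c * m1 + c * c)  # q = 1 - pi
    pi = 1.0 - q
    p = (m1 - pi * c) / q
    return pi, p

def draw(n, prob, rng):
    """One binomial draw as a sum of Bernoulli trials."""
    return sum(1 for _ in range(n) if rng.random() < prob)

# Simulate, then recover the unknown parameters pi and p.
rng = random.Random(3)
n, c, true_pi, true_p = 40, 0.25, 0.3, 0.7
xs = [draw(n, c if rng.random() < true_pi else true_p, rng)
      for _ in range(5000)]
pi_hat, p_hat = moment_fit(xs, n, c)
print(round(pi_hat, 2), round(p_hat, 2))  # near (0.3, 0.7)
```

A Monte Carlo study of these estimators, as in the paper, repeats the simulate-and-fit cycle to trace their sampling distributions.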
de la Torre, Jimmy; Stark, Stephen; Chernyshenko, Oleksandr S. – Applied Psychological Measurement, 2006
The authors present a Markov Chain Monte Carlo (MCMC) parameter estimation procedure for the generalized graded unfolding model (GGUM) and compare it to the marginal maximum likelihood (MML) approach implemented in the GGUM2000 computer program, using simulated and real personality data. In the simulation study, test length, number of response…
Descriptors: Computation, Monte Carlo Methods, Markov Processes, Item Response Theory
Gilmer, Jerry S.; Feldt, Leonard S. – 1982
The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…
Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models
Ankenmann, Robert D.; Stone, Clement A. – 1992
Effects of test length, sample size, and assumed ability distribution were investigated in a multiple replication Monte Carlo study under the 1-parameter (1P) and 2-parameter (2P) logistic graded model with five score levels. Accuracy and variability of item parameter and ability estimates were examined. Monte Carlo methods were used to evaluate…
Descriptors: Computer Simulation, Estimation (Mathematics), Item Bias, Mathematical Models
