Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024
A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…
Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics
Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019
The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…
Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level
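The 3PL model named in this abstract has a standard closed form; the sketch below uses illustrative parameter values, not values from the study.

```python
import math

def p_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL IRT model:
    a = discrimination, b = difficulty, c = pseudo-chance (lower asymptote),
    theta = person ability."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# Example: an average-ability examinee on a moderately hard item.
p = p_3pl(theta=0.0, a=1.2, b=0.5, c=0.2)
```

Note that the pseudo-chance parameter `c` floors the response probability, which is why estimating it reliably is harder than estimating difficulty or discrimination.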
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
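For context on the reliability coefficients discussed above: a marginal reliability can be sketched as the ratio of true-score variance to observed-score variance. The simulation below uses made-up values, not the authors' analytic standard-error expressions.

```python
import random
import statistics

random.seed(7)

# Simulated "true" abilities and noisy ability estimates with a
# known (assumed) error standard deviation.
n, err_sd = 1000, 0.5
theta = [random.gauss(0.0, 1.0) for _ in range(n)]
theta_hat = [t + random.gauss(0.0, err_sd) for t in theta]

# Marginal reliability: true-score variance over observed-score variance.
obs_var = statistics.variance(theta_hat)
reliability = (obs_var - err_sd ** 2) / obs_var
```

With unit-variance abilities and error SD 0.5 the population value is 1/1.25 = 0.8; the paper's point is that an estimate like `reliability` itself has sampling variability that is rarely reported.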
Lamsal, Sunil – ProQuest LLC, 2015
Different estimation procedures have been developed for the unidimensional three-parameter item response theory (IRT) model. These techniques include marginal maximum likelihood estimation, fully Bayesian estimation using Markov chain Monte Carlo simulation techniques, and Metropolis-Hastings Robbins-Monro estimation. With each…
Descriptors: Item Response Theory, Monte Carlo Methods, Maximum Likelihood Statistics, Markov Processes
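As a minimal illustration of the MCMC approach named above, the sketch below runs a random-walk Metropolis sampler for a single Rasch item difficulty from simulated responses. It is a toy (abilities treated as known, one item, normal prior), not the MH-RM or fully Bayesian machinery compared in the dissertation.

```python
import math
import random

random.seed(1)

def rasch_p(theta, b):
    """P(correct) under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

# Simulate responses to one item (true difficulty 0.5) from
# 500 examinees with standard-normal abilities.
true_b = 0.5
thetas = [random.gauss(0.0, 1.0) for _ in range(500)]
responses = [1 if random.random() < rasch_p(t, true_b) else 0 for t in thetas]

def log_post(b):
    """Log posterior: Bernoulli likelihood (abilities treated as known)
    plus a N(0, 2^2) prior on the difficulty b."""
    ll = sum(math.log(rasch_p(t, b)) if y else math.log(1.0 - rasch_p(t, b))
             for t, y in zip(thetas, responses))
    return ll - b * b / 8.0

# Random-walk Metropolis sampler with a short burn-in.
b, lp, samples = 0.0, log_post(0.0), []
for step in range(3000):
    prop = b + random.gauss(0.0, 0.2)
    lp_prop = log_post(prop)
    if math.log(random.random()) < lp_prop - lp:  # accept/reject
        b, lp = prop, lp_prop
    if step >= 1000:  # discard burn-in draws
        samples.append(b)

posterior_mean = sum(samples) / len(samples)
```

The posterior mean should land near the generating difficulty of 0.5; real IRT MCMC (as in WinBUGS) samples abilities and all item parameters jointly.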
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
Koziol, Natalie A. – Applied Measurement in Education, 2016
Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…
Descriptors: Classification, Accuracy, Comparative Analysis, Models
Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2013
The usefulness of the l_z person-fit index was investigated with achievement test data from 20 exams given to more than 3,200 college students. Results for three methods of estimating θ showed that the distributions of l_z were not consistent with its theoretical distribution, resulting in general overfit to the item response…
Descriptors: Achievement Tests, College Students, Goodness of Fit, Item Response Theory
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation
Jiao, Hong; Wang, Shudong; He, Wei – Journal of Educational Measurement, 2013
This study demonstrated the equivalence between the Rasch testlet model and the three-level one-parameter testlet model and explored the Markov Chain Monte Carlo (MCMC) method for model parameter estimation in WinBUGS. The estimation accuracy from the MCMC method was compared with those from the marginalized maximum likelihood estimation (MMLE)…
Descriptors: Computation, Item Response Theory, Models, Monte Carlo Methods

Cohen, Allan S.; And Others – Applied Psychological Measurement, 1996
Type I error rates for the likelihood ratio test for detecting differential item functioning (DIF) were investigated using Monte Carlo simulations. Type I error rates for the two-parameter model were within theoretically expected values at each alpha level, but those for the three-parameter model were not. (SLD)
Descriptors: Identification, Item Bias, Item Response Theory, Maximum Likelihood Statistics
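The Type I error study summarized above follows a standard Monte Carlo recipe: generate data under the null hypothesis, run the test many times, and count rejections. A generic sketch (using a simple one-sample z-test in place of the IRT likelihood ratio test, purely for illustration) looks like this:

```python
import random
import statistics

random.seed(42)

ALPHA = 0.05
Z_CRIT = 1.959964  # two-sided 5% critical value for the standard normal

def one_replication(n=100):
    """Simulate one data set under the null (mean 0) and test H0: mu = 0."""
    sample = [random.gauss(0.0, 1.0) for _ in range(n)]
    z = statistics.mean(sample) * (n ** 0.5) / statistics.stdev(sample)
    return abs(z) > Z_CRIT

# Empirical Type I error rate over many replications; it should sit
# near the nominal alpha if the test is well calibrated.
reps = 2000
rejections = sum(one_replication() for _ in range(reps))
type_i_rate = rejections / reps
```

A well-calibrated test yields an empirical rate near the nominal 0.05; the finding above is that the likelihood ratio test achieved this under the two-parameter model but not the three-parameter model.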
Kim, Seock-Ho; Cohen, Allan S. – 1997
Type I error rates of the likelihood ratio test for the detection of differential item functioning (DIF) were investigated using Monte Carlo simulations. The graded response model with five ordered categories was used to generate data sets of a 30-item test for samples of 300 and 1,000 simulated examinees. All DIF comparisons were simulated by…
Descriptors: Ability, Classification, Computer Simulation, Estimation (Mathematics)

Kim, Seock-Ho; And Others – Applied Psychological Measurement, 1994
Type I error rates of F. M. Lord's chi square test for differential item functioning were investigated using Monte Carlo simulations with marginal maximum likelihood estimation and marginal Bayesian estimation algorithms. Lord's chi square did not provide useful Type I error control for the three-parameter logistic model at these sample sizes.…
Descriptors: Algorithms, Bayesian Statistics, Chi Square, Error of Measurement
Maurelli, Vincent A.; Weiss, David J. – 1981
A Monte Carlo simulation was conducted to assess the effects, in an adaptive testing strategy for test batteries, of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory
de la Torre, Jimmy; Patz, Richard J. – 2001
This paper seeks to extend the application of Markov chain Monte Carlo (MCMC) methods in item response theory (IRT) to include the estimation of equating relationships along with the estimation of test item parameters. A method is proposed that incorporates estimation of the equating relationship in the item calibration phase. Item parameters from…
Descriptors: Achievement Tests, Bayesian Statistics, Equated Scores, Estimation (Mathematics)
Xiao, Beiling – 1990
Dichotomous search strategies (DSSs) for computerized adaptive testing are similar to golden section search strategies (GSSSs). Each middle point of successive search regions is a testing point. After each item is administered, the subject's obtained score is compared with the expected score at successive testing points. If the subject's obtained…
Descriptors: Ability Identification, Adaptive Testing, Computer Assisted Testing, Equations (Mathematics)
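The bisection idea this abstract describes can be sketched as a toy routine. This is a simplification: a real dichotomous search strategy compares obtained and expected scores under an IRT model, and examinee responses are probabilistic rather than deterministic.

```python
def dichotomous_search(answers_correctly, lo=-3.0, hi=3.0, n_items=8):
    """Toy dichotomous search: administer an item at the midpoint of the
    current ability interval, then halve the interval toward the side
    indicated by the response. `answers_correctly(b)` simulates whether
    the examinee answers an item of difficulty b correctly."""
    for _ in range(n_items):
        mid = (lo + hi) / 2.0
        if answers_correctly(mid):
            lo = mid   # correct answer: ability is above this difficulty
        else:
            hi = mid   # incorrect: ability is below this difficulty
    return (lo + hi) / 2.0

# Deterministic examinee with true ability 1.0 (answers correctly
# whenever the item difficulty is below that ability).
est = dichotomous_search(lambda b: b < 1.0)
```

Each administered item halves the search interval, so eight items pin the ability down to within 6/2^8 ≈ 0.023 in this deterministic toy.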