ERIC - Search Results

Showing all 7 results

Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients

Peer reviewed

Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018

In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…

Descriptors: Item Response Theory, Test Reliability, Test Items, Scores

DIF Analysis with Multilevel Data: A Simulation Study Using the Latent Variable Approach

Peer reviewed

Jin, Ying; Eason, Hershel – Journal of Educational Issues, 2016

The effects of mean ability difference (MAD) and short tests on the performance of various DIF methods have been studied extensively in previous simulation studies. Their effects, however, have not been studied under a multilevel data structure. MAD was frequently observed in large-scale cross-country comparison studies where the primary sampling…

Descriptors: Test Bias, Simulation, Hierarchical Linear Modeling, Comparative Analysis

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

Estimating the Parameters of the Beta-Binomial Distribution.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1979

For some situations the beta-binomial distribution might be used to describe the marginal distribution of test scores for a particular population of examinees. Several different methods of approximating the maximum likelihood estimate were investigated, and it was found that the Newton-Raphson method should be used when it yields admissible…

Descriptors: Criterion Referenced Tests, Maximum Likelihood Statistics, Measurement, Monte Carlo Methods

Ability Estimation for Conventional Tests.

Peer reviewed

Kim, Jwa K.; Nicewander, W. Alan – Psychometrika, 1993

Bias, standard error, and reliability of five ability estimators were evaluated using Monte Carlo estimates of the unknown conditional means and variances of the estimators. Results indicate that estimates based on Bayesian modal, expected a posteriori, and weighted likelihood estimators were reasonably unbiased with relatively small standard…

Descriptors: Ability, Bayesian Statistics, Equations (Mathematics), Error of Measurement

Factors Influencing the Psychometric Characteristics of an Adaptive Testing Strategy for Test Batteries.

Maurelli, Vincent A.; Weiss, David J. – 1981

A Monte Carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory

Operational Characteristics of a One-Parameter Tailored Testing Procedure. Research Report 79-2.

Patience, Wayne M.; Reckase, Mark D. – 1979

An experiment was performed with computer-generated data to investigate some of the operational characteristics of tailored testing as they are related to various provisions of the computer program and item pool. With respect to the computer program, two characteristics were varied: the size of the step of increase or decrease in item difficulty…

Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Error of Measurement