Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 3 |
Descriptor
| Maximum Likelihood Statistics | 7 |
| Monte Carlo Methods | 7 |
| Test Reliability | 7 |
| Test Items | 4 |
| Ability | 2 |
| Adaptive Testing | 2 |
| Comparative Analysis | 2 |
| Computation | 2 |
| Computer Assisted Testing | 2 |
| Equations (Mathematics) | 2 |
| Error of Measurement | 2 |
| More ▼ | |
Source
| Educational and Psychological… | 2 |
| ETS Research Report Series | 1 |
| Journal of Educational Issues | 1 |
| Psychometrika | 1 |
Author
| Andersson, Björn | 1 |
| Eason, Hershel | 1 |
| Jin, Ying | 1 |
| Kim, Jwa K. | 1 |
| Maurelli, Vincent A. | 1 |
| Nicewander, W. Alan | 1 |
| Patience, Wayne M. | 1 |
| Reckase, Mark D. | 1 |
| Wang, Zhen | 1 |
| Weiss, David J. | 1 |
| Wilcox, Rand R. | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 6 |
| Journal Articles | 5 |
| Reports - Evaluative | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
Jin, Ying; Eason, Hershel – Journal of Educational Issues, 2016
The effects of mean ability difference (MAD) and short tests on the performance of various DIF methods have been studied extensively in previous simulation studies. Their effects, however, have not been studied under multilevel data structure. MAD was frequently observed in large-scale cross-country comparison studies where the primary sampling…
Descriptors: Test Bias, Simulation, Hierarchical Linear Modeling, Comparative Analysis
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation
Peer reviewedWilcox, Rand R. – Educational and Psychological Measurement, 1979
For some situations the beta-binomial distribution might be used to describe the marginal distribution of test scores for a particular population of examinees. Several different methods of approximating the maximum likelihood estimate were investigated, and it was found that the Newton-Raphson method should be used when it yields admissable…
Descriptors: Criterion Referenced Tests, Maximum Likelihood Statistics, Measurement, Monte Carlo Methods
Peer reviewedKim, Jwa K.; Nicewander, W. Alan – Psychometrika, 1993
Bias, standard error, and reliability of five ability estimators were evaluated using Monte Carlo estimates of the unknown conditional means and variances of the estimators. Results indicate that estimates based on Bayesian modal, expected a posteriori, and weighted likelihood estimators were reasonably unbiased with relatively small standard…
Descriptors: Ability, Bayesian Statistics, Equations (Mathematics), Error of Measurement
Maurelli, Vincent A.; Weiss, David J. – 1981
A monte carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory
Patience, Wayne M.; Reckase, Mark D. – 1979
An experiment was performed with computer-generated data to investigate some of the operational characteristics of tailored testing as they are related to various provisions of the computer program and item pool. With respect to the computer program, two characteristics were varied: the size of the step of increase or decrease in item difficulty…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Error of Measurement

Direct link
