Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 15 |
Descriptor
Source
Author
Publication Type
Education Level
| Higher Education | 2 |
| Grade 4 | 1 |
| Grade 8 | 1 |
| Postsecondary Education | 1 |
Audience
| Researchers | 49 |
| Practitioners | 1 |
Location
| Netherlands | 5 |
| United States | 3 |
| Australia | 2 |
| Belgium | 2 |
| Italy | 2 |
| California | 1 |
| China | 1 |
| Denmark | 1 |
| Florida | 1 |
| Georgia | 1 |
| Hungary | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Reckase, Mark D. – 1978
Five comparisons were made relative to the quality of estimates of ability parameters and item calibrations obtained from the one-parameter and three-parameter logistic models. The results indicate: (1) The three-parameter model fit the test data better in all cases than did the one-parameter model. For simulation data sets, multi-factor data were…
Descriptors: Comparative Analysis, Goodness of Fit, Item Analysis, Mathematical Models
Forster, Fred; And Others – 1978
Research on the Rasch model of test and item analysis was applied to tests constructed from item banks for reading and mathematics with respect to five practical problems for scaling items and equating test forms. The questions were: (1) Does the Rasch model yield the same scale value regardless of the student sample? (2) How many students are…
Descriptors: Achievement Tests, Difficulty Level, Elementary Secondary Education, Equated Scores
deGruijter, Dato N. M. – 1980
The setting of standards involves subjective value judgments. The inherent arbitrariness of specific standards has been severely criticized by Glass. His antagonists agree that standard setting is a judgmental task but they have pointed out that arbitrariness in the positive sense of serious judgmental decisions is unavoidable. Further, small…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
McKinley, Robert L.; Reckase, Mark D. – 1980
A study was conducted to compare the quality of the item parameter estimates obtained from the ANCILLES and LOGIST estimation procedures using goodness of fit as a criterion. Statistics used to compare the fit included a chi-square statistic and a mean square deviation statistic. Other analyses performed included comparisons of the distributions…
Descriptors: Comparative Analysis, Computer Programs, Difficulty Level, Goodness of Fit
Peer reviewedHambleton, Ronald K.; De Gruijter, Dato N. M. – Journal of Educational Measurement, 1983
Addressing the shortcomings of classical item statistics for selecting criterion-referenced test items, this paper describes an optimal item selection procedure utilizing item response theory (IRT) and offers examples in which random selection and optimal item selection methods are compared. Theoretical advantages of optimal selection based upon…
Descriptors: Criterion Referenced Tests, Cutting Scores, Item Banks, Latent Trait Theory
Peer reviewedWhitely, Susan E. – Intelligence, 1980
This article examines the potential contribution of latent trait models to the study of intelligence. Nontechnical introductions to both unidimensional and multidimensional latent trait models are given. Multidimensional latent trait models can be used to test alternative multiple component theories of test item processing. (Author/CTM)
Descriptors: Ability, Aptitude Tests, Cognitive Processes, Intelligence
Peer reviewedKim, Seock-Ho; Cohen, Allan S. – Journal of Educational Measurement, 1992
Effects of the following methods for linking metrics on detection of differential item functioning (DIF) were compared: (1) test characteristic curve method (TCC); (2) weighted mean and sigma method; and (3) minimum chi-square method. With large samples, results were essentially the same. With small samples, TCC was most accurate. (SLD)
Descriptors: Chi Square, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedAlbert, James H. – Journal of Educational Statistics, 1992
Estimating item parameters from a two-parameter normal ogive model is considered using Gibbs sampling to simulate draws from the joint posterior distribution of ability and item parameters. The method gives marginal posterior density estimates for any parameter of interest, as illustrated using data from a 33-item mathematics placement…
Descriptors: Algorithms, Bayesian Statistics, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedWilson, Mark; Masters, Geoffery N. – Psychometrika, 1993
A strategy is described for dealing with measurement situations in which certain categories of responses are null, that is, persons do not respond in certain categories to certain items. The method is described for the partial credit model while maintaining the integrity of the original response framework. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Response Theory, Mathematical Models
Peer reviewedDodd, Barbara G. – Applied Psychological Measurement, 1990
Using one simulated and two real data sets, the effects of the systematic variation of the item-selection procedure and the stepsize method on the operating characteristics of computerized adaptive testing (CAT) for instruments with polychotomously scored rating scale items were studied. The six rating scale CAT procedures used performed well.…
Descriptors: Adaptive Testing, Attitude Measures, Comparative Analysis, Computer Assisted Testing
Lietz, Petra H.; Roche, Lawrence A. – 1996
This study investigates whether or not the factor structure of reading comprehension is invariant across large, nationally representative samples of 14-year-old students from four different countries. The data from French-speaking Belgium, Hungary, Italy, and the United States were collected as part of the Reading Literacy Study of 1990-91,…
Descriptors: Adolescents, Correlation, Databases, Factor Analysis
Beller, Michal – 1992
It has previously been shown by M. Beller (1990) that an additive tree (Addtree, a hierarchical tree representation of similarity data developed by S. Sattath and A. Tversky in 1977), may be useful for representing the structure between tests and items through the similarity among them as measured by their intercorrelations. In this study, the…
Descriptors: College Entrance Examinations, Decision Making, Difficulty Level, Equations (Mathematics)
Nandakumar, Ratna – 1992
The performance of the following four methodologies for assessing unidimensionality was examined: (1) DIMTEST; (2) the approach of P. W. Holland and P. R. Rosenbaum; (3) linear factor analysis; and (4) non-linear factor analysis. Each method is examined and compared with other methods using simulated data sets and real data sets. Seven data sets,…
Descriptors: Ability, Comparative Testing, Correlation, Equations (Mathematics)
Engelen, Ronald J. H.; Jannarone, Robert J. – 1989
The purpose of this paper is to link empirical Bayes methods with two specific topics in item response theory--item/subtest regression, and testing the goodness of fit of the Rasch model--under the assumptions of local independence and sufficiency. It is shown that item/subtest regression results in empirical Bayes estimates only if the Rasch…
Descriptors: Bayesian Statistics, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
DeAyala, R. J.; Koch, William R. – 1987
A nominal response model-based computerized adaptive testing procedure (nominal CAT) was implemented using simulated data. Ability estimates from the nominal CAT were compared to those from a CAT based upon the three-parameter logistic model (3PL CAT). Furthermore, estimates from both CAT procedures were compared with the known true abilities used…
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation


