Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023
Many organizations utilize some form of automation in the test assembly process, either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes, the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…
Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models

Wright, Benjamin D.; Douglas, Graham A. – Educational and Psychological Measurement, 1977
Two procedures for Rasch, sample-free item calibration are reviewed and compared for accuracy. The theoretically ideal "conditional" procedure is impractical for more than fifteen items. The more practical but biased "unconditional" procedure is discussed in detail. (Author/JKS)
Descriptors: Comparative Analysis, Item Analysis, Latent Trait Theory, Mathematical Models

Rosenbaum, Paul R. – Psychometrika, 1987
This paper develops and applies three nonparametric comparisons of the shapes of two item characteristic surfaces: (1) proportional latent odds; (2) uniform relative difficulty; and (3) item sensitivity. A method is presented for comparing the relative shapes of two item characteristic curves in two examinee populations who were administered an…
Descriptors: Comparative Analysis, Computer Simulation, Difficulty Level, Item Analysis

Brink, Nicholas E. – Educational and Psychological Measurement, 1972
Study compares the Rasch and the Guttman models of measurement and thus adds to the description of the characteristics of Rasch's logistic model. Such knowledge is of importance in making decisions as to which model and which statistics should be used in evaluations of tests. (Author/CB)
Descriptors: Comparative Analysis, Educational Testing, Error of Measurement, Goodness of Fit
Dinero, Thomas E.; Haertel, Edward – 1976
This paper will discuss the results of a series of computer simulations comparing the Rasch logistic model to a series of models departing to various degrees from its assumption of equal discrimination power for all items. The results have implications for test construction and test scoring, indicating how closely the conventional raw score…
Descriptors: Comparative Analysis, Computer Programs, Goodness of Fit, Individual Differences

Albanese, Mark A.; Forsyth, Robert A. – Educational and Psychological Measurement, 1984
The purpose of this study was to compare the relative robustness of the one-, two-, and modified two-parameter latent trait logistic models for the Iowa Tests of Educational Development. Results suggest that the modified two-parameter model may provide the best representation of the data. (Author/BW)
Descriptors: Achievement Tests, Comparative Analysis, Goodness of Fit, Item Analysis
Choppin, Bruce; And Others – 1982
A detailed description of five latent structure models of achievement measurement is presented. The first project paper, by David L. McArthur, analyzes the history of mental testing to show how conventional item analysis procedures were developed, and how dissatisfaction with them has led to fragmentation. The range of distinct conceptual and…
Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Data Analysis
Reckase, Mark D. – 1978
Five comparisons were made relative to the quality of estimates of ability parameters and item calibrations obtained from the one-parameter and three-parameter logistic models. The results indicate: (1) The three-parameter model fit the test data better in all cases than did the one-parameter model. For simulation data sets, multi-factor data were…
Descriptors: Comparative Analysis, Goodness of Fit, Item Analysis, Mathematical Models

Bock, R. Darrell – Psychometrika, 1972
Descriptors: Ability Identification, Comparative Analysis, Item Analysis, Mathematical Models
Engelen, Ronald J. H.; Jannarone, Robert J. – 1989
The purpose of this paper is to link empirical Bayes methods with two specific topics in item response theory--item/subtest regression, and testing the goodness of fit of the Rasch model--under the assumptions of local independence and sufficiency. It is shown that item/subtest regression results in empirical Bayes estimates only if the Rasch…
Descriptors: Bayesian Statistics, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
Benson, Jeri – 1979
Two methods of item selection were used to select sets of 40 items from a 50-item verbal analogies test, and the resulting item sets were compared for relative efficiency. The BICAL program was used to select the 40 items having the best mean square fit to the one parameter logistic (Rasch) model. The LOGIST program was used to select the 40 items…
Descriptors: Comparative Analysis, Computer Programs, Costs, Efficiency
Hambleton, Ronald K.; Rovinelli, Richard J. – 1986
Four methods for determining the dimensionality of a set of test items were compared: (1) linear factor analysis; (2) residual analysis; (3) nonlinear factor analysis; and (4) Bejar's method. Five artificial test data sets (for 40 items and 1500 examinees) were generated, consistent with the three-parameter logistic model and the assumption of…
Descriptors: Comparative Analysis, Computer Simulation, Correlation, Factor Analysis
Bart, William M.; Airasian, Peter W. – 1976
The question of whether test factor structure is indicative of the test item hierarchy was examined. Data from 1,000 subjects on two sets of five bivalued Law School Admission Test items, which were analyzed with latent trait methods of Bock and Lieberman and of Christoffersson in Psychometrika, were analyzed with an ordering-theoretic method to…
Descriptors: Comparative Analysis, Correlation, Factor Analysis, Factor Structure
Kromrey, Jeffrey D.; Bacon, Tina P. – 1992
A Monte Carlo study was conducted to estimate the small sample standard errors and statistical bias of psychometric statistics commonly used in the analysis of achievement tests. The statistics examined in this research were: (1) the index of item difficulty; (2) the index of item discrimination; (3) the corrected item-total point-biserial…
Descriptors: Achievement Tests, Comparative Analysis, Difficulty Level, Estimation (Mathematics)