Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 7 |
Descriptor
| Models | 9 |
| Item Response Theory | 8 |
| Computation | 4 |
| Graphs | 3 |
| Mathematics Tests | 3 |
| Test Items | 3 |
| Comparative Analysis | 2 |
| Data Collection | 2 |
| English | 2 |
| Evaluation Methods | 2 |
| Goodness of Fit | 2 |
| More ▼ | |
Source
| Educational Testing Service | 9 |
Author
| Rijmen, Frank | 3 |
| von Davier, Matthias | 3 |
| Carstensen, Claus H. | 1 |
| Davey, Tim | 1 |
| DeCarlo, Lawrence T. | 1 |
| Herbert, Erin | 1 |
| Li, Shuhong | 1 |
| Li, Yanmei | 1 |
| Rizavi, Saba | 1 |
| Sinharay, Sandip | 1 |
| Wang, Lin | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 6 |
| Reports - Evaluative | 3 |
Education Level
| Grade 10 | 1 |
| Grade 4 | 1 |
| Grade 8 | 1 |
| Grade 9 | 1 |
| Secondary Education | 1 |
Audience
Location
| Germany | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Program for International… | 1 |
What Works Clearinghouse Rating
Li, Yanmei; Li, Shuhong; Wang, Lin – Educational Testing Service, 2010
Many standardized educational tests include groups of items based on a common stimulus, known as "testlets". Standard unidimensional item response theory (IRT) models are commonly used to model examinees' responses to testlet items. However, it is known that local dependence among testlet items can lead to biased item parameter estimates…
Descriptors: English, Language Tests, Reading Tests, Item Response Theory
DeCarlo, Lawrence T. – Educational Testing Service, 2010
A basic consideration in large-scale assessments that use constructed response (CR) items, such as essays, is how to allocate the essays to the raters that score them. Designs that are used in practice are incomplete, in that each essay is scored by only a subset of the raters, and also unbalanced, in that the number of essays scored by each rater…
Descriptors: Test Items, Responses, Essay Tests, Scoring
Rijmen, Frank – Educational Testing Service, 2010
As is the case for any statistical model, a multidimensional latent growth model comes with certain requirements with respect to the data collection design. In order to measure growth, repeated measurements of the same set of individuals are required. Furthermore, the data collection design should be specified such that no individual is given the…
Descriptors: Tests, Statistical Analysis, Models, Measurement
Rijmen, Frank – Educational Testing Service, 2009
Maximum marginal likelihood estimation of multidimensional item response theory (IRT) models has been hampered by the calculation of the multidimensional integral over the ability distribution. However, the researcher often has a specific hypothesis about the conditional (in)dependence relations among the latent variables. Exploiting these…
Descriptors: Maximum Likelihood Statistics, Item Response Theory, Computation, Models
Rijmen, Frank – Educational Testing Service, 2009
Three multidimensional item response theory (IRT) models for testlet-based tests are described. In the bifactor model (Gibbons & Hedeker, 1992), each item measures a general dimension in addition to a testlet-specific dimension. The testlet model (Bradlow, Wainer, & Wang, 1999) is a bifactor model in which the loadings on the specific dimensions…
Descriptors: Item Response Theory, Models, Graphs, Comparative Analysis
von Davier, Matthias; Sinharay, Sandip – Educational Testing Service, 2009
This paper presents an application of a stochastic approximation EM-algorithm using a Metropolis-Hastings sampler to estimate the parameters of an item response latent regression model. Latent regression models are extensions of item response theory (IRT) to a 2-level latent variable model in which covariates serve as predictors of the…
Descriptors: Item Response Theory, Regression (Statistics), Models, Methods
von Davier, Matthias; Xu, Xueli; Carstensen, Claus H. – Educational Testing Service, 2009
A general diagnostic model was used to specify and compare two multidimensional item-response-theory (MIRT) models for longitudinal data: (a) a model that handles repeated measurements as multiple, correlated variables over time (Andersen, 1985) and (b) a model that assumes one common variable over time and additional orthogonal variables that…
Descriptors: Models, Item Response Theory, Longitudinal Studies, Measurement
von Davier, Matthias; von Davier, Alina A. – Educational Testing Service, 2004
This paper examines item response theory (IRT) scale transformations and IRT scale linking methods used in the Non-Equivalent Groups with Anchor Test (NEAT) design to equate two tests, X and Y. It proposes a unifying approach to the commonly used IRT linking methods: mean-mean, mean-var linking, concurrent calibration, Stocking and Lord and…
Descriptors: Measures (Individuals), Item Response Theory, Item Analysis, Models
Rizavi, Saba; Way, Walter D.; Davey, Tim; Herbert, Erin – Educational Testing Service, 2004
Item parameter estimates vary for a variety of reasons, including estimation error, characteristics of the examinee samples, and context effects (e.g., item location effects, section location effects, etc.). Although we expect variation based on theory, there is reason to believe that observed variation in item parameter estimates exceeds what…
Descriptors: Adaptive Testing, Test Items, Computation, Context Effect


