ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	7

Descriptor

Models	9
Item Response Theory	8
Computation	4
Graphs	3
Mathematics Tests	3
Test Items	3
Comparative Analysis	2
Data Collection	2
English	2
Evaluation Methods	2
Goodness of Fit	2
Language Tests	2
Measurement	2
Psychometrics	2
Reading Tests	2
Scoring	2
Tests	2
Adaptive Testing	1
Bias	1
Change	1
Computer Assisted Testing	1
Context Effect	1
Correlation	1
Data Analysis	1
Error of Measurement	1

Source

Educational Testing Service

Author

Rijmen, Frank	3
von Davier, Matthias	3
Carstensen, Claus H.	1
Davey, Tim	1
DeCarlo, Lawrence T.	1
Herbert, Erin	1
Li, Shuhong	1
Li, Yanmei	1
Rizavi, Saba	1
Sinharay, Sandip	1
Wang, Lin	1
Way, Walter D.	1
Xu, Xueli	1
von Davier, Alina A.	1

Publication Type

Reports - Research	6
Reports - Evaluative	3

Education Level

Grade 10	1
Grade 4	1
Grade 8	1
Grade 9	1
Secondary Education	1

Location

Germany

Assessments and Surveys

Program for International…

Showing all 9 results

Application of a General Polytomous Testlet Model to the Reading Section of a Large-Scale English Language Assessment. Research Report. ETS RR-10-21

Li, Yanmei; Li, Shuhong; Wang, Lin – Educational Testing Service, 2010

Many standardized educational tests include groups of items based on a common stimulus, known as "testlets". Standard unidimensional item response theory (IRT) models are commonly used to model examinees' responses to testlet items. However, it is known that local dependence among testlet items can lead to biased item parameter estimates…

Descriptors: English, Language Tests, Reading Tests, Item Response Theory

Studies of a Latent Class Signal Detection Model for Constructed Response Scoring II: Incomplete and Hierarchical Designs. Research Report. ETS RR-10-08

DeCarlo, Lawrence T. – Educational Testing Service, 2010

A basic consideration in large-scale assessments that use constructed response (CR) items, such as essays, is how to allocate the essays to the raters that score them. Designs that are used in practice are incomplete, in that each essay is scored by only a subset of the raters, and also unbalanced, in that the number of essays scored by each rater…

Descriptors: Test Items, Responses, Essay Tests, Scoring

Measuring Multidimensional Latent Growth. Research Report. ETS RR-10-24

Rijmen, Frank – Educational Testing Service, 2010

As is the case for any statistical model, a multidimensional latent growth model comes with certain requirements with respect to the data collection design. In order to measure growth, repeated measurements of the same set of individuals are required. Furthermore, the data collection design should be specified such that no individual is given the…

Descriptors: Tests, Statistical Analysis, Models, Measurement

Efficient Full Information Maximum Likelihood Estimation for Multidimensional IRT Models. Research Report. ETS RR-09-03

Rijmen, Frank – Educational Testing Service, 2009

Maximum marginal likelihood estimation of multidimensional item response theory (IRT) models has been hampered by the calculation of the multidimensional integral over the ability distribution. However, the researcher often has a specific hypothesis about the conditional (in)dependence relations among the latent variables. Exploiting these…

Descriptors: Maximum Likelihood Statistics, Item Response Theory, Computation, Models

Three Multidimensional Models for Testlet-Based Tests: Formal Relations and an Empirical Comparison. Research Report. ETS RR-09-37

Rijmen, Frank – Educational Testing Service, 2009

Three multidimensional item response theory (IRT) models for testlet-based tests are described. In the bifactor model (Gibbons & Hedeker, 1992), each item measures a general dimension in addition to a testlet-specific dimension. The testlet model (Bradlow, Wainer, & Wang, 1999) is a bifactor model in which the loadings on the specific dimensions…

Descriptors: Item Response Theory, Models, Graphs, Comparative Analysis

Stochastic Approximation Methods for Latent Regression Item Response Models. Research Report. ETS RR-09-09

von Davier, Matthias; Sinharay, Sandip – Educational Testing Service, 2009

This paper presents an application of a stochastic approximation EM-algorithm using a Metropolis-Hastings sampler to estimate the parameters of an item response latent regression model. Latent regression models are extensions of item response theory (IRT) to a 2-level latent variable model in which covariates serve as predictors of the…

Descriptors: Item Response Theory, Regression (Statistics), Models, Methods

Using the General Diagnostic Model to Measure Learning and Change in a Longitudinal Large-Scale Assessment. Research Report. ETS RR-09-28

von Davier, Matthias; Xu, Xueli; Carstensen, Claus H. – Educational Testing Service, 2009

A general diagnostic model was used to specify and compare two multidimensional item-response-theory (MIRT) models for longitudinal data: (a) a model that handles repeated measurements as multiple, correlated variables over time (Andersen, 1985) and (b) a model that assumes one common variable over time and additional orthogonal variables that…

Descriptors: Models, Item Response Theory, Longitudinal Studies, Measurement

A Unified Approach to IRT Scale Linking and Scale Transformations. Research Report. RR-04-09

von Davier, Matthias; von Davier, Alina A. – Educational Testing Service, 2004

This paper examines item response theory (IRT) scale transformations and IRT scale linking methods used in the Non-Equivalent Groups with Anchor Test (NEAT) design to equate two tests, X and Y. It proposes a unifying approach to the commonly used IRT linking methods: mean-mean, mean-var linking, concurrent calibration, Stocking and Lord and…

Descriptors: Measures (Individuals), Item Response Theory, Item Analysis, Models

Tolerable Variation in Item Parameter Estimates for Linear and Adaptive Computer-Based Testing. Research Report No. 04-28

Rizavi, Saba; Way, Walter D.; Davey, Tim; Herbert, Erin – Educational Testing Service, 2004

Item parameter estimates vary for a variety of reasons, including estimation error, characteristics of the examinee samples, and context effects (e.g., item location effects, section location effects, etc.). Although we expect variation based on theory, there is reason to believe that observed variation in item parameter estimates exceeds what…

Descriptors: Adaptive Testing, Test Items, Computation, Context Effect