Publication Date
  In 2025: 0
  Since 2024: 1
  Since 2021 (last 5 years): 3
  Since 2016 (last 10 years): 11
  Since 2006 (last 20 years): 29
Descriptor
  Computation: 37
  Item Response Theory: 37
  Test Items: 14
  Maximum Likelihood Statistics: 12
  Comparative Analysis: 9
  Models: 9
  Scores: 9
  Accuracy: 8
  Statistical Analysis: 8
  Ability: 7
  Error of Measurement: 7
Source
  ETS Research Report Series: 37
Author
  Haberman, Shelby J.: 8
  Zhang, Jinming: 5
  Antal, Tamás: 3
  Ali, Usama S.: 2
  Dorans, Neil J.: 2
  Guo, Hongwen: 2
  Kim, Sooyeon: 2
  Lee, Yi-Hsuan: 2
  Moses, Tim: 2
  Qian, Jiahe: 2
  von Davier, Alina A.: 2
Publication Type
  Journal Articles: 37
  Reports - Research: 35
  Numerical/Quantitative Data: 2
  Reports - Descriptive: 2
  Speeches/Meeting Papers: 2
  Information Analyses: 1
Education Level
  Elementary Education: 2
  Early Childhood Education: 1
  Grade 8: 1
  Higher Education: 1
  Junior High Schools: 1
  Middle Schools: 1
  Postsecondary Education: 1
  Primary Education: 1
  Secondary Education: 1
Assessments and Surveys
  National Assessment of…: 4
  Praxis Series: 3
  Early Childhood Longitudinal…: 1
  Graduate Record Examinations: 1
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023
Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where the responses or scores for an item are missing for all test takers. In this paper, we tackled the problem of imputing scores for tests in which an item's responses are missing for all test takers…
Descriptors: Scores, Test Items, Accuracy, Psychometrics
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling
Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022
When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…
Descriptors: Item Response Theory, Test Construction, Scoring, Testing
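The snippet above breaks off at "the proportion of…"; a common way to compare Time A scores with Time B rescores is the proportion of exact (and adjacent) score agreement. A minimal Python sketch, assuming two aligned score vectors (the data and function name are hypothetical, not from the report):

    import numpy as np

    def rescore_agreement(time_a, time_b):
        """Proportion of exact and adjacent (within one point) agreement
        between original Time A scores and Time B rescores."""
        a, b = np.asarray(time_a), np.asarray(time_b)
        return np.mean(a == b), np.mean(np.abs(a - b) <= 1)

    # Hypothetical rescored sample of five papers
    exact, adjacent = rescore_agreement([3, 2, 4, 3, 1], [3, 3, 4, 2, 1])
    print(f"exact: {exact:.2f}, adjacent: {adjacent:.2f}")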
Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019
We derive formulas for the differential item functioning (DIF) measures that two routinely used DIF statistics are designed to estimate. The DIF measures that match on observed scores are compared to DIF measures based on an unobserved ability (theta or true score) for items that are described by either the one-parameter logistic (1PL) or…
Descriptors: Scores, Test Bias, Statistical Analysis, Item Response Theory
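The visible snippet does not name the two DIF statistics; the Mantel-Haenszel procedure is one routinely used observed-score DIF statistic, and a sketch of it shows what "matching on observed scores" means in practice. The per-stratum counts below are hypothetical:

    import numpy as np

    def mh_dif(right_ref, wrong_ref, right_foc, wrong_foc):
        """Mantel-Haenszel common odds ratio across matched score strata,
        plus the ETS delta-scale effect size (-2.35 * ln alpha)."""
        r_r, w_r = np.asarray(right_ref, float), np.asarray(wrong_ref, float)
        r_f, w_f = np.asarray(right_foc, float), np.asarray(wrong_foc, float)
        n = r_r + w_r + r_f + w_f                  # stratum sizes
        alpha = np.sum(r_r * w_f / n) / np.sum(w_r * r_f / n)
        return alpha, -2.35 * np.log(alpha)

    # One row per matched total-score level (hypothetical counts)
    alpha, delta = mh_dif([40, 60], [10, 5], [30, 50], [15, 10])
    print(f"MH odds ratio: {alpha:.2f}, MH D-DIF: {delta:.2f}")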
Fu, Jianbin – ETS Research Report Series, 2019
A maximum marginal likelihood estimation procedure with an expectation-maximization (EM) algorithm has been developed for estimating multigroup or mixture multidimensional item response theory models using the generalized partial credit, graded response, and 3-parameter logistic functions. The procedure includes the estimation of item…
Descriptors: Maximum Likelihood Statistics, Mathematics, Item Response Theory, Expectation
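The 3-parameter logistic function named in the abstract has a closed form, and marginal likelihood estimation integrates it over a quadrature grid on the latent trait. A unidimensional sketch of that marginal likelihood for one response pattern (all parameter values hypothetical):

    import numpy as np

    def p_3pl(theta, a, b, c):
        """3PL probability of a correct response."""
        return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

    nodes = np.linspace(-4, 4, 41)                 # quadrature grid
    weights = np.exp(-0.5 * nodes**2)              # normal prior, unnormalized
    weights /= weights.sum()
    a, b, c = np.array([1.2, 0.8]), np.array([-0.5, 0.7]), np.array([0.2, 0.2])
    x = np.array([1, 0])                           # one examinee's item scores
    p = p_3pl(nodes[:, None], a, b, c)             # grid x items
    lik = np.prod(np.where(x == 1, p, 1 - p), axis=1)
    print("marginal likelihood:", np.sum(weights * lik))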
Jewsbury, Paul A. – ETS Research Report Series, 2019
When an assessment undergoes changes to the administration or instrument, bridge studies are typically used to try to ensure comparability of scores before and after the change. Among the most common and powerful is the common population linking design, which uses a linear transformation to link scores to the metric of the original…
Descriptors: Evaluation Research, Scores, Error Patterns, Error of Measurement
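The linear transformation in a common population linking design is typically a mean-sigma line: match the mean and standard deviation of the new-form scores to the old-form metric. A minimal sketch with hypothetical score samples (the report's specific design may differ):

    import numpy as np

    def mean_sigma_link(new_scores, old_scores):
        """Linear transform mapping new-form scores onto the old-form
        metric, assuming both samples represent a common population."""
        slope = np.std(old_scores, ddof=1) / np.std(new_scores, ddof=1)
        intercept = np.mean(old_scores) - slope * np.mean(new_scores)
        return lambda s: slope * np.asarray(s) + intercept

    link = mean_sigma_link([10, 12, 15, 9], [11, 14, 16, 10])
    print(link([10, 12, 14]))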
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2018
Educational assessment data are often collected from a set of test centers across various geographic regions, and therefore the data samples contain clusters. Such cluster-based data may result in clustering effects in variance estimation. However, in many grouped jackknife variance estimation applications, jackknife groups are often formed by a…
Descriptors: Item Response Theory, Scaling, Equated Scores, Cluster Grouping
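A grouped (delete-one-group) jackknife drops one group at a time, recomputes the statistic, and scales the spread of the replicates. The sketch below uses the sample mean as the statistic; if the groups are formed without respecting natural clusters such as test centers, the clustering effect the authors describe can bias this estimate:

    import numpy as np

    def grouped_jackknife_var(values, groups, stat=np.mean):
        """Delete-one-group jackknife variance of a statistic."""
        values, groups = np.asarray(values, float), np.asarray(groups)
        labels = np.unique(groups)
        g = len(labels)
        reps = np.array([stat(values[groups != lab]) for lab in labels])
        return (g - 1) / g * np.sum((reps - reps.mean()) ** 2)

    rng = np.random.default_rng(0)
    x = rng.normal(size=100)
    print(grouped_jackknife_var(x, groups=np.repeat(np.arange(10), 10)))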
von Davier, Matthias – ETS Research Report Series, 2016
This report presents results on a parallel implementation of the expectation-maximization (EM) algorithm for multidimensional latent variable models. The developments presented here are based on code that parallelizes both the E step and the M step (a parallel-E, parallel-M algorithm). Examples presented in this report include item response…
Descriptors: Psychometrics, Mathematics, Models, Statistical Analysis
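The E step is embarrassingly parallel across respondents, which is what makes a parallel-E implementation attractive. A Rasch-style sketch (not the report's code) that splits the posterior computation over worker processes:

    import numpy as np
    from concurrent.futures import ProcessPoolExecutor

    def e_step_chunk(args):
        """Posterior weights over a theta grid (flat prior) for one chunk
        of respondents."""
        responses, nodes, item_b = args
        p = 1.0 / (1.0 + np.exp(-(nodes[:, None] - item_b)))   # grid x items
        lik = np.prod(np.where(responses[:, None, :] == 1, p, 1 - p), axis=2)
        return lik / lik.sum(axis=1, keepdims=True)            # persons x grid

    if __name__ == "__main__":
        rng = np.random.default_rng(1)
        responses = rng.integers(0, 2, size=(1000, 20))
        nodes = np.linspace(-4, 4, 21)
        item_b = rng.normal(size=20)
        chunks = [(c, nodes, item_b) for c in np.array_split(responses, 4)]
        with ProcessPoolExecutor(max_workers=4) as pool:
            posteriors = np.vstack(list(pool.map(e_step_chunk, chunks)))
        print(posteriors.shape)   # (1000, 21)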
van Rijn, Peter W.; Ali, Usama S. – ETS Research Report Series, 2018
A computer program was developed to estimate speed-accuracy response models for dichotomous items. This report describes how the models are estimated and how to specify data and input files. An example using data from a listening section of an international language test is described to illustrate the modeling approach and features of the computer…
Descriptors: Computer Software, Computation, Reaction Time, Timed Tests
Guo, Hongwen – ETS Research Report Series, 2017
Data collected from online learning and tutoring systems for individual students show strong autocorrelation or dependence because of content connections, knowledge-based dependency, or persistence of learning behavior. When the response data show little dependence or negative autocorrelations for individual students, it is suspected that…
Descriptors: Data Collection, Electronic Learning, Intelligent Tutoring Systems, Information Utilization
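Lag-1 autocorrelation of a student's item-by-item correctness sequence is the basic diagnostic behind this kind of dependence check. A minimal sketch with a hypothetical response string:

    import numpy as np

    def lag1_autocorr(x):
        """Lag-1 autocorrelation of one student's response sequence."""
        d = np.asarray(x, float) - np.mean(x)
        return np.sum(d[:-1] * d[1:]) / np.sum(d * d)

    print(lag1_autocorr([1, 1, 1, 0, 0, 1, 1, 0, 1, 1]))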
Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2016
The purpose of this study is to evaluate the extent to which item response theory (IRT) proficiency estimation methods are robust to the presence of aberrant responses under the GRE® General Test multistage adaptive testing (MST) design. To that end, a wide range of atypical response behaviors affecting as much as 10% of the test items…
Descriptors: Item Response Theory, Computation, Robustness (Statistics), Response Style (Tests)
Ali, Usama S.; Walker, Michael E. – ETS Research Report Series, 2014
Two methods are currently in use at Educational Testing Service (ETS) for equating observed item difficulty statistics. The first method involves the linear equating of item statistics in an observed sample to reference statistics on the same items. The second method, the item response curve (IRC) method, involves the summation of conditional…
Descriptors: Difficulty Level, Test Items, Equated Scores, Causal Models
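The first method, linear equating of item statistics to reference values on common items, amounts to a mean-sigma transform estimated on the items both samples share. A sketch with hypothetical item difficulty values (the IRC method's conditional summation is not shown):

    import numpy as np

    def equate_item_stats(observed, reference):
        """Fit a line mapping observed-sample item statistics to reference
        statistics on the same items; apply it to the remaining items."""
        obs, ref = np.asarray(observed, float), np.asarray(reference, float)
        slope = np.std(ref, ddof=1) / np.std(obs, ddof=1)
        intercept = ref.mean() - slope * obs.mean()
        return lambda x: slope * np.asarray(x) + intercept

    # Statistics for items appearing in both the observed and reference sets
    transform = equate_item_stats([11.2, 12.8, 9.5], [11.0, 13.1, 9.2])
    print(transform([10.4, 12.0]))   # equated values for new items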
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015
The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…
Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement
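Among the estimators typically compared in such studies is the maximum likelihood estimate of proficiency, which under a 2PL model can be found by Fisher scoring. A minimal sketch (item parameters hypothetical; all-correct or all-incorrect patterns have no finite MLE):

    import numpy as np

    def theta_mle(x, a, b, iters=25):
        """Fisher-scoring ML estimate of proficiency under a 2PL model."""
        x, a, b = (np.asarray(v, float) for v in (x, a, b))
        theta = 0.0
        for _ in range(iters):
            p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
            grad = np.sum(a * (x - p))             # score function
            info = np.sum(a**2 * p * (1 - p))      # Fisher information
            theta += grad / info
        return theta

    print(theta_mle(x=[1, 1, 0, 1], a=[1.0, 1.2, 0.8, 1.5],
                    b=[-0.5, 0.0, 0.3, 1.0]))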
von Davier, Alina A.; Chen, Haiwen – ETS Research Report Series, 2013
In the framework of the observed-score equating methods for the nonequivalent groups with anchor test design, there are 3 fundamentally different ways of using the information provided by the anchor scores to equate the scores of a new form to those of an old form. One method uses the anchor scores as a conditioning variable, such as the Tucker…
Descriptors: Equated Scores, Item Response Theory, True Scores, Methods
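The snippet names the conditioning (Tucker) approach; another of the three ways is the chained approach, which links the new form to the anchor in the new group and the anchor to the old form in the old group. A chained linear sketch with hypothetical score vectors:

    import numpy as np

    def linear_fn(from_scores, to_scores):
        """Mean-sigma line from one score scale to another."""
        slope = np.std(to_scores, ddof=1) / np.std(from_scores, ddof=1)
        intercept = np.mean(to_scores) - slope * np.mean(from_scores)
        return lambda s: slope * np.asarray(s) + intercept

    # New group takes form X plus anchor V; old group took anchor V plus form Y
    x_new, v_new = [20, 25, 30, 22], [10, 12, 15, 11]
    v_old, y_old = [11, 13, 14, 10], [24, 28, 31, 23]
    x_to_v = linear_fn(x_new, v_new)
    v_to_y = linear_fn(v_old, y_old)
    print(v_to_y(x_to_v([26])))      # chained X -> V -> Y conversion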
Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013
Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as variability across examinee samples and test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…
Descriptors: Item Response Theory, Test Items, Sampling, True Scores