ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	18

Source

Journal of Educational…	5
ETS Research Report Series	4
Applied Psychological…	3
Educational Testing Service	3
Educational Measurement:…	1
Educational and Psychological…	1
Journal of Educational and…	1

Author

Moses, Tim	18
Kim, Sooyeon	4
Dorans, Neil	2
Miao, Jing	2
Deng, Weiling	1
Dorans, Neil J.	1
Kim, YoungKoung	1
Klockars, Alan	1
Liu, Jinghua	1
Oh, Hyeonjoo	1
Puhan, Gautam	1
Yoo, Hanwook	1
Yoo, Hanwook Henry	1
Yu, Lei	1
Zhang, Yu-Li	1
von Davier, Alina	1
Publication Type

Journal Articles	15
Reports - Research	12
Reports - Evaluative	5
Reports - Descriptive	1

Education Level

High Schools	1
Higher Education	1
Postsecondary Education	1

Showing 1 to 15 of 18 results

Linking and Comparability across Conditions of Measurement: Established Frameworks and Proposed Updates

Peer reviewed

Moses, Tim – Journal of Educational Measurement, 2022

One result of recent changes in testing is that previously established linking frameworks may not adequately address challenges in current linking situations. Test linking through equating, concordance, vertical scaling or battery scaling may not represent linkings for the scores of tests developed to measure constructs differently for different…

Descriptors: Measures (Individuals), Educational Assessment, Test Construction, Comparative Analysis

Stabilizing Conditional Standard Errors of Measurement in Scale Score Transformations

Peer reviewed

Moses, Tim; Kim, YoungKoung – Journal of Educational Measurement, 2017

The focus of this article is on scale score transformations that can be used to stabilize conditional standard errors of measurement (CSEMs). Three transformations for stabilizing the estimated CSEMs are reviewed, including the traditional arcsine transformation, a recently developed general variance stabilization transformation, and a new method…

Descriptors: Error of Measurement, Scores, Comparative Analysis, Item Response Theory

A Comparison of IRT Proficiency Estimation Methods under Adaptive Multistage Testing

Peer reviewed

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015

This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…

Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy

Alternative Smoothing and Scaling Strategies for Weighted Composite Scores

Peer reviewed

Moses, Tim – Educational and Psychological Measurement, 2014

In this study, smoothing and scaling approaches are compared for estimating subscore-to-composite scaling results involving composites computed as rounded and weighted combinations of subscores. The considered smoothing and scaling approaches included those based on raw data, on smoothing the bivariate distribution of the subscores, on smoothing…

Descriptors: Weighted Scores, Scaling, Data Analysis, Comparative Analysis

Adjoined Piecewise Linear Approximations (APLAs) for Equating: Accuracy Evaluations of a Postsmoothing Equating Method

Peer reviewed

Moses, Tim – Journal of Educational Measurement, 2013

The purpose of this study was to evaluate the use of adjoined and piecewise linear approximations (APLAs) of raw equipercentile equating functions as a postsmoothing equating method. APLAs are less familiar than other postsmoothing equating methods (i.e., cubic splines), but their use has been described in historical equating practices of…

Descriptors: Equated Scores, Accuracy, Simulation, Comparative Analysis

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

Smoothing and Equating Methods Applied to Different Types of Test Score Distributions and Evaluated with Respect to Multiple Equating Criteria. Research Report. ETS RR-11-20

Moses, Tim; Liu, Jinghua – Educational Testing Service, 2011

In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…

Descriptors: Equated Scores, Data Analysis, Scores, Methods

A SAS IML Macro for Loglinear Smoothing

Peer reviewed

Moses, Tim; von Davier, Alina – Applied Psychological Measurement, 2011

Polynomial loglinear models for one-, two-, and higher-way contingency tables have important applications to measurement and assessment. They are essentially regarded as a smoothing technique, which is commonly referred to as loglinear smoothing. A SAS IML (SAS Institute, 2002a) macro was created to implement loglinear smoothing according to…

Descriptors: Statistical Analysis, Computer Software, Algebra, Mathematical Formulas

Comparison of the One- and Bi-Direction Chained Equipercentile Equating

Peer reviewed

Oh, Hyeonjoo; Moses, Tim – Journal of Educational Measurement, 2012

This study investigated differences between two approaches to chained equipercentile (CE) equating (one- and bi-direction CE equating) in nearly equal groups and relatively unequal groups. In one-direction CE equating, the new form is linked to the anchor in one sample of examinees and the anchor is linked to the reference form in the other…

Descriptors: Equated Scores, Statistical Analysis, Comparative Analysis, Differences

A Comparison of Strategies for Estimating Conditional DIF

Peer reviewed

Moses, Tim; Miao, Jing; Dorans, Neil J. – Journal of Educational and Behavioral Statistics, 2010

In this study, the accuracies of four strategies were compared for estimating conditional differential item functioning (DIF), including raw data, logistic regression, log-linear models, and kernel smoothing. Real data simulations were used to evaluate the estimation strategies across six items, DIF and No DIF situations, and four sample size…

Descriptors: Test Bias, Statistical Analysis, Computation, Comparative Analysis

Two Approaches for Using Multiple Anchors in NEAT Equating: A Description and Demonstration

Peer reviewed

Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Applied Psychological Measurement, 2011

Nonequivalent groups with anchor test (NEAT) equating functions that use a single anchor can have accuracy problems when the groups are extremely different and/or when the anchor weakly correlates with the tests being equated. Proposals have been made to address these issues by incorporating more than one anchor into NEAT equating functions. These…

Descriptors: Equated Scores, Tests, Comparative Analysis, Correlation

A Comparison of Statistical Significance Tests for Selecting Equating Functions

Peer reviewed

Moses, Tim – Applied Psychological Measurement, 2009

This study compared the accuracies of nine previously proposed statistical significance tests for selecting identity, linear, and equipercentile equating functions in an equivalent groups equating design. The strategies included likelihood ratio tests for the loglinear models of tests' frequency distributions, regression tests, Kolmogorov-Smirnov…

Descriptors: Statistical Significance, Equated Scores, Comparative Analysis, Tests

A Comparison of Methods for Estimating Conditional Item Score Differences in Differential Item Functioning (DIF) Assessments. Research Report. ETS RR-10-15

Moses, Tim; Miao, Jing; Dorans, Neil – Educational Testing Service, 2010

This study compared the accuracies of four differential item functioning (DIF) estimation methods, where each method makes use of only one of the following: raw data, logistic regression, loglinear models, or kernel smoothing. The major focus was on the estimation strategies' potential for estimating score-level, conditional DIF. A secondary focus…

Descriptors: Test Bias, Statistical Analysis, Computation, Scores

Descriptor

Comparative Analysis	18
Equated Scores	9
Statistical Analysis	8
Error of Measurement	6
Accuracy	5
Computation	5
Scores	5
Item Response Theory	4
Regression (Statistics)	4
Data Analysis	3
Differences	3
Difficulty Level	3
Sample Size	3
Scaling	3
Statistical Bias	3
Test Bias	3
Adaptive Testing	2
Bayesian Statistics	2
Correlation	2
Evaluation Methods	2
Mathematics Tests	2
Models	2
Reading Tests	2
Simulation	2
Statistical Significance	2