ERIC - Search Results

Showing all 5 results

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

Evaluating Subpopulation Invariance of Linking Functions to Determine the Anchor Composition for a Mixed-Format Test. Research Report. ETS RR-09-36

Peer reviewed

Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2009

We examined the appropriateness of the anchor composition in a mixed-format test, which includes both multiple-choice (MC) and constructed-response (CR) items, using subpopulation invariance indices. We derived linking functions in the nonequivalent groups with anchor test (NEAT) design using two types of anchor sets: (a) MC only and (b) a mix of…

Descriptors: Test Format, Equated Scores, Test Items, Multiple Choice Tests

Small-Sample Equating Using a Synthetic Linking Function

Peer reviewed

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Journal of Educational Measurement, 2008

This study addressed the sampling error and linking bias that occur with small samples in a nonequivalent groups anchor test design. We proposed a linking method called the synthetic function, which is a weighted average of the identity function and a traditional equating function (in this case, the chained linear equating function). Specifically,…

Descriptors: Equated Scores, Sample Size, Test Reliability, Comparative Analysis
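
The abstract above describes the synthetic function as a weighted average of the identity function and a traditional equating function. A minimal sketch of that idea follows, under stated assumptions: the names `synthetic_link` and `make_linear_equating`, the convention that the weight `w` sits on the equating function, and the slope and intercept values are all hypothetical illustrations and are not taken from the study. A simple linear function stands in for the chained linear equating function.

```python
from typing import Callable


def make_linear_equating(slope: float, intercept: float) -> Callable[[float], float]:
    """Return a linear score conversion e(x) = slope * x + intercept.

    Hypothetical stand-in for a chained linear equating function;
    the coefficients are illustrative, not estimates from any data.
    """
    def equate(x: float) -> float:
        return slope * x + intercept
    return equate


def synthetic_link(x: float, equate: Callable[[float], float], w: float) -> float:
    """Weighted average of an equating function and the identity function.

    Assumed convention: w is the weight on the equating function,
    so w = 0 returns the raw score and w = 1 returns the equated score.
    """
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    return w * equate(x) + (1.0 - w) * x


# Illustrative coefficients only.
equate = make_linear_equating(slope=1.5, intercept=-2.0)
raw_score = 50.0
for weight in (0.0, 0.5, 1.0):
    print(weight, synthetic_link(raw_score, equate, weight))
```

With `w = 0` the sketch leaves the raw score unchanged (the identity function), and with `w = 1` it reduces to the stand-in equating function; intermediate weights blend the two.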

Comparisons among Designs for Equating Constructed-Response Tests. Research Report. ETS RR-08-53

Peer reviewed

Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – ETS Research Report Series, 2008

This study examined variations of a nonequivalent groups equating design used with constructed-response (CR) tests to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, the study investigated the use of anchor CR item rescoring in the context of classical…

Descriptors: Equated Scores, Comparative Analysis, Test Format, Responses

Evaluating the Comparability of Paper-and-Pencil and Computerized Versions of a Large-Scale Certification Test. Research Report. ETS RR-05-21

Peer reviewed

Puhan, Gautam; Boughton, Keith A.; Kim, Sooyeon – ETS Research Report Series, 2005

The study evaluated the comparability of two versions of a teacher certification test: a paper-and-pencil test (PPT) and computer-based test (CBT). Standardized mean difference (SMD) and differential item functioning (DIF) analyses were used as measures of comparability at the test and item levels, respectively. Results indicated that effect sizes…

Descriptors: Comparative Analysis, Test Items, Statistical Analysis, Teacher Certification