ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	8

Descriptor

Simulation	8
Test Content	8
Comparative Analysis	4
Computer Assisted Testing	3
Item Response Theory	3
Test Items	3
Test Length	3
Accuracy	2
Classification	2
Correlation	2
Error of Measurement	2
Mathematics Tests	2
Methods	2
Models	2
Psychometrics	2
Reading Tests	2
Scores	2
Statistical Analysis	2
Test Construction	2
Test Format	2
Achievement Tests	1
Adaptive Testing	1
Certification	1
Clinical Diagnosis	1
Computation	1
More ▼

Source

Applied Psychological…	2
Journal of Educational…	2
Applied Measurement in…	1
ETS Research Report Series	1
Large-scale Assessments in…	1
Pearson	1

Publication Type

Journal Articles	7
Reports - Evaluative	4
Reports - Research	4
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Iowa Tests of Basic Skills	1
Program for International…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Does Maximizing Information at the Cut Score Always Maximize Classification Accuracy and Consistency?

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016

A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…

Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification

On the Use of Rotated Context Questionnaires in Conjunction with Multilevel Item Response Models

Peer reviewed

Direct link

Adams, Raymond J.; Lietz, Petra; Berezner, Alla – Large-scale Assessments in Education, 2013

Background: While rotated test booklets have been employed in large-scale assessments to increase the content coverage of the assessments, rotation has not yet been applied to the context questionnaires administered to respondents. Methods: This paper describes the development of a methodology that uses rotated context questionnaires in…

Descriptors: Questionnaires, Item Response Theory, Foreign Countries, Achievement Tests

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Evaluation of Two New Smoothing Methods in Equating: The Cubic B-Spline Presmoothing Method and the Direct Presmoothing Method

Peer reviewed

Direct link

Cui, Zhongmin; Kolen, Michael J. – Journal of Educational Measurement, 2009

This article considers two new smoothing methods in equipercentile equating, the cubic B-spline presmoothing method and the direct presmoothing method. Using a simulation study, these two methods are compared with established methods, the beta-4 method, the polynomial loglinear method, and the cubic spline postsmoothing method, under three sample…

Descriptors: Equated Scores, Methods, Sample Size, Test Content

A Comparison of Content-Balancing Procedures for Estimating Multiple Clinical Domains in Computerized Adaptive Testing: Relative Precision, Validity, and Detection of Persons with Misfitting Responses

Peer reviewed

Direct link

Riley, Barth B.; Dennis, Michael L.; Conrad, Kendon J. – Applied Psychological Measurement, 2010

This simulation study sought to compare four different computerized adaptive testing (CAT) content-balancing procedures designed for use in a multidimensional assessment with respect to measurement precision, symptom severity classification, validity of clinical diagnostic recommendations, and sensitivity to atypical responding. The four…

Descriptors: Simulation, Computer Assisted Testing, Adaptive Testing, Comparative Analysis

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Direct link

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Comparison of Parametric and Nonparametric Bootstrap Methods for Estimating Random Error in Equipercentile Equating

Peer reviewed

Direct link

Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008

This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…

Descriptors: Test Length, Test Content, Simulation, Computation

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed
PDF on ERIC

Download full text

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

Cui, Zhongmin	2
Kolen, Michael J.	2
Adams, Raymond J.	1
Babcock, Ben	1
Berezner, Alla	1
Chien, Yuehmei	1
Conrad, Kendon J.	1
Dennis, Michael L.	1
Lietz, Petra	1
Meyers, Jason L.	1
Miller, G. Edward	1
Patsula, Liane	1
Riley, Barth B.	1
Rizavi, Saba	1
Rotou, Ourania	1
Shin, Chingwei David	1
Steffen, Manfred	1
Way, Walter D.	1
Way, Walter Denny	1
Wyse, Adam E.	1
More ▼