Peer reviewed
Whittaker, Tiffany A.; Chang, Wanchen; Dodd, Barbara G. – Applied Psychological Measurement, 2012
When tests consist of multiple-choice and constructed-response items, researchers are confronted with the question of which item response theory (IRT) model combination will appropriately represent the data collected from these mixed-format tests. This simulation study examined the performance of six model selection criteria, including the…
Descriptors: Item Response Theory, Models, Selection, Criteria
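As a point of reference for the kind of comparison this entry describes, the sketch below shows how information-based selection criteria such as AIC and BIC are typically computed from a fitted IRT model's log-likelihood. The function, inputs, and numbers are illustrative assumptions, not values from the study.

```python
import math

def information_criteria(log_likelihood, n_params, n_examinees):
    """Information-based model-selection criteria for a fitted IRT model."""
    aic = -2.0 * log_likelihood + 2.0 * n_params
    bic = -2.0 * log_likelihood + n_params * math.log(n_examinees)
    return {"AIC": aic, "BIC": bic}

# Hypothetical fits of two candidate model combinations to the same
# mixed-format data set (log-likelihoods and parameter counts are made up).
fit_combo_a = information_criteria(-10450.3, n_params=150, n_examinees=2000)
fit_combo_b = information_criteria(-10462.9, n_params=150, n_examinees=2000)
print(fit_combo_a, fit_combo_b)
```

Lower values favor a model; the simulation in the article examines how reliably criteria of this kind identify the combination that generated the data.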
Peer reviewed
Wyse, Adam E. – Applied Psychological Measurement, 2011
In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…
Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics
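One standard way to quantify expected classification accuracy when forms differ slightly in precision is a normal-approximation (Rudner-style) index based on the conditional standard error of measurement. The snippet below is a minimal sketch under that assumption, with invented values; it is not drawn from the article.

```python
from statistics import NormalDist

def expected_accuracy(theta, se, cut):
    """Probability that an examinee with true ability `theta` and conditional
    standard error `se` is classified on the correct side of the cut score."""
    p_below = NormalDist().cdf((cut - theta) / se)
    return p_below if theta < cut else 1.0 - p_below

# Hypothetical examinee near the cut, measured by two forms that differ
# slightly in precision at that point on the scale.
print(expected_accuracy(theta=0.30, se=0.25, cut=0.0))  # more precise form
print(expected_accuracy(theta=0.30, se=0.40, cut=0.0))  # less precise form
```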
Peer reviewed
Belov, Dmitry I.; Armstrong, Ronald D. – Applied Psychological Measurement, 2008
This article presents an application of Monte Carlo methods for developing and assembling multistage adaptive tests (MSTs). A major advantage of the Monte Carlo assembly over other approaches (e.g., integer programming or enumerative heuristics) is that it provides a uniform sampling from all MSTs (or MST paths) available from a given item pool.…
Descriptors: Monte Carlo Methods, Adaptive Testing, Sampling, Item Response Theory
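The uniform-sampling idea behind Monte Carlo assembly can be illustrated with a toy rejection-sampling routine: draw candidate modules at random from the pool and keep the first one that satisfies the constraints. The 2PL pool, module size, and information target below are invented for illustration and far simpler than an operational MST blueprint.

```python
import math
import random

def item_information(a, b, theta):
    """Fisher information of a 2PL item at ability `theta`."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def sample_module(pool, size, theta_target, min_info, n_tries=10000):
    """Rejection sampling: draw `size`-item modules uniformly at random from
    `pool` (a list of (a, b) tuples) until one meets the information target."""
    for _ in range(n_tries):
        module = random.sample(pool, size)
        if sum(item_information(a, b, theta_target) for a, b in module) >= min_info:
            return module
    raise RuntimeError("no feasible module found; relax the constraints")

random.seed(0)
pool = [(random.uniform(0.8, 2.0), random.uniform(-2.0, 2.0)) for _ in range(300)]
routing_module = sample_module(pool, size=10, theta_target=0.0, min_info=4.0)
```

Because every candidate module is proposed with equal probability, the accepted modules form a uniform sample from the feasible set, which is the property the abstract highlights as the method's advantage over integer programming or enumerative heuristics.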
Peer reviewed
Dorans, Neil J.; Liu, Jinghua; Hammond, Shelby – Applied Psychological Measurement, 2008
This exploratory study was built on research spanning three decades. Petersen, Marco, and Stewart (1982) conducted a major empirical investigation of the efficacy of different equating methods. The studies reported in Dorans (1990) examined how different equating methods performed across samples selected in different ways. Recent population…
Descriptors: Test Format, Equated Scores, Sampling, Evaluation Methods
Peer reviewed
von Davier, Alina A.; Wilson, Christine – Applied Psychological Measurement, 2008
Dorans and Holland (2000) and von Davier, Holland, and Thayer (2003) introduced measures of the degree to which an observed-score equating function is sensitive to the population on which it is computed. This article extends the findings of Dorans and Holland and of von Davier et al. to item response theory (IRT) true-score equating methods that…
Descriptors: Advanced Placement, Advanced Placement Programs, Equated Scores, Calculus
Peer reviewed
Yi, Hyun Sook; Kim, Seonghoon; Brennan, Robert L. – Applied Psychological Measurement, 2007
Large-scale testing programs involving classification decisions typically have multiple forms available and conduct equating to ensure cut-score comparability across forms. A test developer might be interested in the extent to which an examinee who happens to take a particular form would have a consistent classification decision if he or she had…
Descriptors: Classification, Reliability, Indexes, Computation
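A quick way to see what such a consistency index captures is a small simulation: give each simulated examinee two equated forms, classify on each with its cut score, and report the agreement rate. Everything below (2PL generating model, form lengths, cut scores) is an invented illustration, not the index developed in the article.

```python
import math
import random

def number_correct(theta, items):
    """Simulate a number-correct score on a form of 2PL items [(a, b), ...]."""
    score = 0
    for a, b in items:
        p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
        score += random.random() < p
    return score

random.seed(1)
form_x = [(1.2, random.uniform(-1.5, 1.5)) for _ in range(40)]
form_y = [(1.2, random.uniform(-1.5, 1.5)) for _ in range(40)]
cut_x, cut_y = 24, 25   # cut scores treated as comparable after equating

examinees = [random.gauss(0.0, 1.0) for _ in range(5000)]
agreements = sum(
    (number_correct(t, form_x) >= cut_x) == (number_correct(t, form_y) >= cut_y)
    for t in examinees
)
print("classification consistency:", agreements / len(examinees))
```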
Peer reviewed
Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 2007
In a randomized experiment (n = 515), a computerized test and a computerized adaptive test (CAT) are compared. The item pool consists of 24 polytomous motivation items. Although the items are carefully selected, calibration data show that Samejima's graded response model did not fit the data optimally. A simulation study is done to assess possible…
Descriptors: Student Motivation, Simulation, Adaptive Testing, Computer Assisted Testing
Peer reviewed
Armstrong, Ronald D.; Jones, Douglas H.; Kunce, Charles S. – Applied Psychological Measurement, 1998
Investigated the use of mathematical programming techniques to generate parallel test forms with passages and items based on item-response theory (IRT) using the Fundamentals of Engineering Examination. Generated four parallel test forms from the item bank of almost 1,100 items. Comparison with human-generated forms supports the mathematical…
Descriptors: Engineering, Item Banks, Item Response Theory, Test Construction
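A minimal 0-1 programming sketch of automated form assembly is shown below using the PuLP modeling library: binary variables select items so as to maximize test information at a reference ability subject to a fixed form length. The pool and constraints are invented and much simpler than the passage-based, parallel-forms problem the article solves.

```python
import math
import random
import pulp  # open-source LP/IP modeling library

random.seed(0)
# Invented 2PL item pool; each item's Fisher information is evaluated at theta = 0.
pool = [(random.uniform(0.7, 1.8), random.uniform(-2.0, 2.0)) for _ in range(120)]

def info_at(a, b, theta=0.0):
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

info = [info_at(a, b) for a, b in pool]

prob = pulp.LpProblem("form_assembly", pulp.LpMaximize)
pick = [pulp.LpVariable(f"item_{i}", cat="Binary") for i in range(len(pool))]

prob += pulp.lpSum(info[i] * pick[i] for i in range(len(pool)))  # maximize information
prob += pulp.lpSum(pick) == 30                                   # fixed form length

prob.solve()
selected = [i for i, x in enumerate(pick) if x.value() == 1]
print(f"{len(selected)} items selected")
```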
Peer reviewed
Baker, Frank B. – Applied Psychological Measurement, 1996
Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…
Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions
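The characteristic curve (Stocking-Lord type) approach estimates the slope A and intercept B of the ability-scale transformation by minimizing the squared difference between the two calibrations' test characteristic curves over a set of quadrature points. The sketch below assumes 2PL anchor items with placeholder parameter values and is only an illustration of the general method.

```python
import numpy as np
from scipy.optimize import minimize

def tcc(theta, a, b):
    """Test characteristic curve: expected number-correct for 2PL items."""
    logits = a[None, :] * (theta[:, None] - b[None, :])
    return (1.0 / (1.0 + np.exp(-logits))).sum(axis=1)

def loss(coef, theta, a_new, b_new, a_old, b_old):
    """Squared TCC difference after rescaling the new calibration by (A, B)."""
    A, B = coef
    return np.sum((tcc(theta, a_old, b_old) - tcc(theta, a_new / A, A * b_new + B)) ** 2)

# Placeholder anchor-item estimates from the two calibrations.
a_old = np.array([1.0, 1.3, 0.9, 1.5]); b_old = np.array([-0.5, 0.0, 0.7, 1.2])
a_new = np.array([0.9, 1.2, 0.8, 1.4]); b_new = np.array([-0.3, 0.2, 0.9, 1.4])

theta = np.linspace(-4, 4, 81)        # quadrature points on the old scale
A_hat, B_hat = minimize(loss, x0=[1.0, 0.0],
                        args=(theta, a_new, b_new, a_old, b_old)).x
print("slope A =", round(A_hat, 3), "intercept B =", round(B_hat, 3))
```

Repeating this minimization over resampled calibration data is one way the sampling distributions of the equating coefficients studied in the article could be examined.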
Peer reviewed
Kim, Jee-Seon; Hanson, Bradley A. – Applied Psychological Measurement, 2002
Presents a characteristic curve procedure for comparing transformations of the item response theory ability scale assuming the multiple-choice model. Illustrates the use of the method with an example equating American College Testing mathematics tests. (SLD)
Descriptors: Ability, Equated Scores, Item Response Theory, Mathematics Tests
Peer reviewed
Sykes, Robert C.; Ito, Kyoko – Applied Psychological Measurement, 1997
Evaluated the equivalence of scores and one-parameter logistic model item difficulty estimates obtained from computer-based and paper-and-pencil forms of a licensure examination taken by 418 examinees. There was no effect of either order or mode of administration on the equivalences. (SLD)
Descriptors: Computer Assisted Testing, Estimation (Mathematics), Health Personnel, Item Response Theory
Peer reviewed
Quenette, Mary A.; Nicewander, W. Alan; Thomasson, Gary L. – Applied Psychological Measurement, 2006
Model-based equating was compared to empirical equating of an Armed Services Vocational Aptitude Battery (ASVAB) test form. The model-based equating was done using item pretest data to derive item response theory (IRT) item parameter estimates for those items that were retained in the final version of the test. The analysis of an ASVAB test form…
Descriptors: Item Response Theory, Multiple Choice Tests, Test Items, Computation
Peer reviewed
Wilson, Mark; Wang, Wen-chung – Applied Psychological Measurement, 1995
Data from the California Learning Assessment System mathematics assessment were used to examine issues that arise when scores from different assessment modes are combined. Multiple-choice, open-ended, and investigation items were combined in a test across three test forms. Results illustrate the difficulties faced in evaluating combined…
Descriptors: Educational Assessment, Equated Scores, Evaluation Methods, Item Response Theory
Peer reviewed
Yao, Lihua; Schwarz, Richard D. – Applied Psychological Measurement, 2006
Multidimensional item response theory (IRT) models have been proposed for better understanding the dimensional structure of data or to define diagnostic profiles of student learning. A compensatory multidimensional two-parameter partial credit model (M-2PPC) for constructed-response items is presented that is a generalization of those proposed to…
Descriptors: Models, Item Response Theory, Markov Processes, Monte Carlo Methods
Peer reviewed
van der Linden, Wim J. – Applied Psychological Measurement, 2006
Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores
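The test characteristic function (TCF) approach mentioned here maps an ability estimate from the adaptive test to the expected number-correct score on the linear test. A minimal sketch, assuming placeholder 3PL parameters for the linear form:

```python
import math

def tcf(theta, items):
    """Expected number-correct score on the linear test (3PL items (a, b, c))."""
    return sum(c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))
               for a, b, c in items)

# Placeholder 3PL parameters for the linear form; in practice these come
# from that form's calibration.
linear_form = [(1.1, -0.8, 0.20), (0.9, -0.2, 0.15),
               (1.4, 0.3, 0.25), (1.2, 1.0, 0.20)]

theta_hat = 0.45   # ability estimate from the adaptive test
print("equated number-correct score:", tcf(theta_hat, linear_form))
```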