ERIC - Search Results

Showing all 13 results

Impact of Violating Unidimensionality on Rasch Calibration for Mixed-Format Tests

Peer reviewed

Chunyan Liu; Raja Subhiyah; Richard A. Feinberg – Applied Measurement in Education, 2024

Mixed-format tests that include both multiple-choice (MC) and constructed-response (CR) items have become widely used in many large-scale assessments. When an item response theory (IRT) model is used to score a mixed-format test, the unidimensionality assumption may be violated if the CR items measure a different construct from that measured by MC…

Descriptors: Test Format, Response Style (Tests), Multiple Choice Tests, Item Response Theory

Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

Peer reviewed

Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016

Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis

Determining the Anchor Composition for a Mixed-Format Test: Evaluation of Subpopulation Invariance of Linking Functions

Peer reviewed

Kim, Sooyeon; Walker, Michael – Applied Measurement in Education, 2012

This study examined the appropriateness of the anchor composition in a mixed-format test, which includes both multiple-choice (MC) and constructed-response (CR) items, using subpopulation invariance indices. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using two types of anchor sets: (a) MC only and (b)…

Descriptors: Multiple Choice Tests, Test Format, Test Items, Equated Scores

Gender DIF in Reading and Mathematics Tests with Mixed Item Formats

Peer reviewed

Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2012

This was a study of differential item functioning (DIF) for grades 4, 7, and 10 reading and mathematics items from state criterion-referenced tests. The tests were composed of multiple-choice and constructed-response items. Gender DIF was investigated using POLYSIBTEST and a Rasch procedure. The Rasch procedure flagged more items for DIF than did…

Descriptors: Test Bias, Gender Differences, Reading Tests, Mathematics Tests

Measurement Properties of Two Innovative Item Formats in a Computer-Based Test

Peer reviewed

Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012

Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement

Peer reviewed

Ascalon, M. Evelina; Meyers, Lawrence S.; Davis, Bruce W.; Smits, Niels – Applied Measurement in Education, 2007

This article examined two item-writing guidelines: the format of the item stem and homogeneity of the answer set. Answering the call of Haladyna, Downing, and Rodriguez (2002) for empirical tests of item writing guidelines and extending the work of Smith and Smith (1988) on differential use of item characteristics, a mock multiple-choice driver's…

Descriptors: Guidelines, Difficulty Level, Standard Setting, Driver Education

Robustness to Format Effects of IRT Linking Methods for Mixed-Format Tests

Peer reviewed

Kim, Seonghoon; Kolen, Michael J. – Applied Measurement in Education, 2006

Four item response theory linking methods (2 moment methods and 2 characteristic curve methods) were compared to concurrent (CO) calibration with the focus on the degree of robustness to format effects (FEs) when applying the methods to multidimensional data that reflected the FEs associated with mixed-format tests. Based on the quantification of…

Descriptors: Item Response Theory, Robustness (Statistics), Test Format, Comparative Analysis

Validity of a Taxonomy of Multiple-Choice Item-Writing Rules.

Peer reviewed

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989

Results of 96 theoretical/empirical studies were reviewed to see if they support a taxonomy of 43 rules for writing multiple-choice test items. The taxonomy is the result of an analysis of 46 textbooks dealing with multiple-choice item writing. For nearly half of the rules, no research was found. (SLD)

Descriptors: Classification, Literature Reviews, Multiple Choice Tests, Test Construction

A Taxonomy of Multiple-Choice Item-Writing Rules.

Peer reviewed

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989

A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)

Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests

Gender Differences in Mathematics and Science on a High School Proficiency Exam: The Role of Response Format.

Peer reviewed

DeMars, Christine E. – Applied Measurement in Education, 1998

Scores from mathematics (tested at 102 schools) and science (tested at 99 schools) sections of pilot forms of the Michigan High School Proficiency Test were examined for interaction between gender and response format (multiple choice or constructed response). Overall, neither males nor females seemed to be disadvantaged by item format. (SLD)

Descriptors: Constructed Response, High School Students, High Schools, Mathematics Tests

Differential Effects of Question Formats in Math Assessment on Metacognition and Affect.

Peer reviewed

O'Neil, Harold F., Jr.; Brown, Richard S. – Applied Measurement in Education, 1998

The effects of item format on the metacognitive and affective processes of children in a large-scale mathematics assessment program were studied. Results from 1032 eighth graders indicate that open-ended and multiple-choice items have differential effects, although these did not vary substantially as a function of gender and ethnicity. (SLD)

Descriptors: Affective Behavior, Ethnicity, Grade 8, Junior High School Students

The Effectiveness of Several Multiple-Choice Formats.

Peer reviewed

Haladyna, Thomas A. – Applied Measurement in Education, 1992

Several multiple-choice item formats are examined in the current climate of test reform. The reform movement is discussed as it affects use of the following formats: (1) complex multiple-choice; (2) alternate choice; (3) true-false; (4) multiple true-false; and (5) the context dependent item set. (SLD)

Descriptors: Cognitive Psychology, Comparative Testing, Context Effect, Educational Change

Test Stakes and Item Format Interactions.

Peer reviewed

DeMars, Christine E. – Applied Measurement in Education, 2000

Studied the effects of test consequences, response formats, gender, and ethnicity on the mathematics and science sections of the Michigan High School Proficiency Test. Results for more than 11,000 students show that students taking constructed response and multiple choice formats performed better under high stakes conditions. Discusses gender and…

Descriptors: Constructed Response, Ethnicity, High School Students, High Schools
