Showing all 14 results
Peer reviewed
Wind, Stefanie A.; Guo, Wenjing – Educational Assessment, 2021
Scoring procedures for the constructed-response (CR) items in large-scale mixed-format educational assessments often involve checks for rater agreement or rater reliability. Although these analyses are important, researchers have documented rater effects that persist despite rater training and that are not always detected in rater agreement and…
Descriptors: Scoring, Responses, Test Items, Test Format
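The rater agreement and reliability checks referred to in this abstract are commonly summarized with statistics such as exact agreement and Cohen's kappa. As a purely illustrative sketch (not taken from the article, with hypothetical scores on a 0-3 rubric), the snippet below computes both for two raters scoring the same set of constructed responses:

```python
# Illustrative sketch only: two common rater-monitoring statistics for
# constructed-response scoring -- exact agreement and Cohen's kappa --
# computed for a pair of hypothetical raters.
from collections import Counter

def exact_agreement(rater_a, rater_b):
    """Proportion of responses to which both raters assigned the same score."""
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)

def cohens_kappa(rater_a, rater_b):
    """Agreement corrected for chance: (p_o - p_e) / (1 - p_e)."""
    n = len(rater_a)
    p_o = exact_agreement(rater_a, rater_b)
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    categories = set(rater_a) | set(rater_b)
    p_e = sum((freq_a[c] / n) * (freq_b[c] / n) for c in categories)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical scores assigned by two raters to ten essays.
scores_a = [2, 3, 1, 0, 2, 2, 3, 1, 1, 2]
scores_b = [2, 3, 1, 1, 2, 3, 3, 1, 0, 2]
print(exact_agreement(scores_a, scores_b))  # 0.7
print(cohens_kappa(scores_a, scores_b))     # chance-corrected agreement
```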
Peer reviewed
Bulut, Okan; Bulut, Hatice Cigdem; Cormier, Damien C.; Ilgun Dibek, Munevver; Sahin Kursad, Merve – Educational Assessment, 2023
Some statewide testing programs allow students to receive corrective feedback and revise their answers during testing. Despite its pedagogical benefits, the effects of providing revision opportunities remain unknown in the context of alternate assessments. Therefore, this study examined student data from a large-scale alternate assessment that…
Descriptors: Error Correction, Alternative Assessment, Feedback (Response), Multiple Choice Tests
Peer reviewed
Moon, Jung Aa; Keehner, Madeleine; Katz, Irvin R. – Educational Assessment, 2020
We investigated how item formats influence test takers' response tendencies under uncertainty. Adult participants solved content-equivalent math items in three formats: multiple-selection multiple-choice, grid with forced-choice (true-false) options, and grid with non-forced-choice options. Participants showed a greater tendency to commit (rather…
Descriptors: College Students, Test Wiseness, Test Format, Test Items
Peer reviewed
Becker, Anthony; Nekrasova-Beker, Tatiana – Educational Assessment, 2018
While previous research has identified numerous factors that contribute to item difficulty, studies involving large-scale reading tests have provided mixed results. This study examined five selected-response item types used to measure reading comprehension in the Pearson Test of English Academic: a) multiple-choice (choose one answer), b)…
Descriptors: Reading Comprehension, Test Items, Reading Tests, Test Format
Peer reviewed
Steedle, Jeffrey T.; Morrison, Kristin M. – Educational Assessment, 2019
Assessment items are commonly field tested prior to operational use to observe statistical item properties such as difficulty. Item parameter estimates from field testing may be used to assign scores via pre-equating or computer adaptive designs. This study examined differences between item difficulty estimates based on field test and operational…
Descriptors: Field Tests, Test Items, Statistics, Difficulty Level
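Field-test versus operational difficulty comparisons of the kind described above are often made on the classical p-value scale (proportion correct), even when IRT parameters drive scoring. As an illustrative sketch only, using hypothetical 0/1 response vectors rather than the study's data, the snippet below computes the p-value for one item in each administration and their difference:

```python
# Illustrative sketch only: classical item difficulty is the proportion of
# examinees answering an item correctly (the p-value). Comparing field-test
# and operational p-values gives a rough picture of difficulty drift.

def p_value(responses):
    """Proportion correct for one item; responses are 0/1 scores."""
    return sum(responses) / len(responses)

# Hypothetical 0/1 response vectors for one item in the two administrations.
field_test_responses = [1, 0, 1, 1, 0, 1, 0, 1, 1, 1]   # field-test sample
operational_responses = [1, 0, 0, 1, 0, 1, 0, 0, 1, 1]  # operational group

drift = p_value(field_test_responses) - p_value(operational_responses)
print(f"field test p = {p_value(field_test_responses):.2f}, "
      f"operational p = {p_value(operational_responses):.2f}, drift = {drift:+.2f}")
```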
Peer reviewed
Hassler Hallstedt, Martin; Ghaderi, Ata – Educational Assessment, 2018
Tablets can be used to facilitate systematic testing of academic skills. Yet, when administering validated paper tests on a tablet, comparability between the two media must be established. Comparability between a tablet and a paper version of a basic math skills test (HRT: Heidelberger Rechen Test 1-4) was investigated. Five samples with second and third…
Descriptors: Handheld Devices, Scores, Test Format, Computer Assisted Testing
Peer reviewed
Kan, Adnan; Bulut, Okan; Cormier, Damien C. – Educational Assessment, 2019
Item stem formats can alter the cognitive complexity as well as the type of abilities required for solving mathematics items. Consequently, it is possible that item stem formats can affect the dimensional structure of mathematics assessments. This empirical study investigated the relationship between item stem format and the dimensionality of…
Descriptors: Mathematics Tests, Test Items, Test Format, Problem Solving
Peer reviewed
Lakin, Joni M. – Educational Assessment, 2014
The purpose of test directions is to familiarize examinees with a test so that they respond to items in the manner intended. However, changes in educational measurement, as well as in the U.S. student population, present new challenges to test directions and increase the impact that differential familiarity could have on the validity of test score…
Descriptors: Test Content, Test Construction, Best Practices, Familiarity
Peer reviewed
Pae, Hye K. – Educational Assessment, 2014
This study investigated the role of item formats in the performance of 206 nonnative speakers of English on expressive skills (i.e., speaking and writing). Test scores were drawn from the field test of the "Pearson Test of English Academic" for Chinese, French, Hebrew, and Korean native speakers. Four item formats, including…
Descriptors: Test Items, Test Format, Speech Skills, Writing Skills
Peer reviewed
Cheng, Liying; DeLuca, Christopher – Educational Assessment, 2011
Test-takers' interpretations of validity as related to test constructs and test use have been widely debated in large-scale language assessment. This study contributes further evidence to this debate by examining 59 test-takers' perspectives in writing large-scale English language tests. Participants wrote about their test-taking experiences in…
Descriptors: Language Tests, Test Validity, Test Use, English
Peer reviewed
DeMars, Christine E. – Educational Assessment, 2007
A series of 8 tests was administered to university students over 4 weeks for program assessment purposes. The stakes of these tests were low for students; they received course points based on test completion, not test performance. Tests were administered in a counterbalanced order across 2 administrations. Response time effort, a measure of the…
Descriptors: Reaction Time, Guessing (Tests), Testing Programs, College Students
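Response time effort (RTE), mentioned in the abstract, is usually defined (following Wise & Kong, 2005) as the proportion of items on which an examinee's response time indicates solution behavior rather than rapid guessing. The sketch below is an illustration only; the item thresholds and response times are hypothetical, not values from the study:

```python
# Illustrative sketch only: response time effort (RTE) is the share of items
# on which an examinee's response time meets or exceeds an item-level
# threshold, i.e., shows solution behavior rather than rapid guessing.

def response_time_effort(response_times, thresholds):
    """Proportion of items answered with solution behavior (time >= threshold, seconds)."""
    flags = [t >= cut for t, cut in zip(response_times, thresholds)]
    return sum(flags) / len(flags)

# One examinee's response times (seconds) on five items, with hypothetical per-item cutoffs.
times = [14.2, 2.1, 9.8, 1.4, 22.5]
cutoffs = [5.0, 5.0, 4.0, 6.0, 8.0]
print(response_time_effort(times, cutoffs))  # 0.6 -> two rapid-guess responses
```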
Peer reviewed
Zenisky, April L.; Hambleton, Ronald K.; Robin, Frederic – Educational Assessment, 2004
Differential item functioning (DIF) analyses are a routine part of the development of large-scale assessments. Less common are studies to understand the potential sources of DIF. The goals of this study were (a) to identify gender DIF in a large-scale science assessment and (b) to look for trends in the DIF and non-DIF items due to content,…
Descriptors: Program Effectiveness, Test Format, Science Tests, Test Items
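The routine DIF analyses referred to here frequently rely on the Mantel-Haenszel procedure, which stratifies examinees by total score and pools 2x2 tables across strata. The sketch below is purely illustrative; the counts, the group labels, and the use of the ETS delta transformation are assumptions, not details reported by the authors:

```python
# Illustrative sketch only: the Mantel-Haenszel common odds ratio compares
# item performance of a reference and a focal group within total-score strata.
from math import log

def mantel_haenszel_odds_ratio(strata):
    """strata: list of dicts with counts of right/wrong answers per group."""
    num = sum(s["ref_right"] * s["foc_wrong"] / s["total"] for s in strata)
    den = sum(s["ref_wrong"] * s["foc_right"] / s["total"] for s in strata)
    return num / den

# Hypothetical 2x2 tables at three total-score levels for one science item.
tables = [
    {"ref_right": 40, "ref_wrong": 20, "foc_right": 30, "foc_wrong": 25, "total": 115},
    {"ref_right": 55, "ref_wrong": 10, "foc_right": 45, "foc_wrong": 15, "total": 125},
    {"ref_right": 70, "ref_wrong": 5,  "foc_right": 60, "foc_wrong": 10, "total": 145},
]

alpha_mh = mantel_haenszel_odds_ratio(tables)
mh_d_dif = -2.35 * log(alpha_mh)  # ETS delta scale; values near 0 suggest little DIF
print(f"alpha_MH = {alpha_mh:.2f}, MH D-DIF = {mh_d_dif:.2f}")
```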
Peer reviewed
Loyd, Brenda; Englehard, George, Jr.; Crocker, Linda – Educational Assessment, 1996
Some of the measurement issues encountered in the equating of performance assessments designed for use in teacher certification decisions are described. Analytic strategies based on examinee data, which involve modifying existing procedures, and judgmental strategies, which rely on expert judgments to determine score equivalence, are also…
Descriptors: Elementary Secondary Education, Equated Scores, Judges, Performance Based Assessment
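One way to illustrate the analytic strategies mentioned in the abstract, namely placing scores from different forms on a common scale, is linear equating, which matches the means and standard deviations of the two score distributions. This sketch is only an illustration under that assumption; the scores and form labels are hypothetical, not the authors' data:

```python
# Illustrative sketch only: linear equating maps a Form X score onto the
# Form Y scale by matching the first two moments of the score distributions.
from statistics import mean, pstdev

def linear_equate(x, form_x_scores, form_y_scores):
    """Map a Form X score x onto the Form Y scale."""
    mu_x, sd_x = mean(form_x_scores), pstdev(form_x_scores)
    mu_y, sd_y = mean(form_y_scores), pstdev(form_y_scores)
    return mu_y + (sd_y / sd_x) * (x - mu_x)

# Hypothetical total scores from examinees who took each form.
form_x = [12, 15, 18, 20, 22, 25, 28]
form_y = [10, 14, 16, 19, 21, 23, 27]
print(round(linear_equate(20, form_x, form_y), 2))  # Form X score of 20 on the Y scale
```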
Peer reviewed
Hollenbeck, Keith; Tindal, Gerald; Almond, Patricia – Educational Assessment, 1999
Studied the amount of measurement error in a state's performance-based writing task as it relates to high-stakes decision reproducibility. Using 175 eighth-grade writing samples, the study finds moderate correlations between the two raters' scores, with significant differences in the ratings for the handwritten, but not the typed, essays. (SLD)
Descriptors: Decision Making, Error of Measurement, Essay Tests, Grade 8
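The moderate rater correlations described in the abstract can be summarized with a Pearson correlation between the two raters' scores, computed separately for handwritten and typed essays. The sketch below is illustrative only; all scores are hypothetical and chosen merely to mimic the reported pattern of lower agreement on handwritten essays:

```python
# Illustrative sketch only: Pearson correlation between two raters' scores,
# computed separately for hypothetical handwritten and typed essay sets.
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two raters' score vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

rater1_handwritten = [3, 4, 2, 5, 3, 4]
rater2_handwritten = [2, 4, 3, 4, 2, 5]
rater1_typed = [3, 4, 2, 5, 3, 4]
rater2_typed = [3, 4, 2, 5, 4, 4]
print(round(pearson_r(rater1_handwritten, rater2_handwritten), 2))  # moderate agreement
print(round(pearson_r(rater1_typed, rater2_typed), 2))              # higher agreement
```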