ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	10

Descriptor

Difficulty Level	16
Test Bias	16
Test Format	16
Test Items	12
Foreign Countries	7
Multiple Choice Tests	6
Comparative Analysis	4
Test Construction	4
Grade 8	3
Item Analysis	3
Item Response Theory	3
Language Tests	3
Achievement Tests	2
College Entrance Examinations	2
College Students	2
English (Second Language)	2
Equated Scores	2
French	2
Goodness of Fit	2
Individual Differences	2
Item Banks	2
Language Proficiency	2
Listening Comprehension Tests	2
Mathematics Tests	2
Psychometrics	2
More ▼

Source

Advances in Health Sciences…	1
Applied Measurement in…	1
Cogent Education	1
Educational Psychology Review	1
International Journal of…	1
Journal of Applied Measurement	1
Journal of Educational…	1
Language Learning in Higher…	1
Language Testing	1
SAGE Open	1

Publication Type

Reports - Research	11
Journal Articles	10
Reports - Evaluative	4
Information Analyses	3
Speeches/Meeting Papers	3

Education Level

Grade 8	3
Higher Education	3
Postsecondary Education	3
Elementary Education	2
Elementary Secondary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Location

Belgium	1
Germany	1
Iran	1
Israel	1
Japan	1
Spain	1
Turkey	1
USSR	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
National Assessment of…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Do Practice Tests (Quizzes) Reduce or Provoke Test Anxiety? A Meta-Analytic Review

Peer reviewed

Direct link

Yang, Chunliang; Li, Jiaojiao; Zhao, Wenbo; Luo, Liang; Shanks, David R. – Educational Psychology Review, 2023

Practice testing is a powerful tool to consolidate long-term retention of studied information, facilitate subsequent learning of new information, and foster knowledge transfer. However, practitioners frequently express the concern that tests are anxiety-inducing and that their employment in the classroom should be minimized. The current review…

Descriptors: Tests, Test Format, Testing, Test Wiseness

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Method Bias in Cloze Tests as Reading Comprehension Measures

Peer reviewed

Direct link

Baghaei, Purya; Ravand, Hamdollah – SAGE Open, 2019

In many reading comprehension tests, different test formats are employed. Two commonly used test formats to measure reading comprehension are sustained passages followed by some questions and cloze items. Individual differences in handling test format peculiarities could constitute a source of score variance. In this study, a bifactor Rasch model…

Descriptors: Cloze Procedure, Test Bias, Individual Differences, Difficulty Level

Bias Factors in Mathematics Achievement Tests among Israeli Students from the Former Soviet Union

Peer reviewed

Direct link

Levi-Keren, Michal – Cogent Education, 2016

This study explains mathematical difficulties of students who immigrated from the Former Soviet Union (FSU) vis-à-vis Israeli students, by identifying the existing bias factors in achievement tests. These factors are irrelevant to the mathematical knowledge being measured, and therefore threaten the test results. The bias factors were identified…

Descriptors: Mathematics Achievement, Mathematics Tests, Immigrants, Interviews

A Comparison of Video- and Audio-Mediated Listening Tests with Many-Facet Rasch Modeling and Differential Distractor Functioning

Peer reviewed

Direct link

Batty, Aaron Olaf – Language Testing, 2015

The rise in the affordability of quality video production equipment has resulted in increased interest in video-mediated tests of foreign language listening comprehension. Although research on such tests has continued fairly steadily since the early 1980s, studies have relied on analyses of raw scores, despite the growing prevalence of item…

Descriptors: Listening Comprehension Tests, Comparative Analysis, Video Technology, Audio Equipment

Taking Decisions: Assessment for University Entry

Peer reviewed

Direct link

Plassmann, Sibylle; Zeidler, Beate – Language Learning in Higher Education, 2014

Language testing means taking decisions: about the test taker's results, but also about the test construct and the measures taken in order to ensure quality. This article takes the German test "telc Deutsch C1 Hochschule" as an example to illustrate this decision-making process in an academic context. The test is used for university…

Descriptors: Language Tests, Test Wiseness, Test Construction, Decision Making

Modeling Item-Position Effects within an IRT Framework

Peer reviewed

Direct link

Debeer, Dries; Janssen, Rianne – Journal of Educational Measurement, 2013

Changing the order of items between alternate test forms to prevent copying and to enhance test security is a common practice in achievement testing. However, these changes in item order may affect item and test characteristics. Several procedures have been proposed for studying these item-order effects. The present study explores the use of…

Descriptors: Item Response Theory, Test Items, Test Format, Models

Exploring Differential Item Functioning (DIF) with the Rasch Model: A Comparison of Gender Differences on Eighth Grade Science Items in the United States and Spain

Peer reviewed

Direct link

Babiar, Tasha Calvert – Journal of Applied Measurement, 2011

Traditionally, women and minorities have not been fully represented in science and engineering. Numerous studies have attributed these differences to gaps in science achievement as measured by various standardized tests. Rather than describe mean group differences in science achievement across multiple cultures, this study focused on an in-depth…

Descriptors: Test Bias, Science Achievement, Standardized Tests, Grade 8

Examining the Effectiveness of Test Accommodation Using DIF and a Mixture IRT Model

Peer reviewed

Direct link

Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012

This study examined the validity of test accommodation in third-eighth graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…

Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity

Which Item Types Are Better Suited to the Linking of Verbal Adapted Tests?

Peer reviewed

Direct link

Allalouf, Avi; Rapp, Joel; Stoller, Reuven – International Journal of Testing, 2009

When a test is adapted from a source language (SL) into a target language (TL), the two forms are usually not psychometrically equivalent. If linking between test forms is necessary, those items that have had their psychometric characteristics altered by the translation (differential item functioning [DIF] items) should be eliminated from the…

Descriptors: Test Items, Test Format, Verbal Tests, Psychometrics

Differential Performance of Males and Females on Easy to Hard Item Arrangements; Influence of Feedback at the Item Level.

Plake, Barbara S.; And Others – 1983

Differential test performance by undergraduate males and females enrolled in a developmental educational psychology course (n=167) was reported on a quantitative examination as a function of item arrangement. Males were expected to perform better than females on tests whose items arranged easy to hard. Plake and Ansorge (1982) speculated this may…

Descriptors: Difficulty Level, Feedback, Higher Education, Scoring

On the Viability of Some Untestable Assumptions in Equating Exams That Allow Examinee Choice. Program Statistics Research Technical Report No. 93-31.

Download full text

Wang, Xiang-bo; And Others – 1993

An increasingly popular test format allows examinees to choose the items they will answer from among a larger set. When examinee choice is allowed fairness requires that the different test forms thus formed be equated for their possible differential difficulty. For this equating to be possible it is necessary to know how well examinees would have…

Descriptors: Adaptive Testing, Advanced Placement, Difficulty Level, Equated Scores

On Examinee Choice in Educational Testing. GRE Board Professional Report No. 91-17P.

Download full text

Wainer, Howard; Thissen, David – 1994

When an examination consists in whole or part of constructed response test items, it is common practice to allow the examinee to choose a subset of the constructed response questions from a larger pool. It is sometimes argued that, if choice were not allowed, the limitations on domain coverage forced by the small number of items might unfairly…

Descriptors: Constructed Response, Difficulty Level, Educational Testing, Equated Scores

Item Bias Detection and Item Writing Technology.

Roid, Gale H.; Wendler, Cathy L. W. – 1983

The development of the emerging technology of item writing was motivated in part by the desire to reduce potential subjectivity and bias between different item writers who attempt to construct parallel achievement tests. The present study contrasts four test forms constructed by the combined efforts of six item writers using four methods of item…

Descriptors: Achievement Tests, Difficulty Level, Intermediate Grades, Item Analysis

The Effect of Position and Format on the Difficulty of Assessment Exercises.

Download full text

Burton, Nancy W.; And Others – 1976

Assessment exercises (items) in three different formats--multiple-choice with an "I don't know" (IDK) option, multiple-choice without the IDK, and open-ended--were placed at the beginning, middle and end of 45-minute assessment packages (instruments). A balanced incomplete blocks analysis of variance was computed to determine the biasing…

Descriptors: Age Differences, Difficulty Level, Educational Assessment, Guessing (Tests)

Previous Page | Next Page »

Pages: 1 | 2

Allalouf, Avi	1
Babiar, Tasha Calvert	1
Baghaei, Purya	1
Batty, Aaron Olaf	1
Bauer, Daniel	1
Bolden, Bernadine J.	1
Burton, Nancy W.	1
Cho, Hyun-Jeong	1
Debeer, Dries	1
Fischer, Martin R.	1
Guttormsen, Sissel	1
Huwendiek, Sören	1
Janssen, Rianne	1
Kingston, Neal	1
Krebs, René	1
Lahner, Felicitas-Maria	1
Lee, Jaehoon	1
Levi-Keren, Michal	1
Li, Jiaojiao	1
Luo, Liang	1
Lörwald, Andrea Carolin	1
Nouns, Zineb Miriam	1
Plake, Barbara S.	1
Plassmann, Sibylle	1
More ▼