Showing all 7 results
Peer reviewed
Wind, Stefanie A. – Language Testing, 2023
Researchers frequently evaluate rater judgments in performance assessments for evidence of differential rater functioning (DRF), which occurs when rater severity is systematically related to construct-irrelevant student characteristics after controlling for student achievement levels. However, researchers have observed that methods for detecting…
Descriptors: Evaluators, Decision Making, Student Characteristics, Performance Based Assessment
Peer reviewed
Full text available on ERIC (PDF)
Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G. – Journal of the Scholarship of Teaching and Learning, 2016
Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on the validity and reliability of scores obtained on MCQs. Free-response (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…
Descriptors: Scores, Multiple Choice Tests, Test Reliability, Test Validity
Peer reviewed
Laprise, Shari L. – College Teaching, 2012
Successful exam composition can be a difficult task. Exams should not only assess student comprehension but also serve as learning tools in their own right. In a biotechnology course delivered to nonmajors at a business college, objective multiple-choice test questions often require students to choose the exception or "not true" choice. Anecdotal student…
Descriptors: Feedback (Response), Test Items, Multiple Choice Tests, Biotechnology
Peer reviewed
Sparfeldt, Jörn R.; Kimmel, Rumena; Löwenkamp, Lena; Steingräber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Peer reviewed
Kuechler, William L.; Simkin, Mark G. – Decision Sciences Journal of Innovative Education, 2010
Both professional certification and academic tests rely heavily on multiple-choice questions, despite the widespread belief that alternate, constructed-response questions are superior measures of a test taker's understanding of the underlying material. Empirically, the search for a link between these two assessment metrics has met with limited…
Descriptors: Multiple Choice Tests, Performance Based Assessment, Alternative Assessment, Knowledge Level
Peer reviewed
Hammann, Marcus; Phan, Thi Thanh Hoi; Ehmer, Maike; Grimm, Tobias – Journal of Biological Education, 2008
This study is concerned with different forms of assessment of pupils' skills in experimentation. The findings of three studies are reported. Study 1 investigates whether it is possible to develop reliable multiple-choice tests for the skills of forming hypotheses, designing experiments and analysing experimental data. Study 2 compares scores from…
Descriptors: Multiple Choice Tests, Experiments, Science Process Skills, Skill Analysis
Singley, Mark K.; Bennett, Randy Elliot – 1995
One of the main limitations of the current generation of computer-based tests is its dependency on the multiple-choice item. This research was aimed at extending computer-based testing by bringing limited forms of performance assessment to it in the domain of mathematics. This endeavor involves not only building task types that better reflect…
Descriptors: Computer Assisted Testing, Item Analysis, Mathematics Tests, Multiple Choice Tests