Showing all 12 results
Peer reviewed
PDF on ERIC
Al-zboon, Habis Saad; Alrekebat, Amjad Farhan – International Journal of Higher Education, 2021
This study aims to identify the effect of multiple-choice test items' degree of difficulty on the reliability coefficient and the standard error of measurement under item response theory (IRT). To achieve the objectives of the study, WinGen3 software was used to generate the IRT parameters (difficulty, discrimination, guessing) for four…
Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Error of Measurement
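The quantities this study relates (item difficulty, reliability of measurement, and the standard error of measurement under IRT) can be sketched briefly. The sketch below assumes the standard 3PL response function and the usual IRT definition of the standard error of measurement as the inverse square root of test information; it does not reproduce the study's WinGen3 workflow, and the item parameters in the test are invented for illustration.

```python
import numpy as np

def p_3pl(theta, a, b, c, D=1.7):
    """3PL probability of a correct response at ability theta,
    with discrimination a, difficulty b, lower asymptote c."""
    return c + (1 - c) / (1 + np.exp(-D * a * (theta - b)))

def item_info_3pl(theta, a, b, c, D=1.7):
    """Fisher information of a single 3PL item at ability theta."""
    P = p_3pl(theta, a, b, c, D)
    Q = 1 - P
    return (D * a) ** 2 * (Q / P) * ((P - c) / (1 - c)) ** 2

def sem(theta, items, D=1.7):
    """IRT standard error of measurement: 1 / sqrt(test information),
    where test information is the sum of item informations."""
    info = sum(item_info_3pl(theta, a, b, c, D) for a, b, c in items)
    return 1.0 / np.sqrt(info)
```

Because item information peaks near the item's difficulty, a test whose items are badly mismatched to an examinee's ability yields a larger standard error at that ability level.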
Peer reviewed
Direct link
Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023
Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses
Peer reviewed
PDF on ERIC
Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012
For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…
Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)
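The point about the c-parameter can be seen directly from the shape of the 3PL curve: at very low ability, the probability of a correct response flattens at c, the lower asymptote, which need not equal the chance level 1/k for a k-option item. The sketch below uses invented parameter values for illustration; it is not the study's analysis.

```python
import math

def p_3pl(theta, a, b, c, D=1.7):
    """3PL probability of a correct response."""
    return c + (1 - c) / (1 + math.exp(-D * a * (theta - b)))

# At very low ability the 3PL curve approaches c. For a 4-option item,
# blind guessing would predict a floor of 0.25, but fitted c values are
# often well below (or occasionally above) that -- one reason c should
# not be read literally as a "guessing" probability.
```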
Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study, conducted in the spring of 2011, of the seventh-grade multiple-choice reading comprehension measures available on the easyCBM learning system. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7
Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
This technical report describes the process of development and piloting of reading comprehension measures that are appropriate for seventh-grade students as part of an online progress screening and monitoring assessment system, http://easycbm.com. Each measure consists of an original fictional story of approximately 1,600 to 1,900 words with 20…
Descriptors: Reading Comprehension, Reading Tests, Grade 7, Test Construction
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development of reading comprehension assessments designed for use as progress monitoring measures appropriate for 2nd Grade students. The creation, piloting, and technical adequacy of the measures are presented. The following are appended: (1) Item Specifications for MC [Multiple Choice] Comprehension - Passage…
Descriptors: Reading Comprehension, Reading Tests, Grade 2, Elementary School Students
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2008
This technical report describes the development and piloting of reading comprehension measures developed for use by fifth-grade students as part of an online progress monitoring assessment system, http://easycbm.com. Each comprehension measure comprises an original work of narrative fiction approximately 1,500 words in length followed by 20…
Descriptors: Reading Comprehension, Reading Tests, Grade 5, Multiple Choice Tests
Peer reviewed
Straton, Ralph G.; Catts, Ralph M. – Educational and Psychological Measurement, 1980
Multiple-choice tests composed entirely of two-, three-, or four-choice items were investigated. Results indicated that the number of alternatives per item was inversely related to item difficulty but directly related to item discrimination. The reliability and standard error of measurement of three-choice item tests were equivalent or superior.…
Descriptors: Difficulty Level, Error of Measurement, Foreign Countries, Higher Education
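The reported inverse relation between the number of alternatives and item difficulty (proportion correct) is what a simple knowledge-or-random-guessing model predicts: examinees who know the answer respond correctly, and the rest guess at random among k options. This toy model, sketched below, is an illustration, not the authors' analysis.

```python
def expected_p(p_know, k):
    """Expected proportion correct on a k-option item when a fraction
    p_know of examinees know the answer and the rest guess at random."""
    return p_know + (1 - p_know) / k
```

Holding knowledge fixed, adding options lowers the expected proportion correct, i.e. makes the item harder in the classical (p-value) sense.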
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2007
In this technical report, the authors describe the development and piloting of reading comprehension measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fifth grade. They begin with a brief overview of the two conceptual frameworks underlying the…
Descriptors: Reading Comprehension, Emergent Literacy, Test Construction, Literacy Education
Livingston, Samuel A. – 1986
This paper deals with the fairness of a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from others. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Peer reviewed
Israel, Glenn D.; Taylor, C. L. – Evaluation and Program Planning, 1990
Mail questionnaire items that are susceptible to order effects were examined using data from 168 questionnaires in a Florida Cooperative Extension Service evaluation. Order effects were found for multiple-response and attributive questions but not for single-response items. Order also interacted with question complexity, social desirability, and…
Descriptors: Adult Farmer Education, Difficulty Level, Educational Assessment, Error of Measurement
Roid, Gale; Haladyna, Tom – 1978
The technology of transforming sentences from prose instruction into test questions was examined by comparing two methods of selecting sentences (keyword vs. rare singleton), two types of question words (nouns vs. adjectives), and two foil construction methods (writer's choice vs. algorithmic). Four item writers created items using each…
Descriptors: Algorithms, Cloze Procedure, Comparative Analysis, Criterion Referenced Tests