ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	7

Descriptor

Error of Measurement	10
Multiple Choice Tests	10
Statistical Analysis	10
Test Items	7
Difficulty Level	5
Goodness of Fit	4
Item Response Theory	4
Reading Comprehension	3
Reading Tests	3
Cheating	2
Comparative Analysis	2
Educational Testing	2
Formative Evaluation	2
Guessing (Tests)	2
Pilot Projects	2
Public Schools	2
Responses	2
Simulation	2
Student Evaluation	2
Test Construction	2
Test Reliability	2
Academic Standards	1
Accuracy	1
Certification	1
Computation	1
More ▼

Source

Behavioral Research and…	3
ETS Research Report Series	2
Applied Psychological…	1
Educ Psychol Meas	1
Educational and Psychological…	1
Practical Assessment,…	1

Publication Type

Reports - Research	6
Journal Articles	5
Numerical/Quantitative Data	3
Reports - Evaluative	3
Speeches/Meeting Papers	1

Education Level

Elementary Education	2
Early Childhood Education	1
Grade 2	1
Grade 5	1
Grade 7	1
Intermediate Grades	1
Middle Schools	1
Primary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 10 results Save | Export

The Empirical Power and Type I Error Rates of the GBT and [omega] Indices in Detecting Answer Copying on Multiple-Choice Tests

Peer reviewed

Direct link

Zopluoglu, Cengiz; Davenport, Ernest C., Jr. – Educational and Psychological Measurement, 2012

The generalized binomial test (GBT) and [omega] indices are the most recent methods suggested in the literature to detect answer copying behavior on multiple-choice tests. The [omega] index is one of the most studied indices, but there has not yet been a systematic simulation study for the GBT index. In addition, the effect of the ability levels…

Descriptors: Statistical Analysis, Error of Measurement, Simulation, Multiple Choice Tests

Fixing the c Parameter in the Three-Parameter Logistic Model

Peer reviewed
PDF on ERIC

Download full text

Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012

For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…

Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 7. Technical Report #1206

Download full text

Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7

Impossible Scores Resulting in Zero Frequencies in the Anchor Test: Impact on Smoothing and Equating. Research Report. ETS RR-08-10

Peer reviewed
PDF on ERIC

Download full text

Puhan, Gautam; vonDavier, Alina; Gupta, Shaloo – ETS Research Report Series, 2008

Equating under the external anchor design is frequently conducted using scaled scores on the anchor test. However, scaled scores often lead to the unique problem of creating zero frequencies in the score distribution because there may not always be a one-to-one correspondence between raw and scaled scores. For example, raw scores of 17 and 18 may…

Descriptors: Equated Scores, Test Items, Raw Scores, Statistical Analysis

Comparison of Subscores Based on Classical Test Theory Methods. Research Report. ETS RR-08-54

Peer reviewed
PDF on ERIC

Download full text

Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – ETS Research Report Series, 2008

Will reporting subscores provide any additional information than the total score? Is there a method that can be used to provide more trustworthy subscores than observed subscores? These 2 questions are addressed in this study. To answer the 2nd question, 2 subscore estimation methods (i.e., subscore estimated from the observed total score or…

Descriptors: Comparative Analysis, Scores, Tests, Certification

Examining the Technical Adequacy of Second-Grade Reading Comprehension Measures in a Progress Monitoring Assessment System. Technical Report # 08-08

Download full text

Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2008

This technical report describes the development of reading comprehension assessments designed for use as progress monitoring measures appropriate for 2nd Grade students. The creation, piloting, and technical adequacy of the measures are presented. The following are appended: (1) Item Specifications for MC [Multiple Choice] Comprehension - Passage…

Descriptors: Reading Comprehension, Reading Tests, Grade 2, Elementary School Students

Examining the Technical Adequacy of Fifth-Grade Reading Comprehension Measures in a Progress Monitoring Assessment System. Technical Report # 08-07

Download full text

Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2008

This technical report describes the development and piloting of reading comprehension measures developed for use by fifth-grade students as part of an online progress monitoring assessment system, http://easycbm.com. Each comprehension measure is comprised of an original work of narrative fiction approximately 1500 words in length followed by 20…

Descriptors: Reading Comprehension, Reading Tests, Grade 5, Multiple Choice Tests

Effect of Variation in Probability of Guessing Correctly on Reliability of Multiple-Choice Tests

Frary, Robert B.; Zimmerman, Donald W. – Educ Psychol Meas, 1970

Descriptors: Error of Measurement, Guessing (Tests), Multiple Choice Tests, Probability

Detecting Answer Copying Using the Kappa Statistic

Peer reviewed

Direct link

Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006

A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…

Descriptors: Cheating, Test Items, Simulation, Statistical Analysis

Adjusting Scores on Examinations Offering a Choice of Questions.

Download full text

Livingston, Samuel A. – 1986

This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…

Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models

Alonzo, Julie	3
Tindal, Gerald	3
Puhan, Gautam	2
Davenport, Ernest C., Jr.	1
Frary, Robert B.	1
Gupta, Shaloo	1
Haberman, Shelby	1
Han, Kyung T.	1
Irvin, P. Shawn	1
Lai, Cheng-Fei	1
Larkin, Kevin	1
Liu, Kimy	1
Livingston, Samuel A.	1
Meijer, Rob R.	1
Park, Bitnara Jasmine	1
Sinharay, Sandip	1
Sotaridona, Leonardo S.	1
Zimmerman, Donald W.	1
Zopluoglu, Cengiz	1
van der Linden, Wim J.	1
vonDavier, Alina	1
More ▼