ERIC - Search Results

Showing 1 to 15 of 32 results

Item Response Theory, Computer Adaptive Testing and the Risk of Self-Deception

Benton, Tom – Research Matters, 2021

Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…

Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level

A Framework for Evaluating Stopping Rules for Fixed-Form Formative Assessments: Balancing Efficiency and Reliability

Peer reviewed

Basaraba, Deni L.; Yovanoff, Paul; Shivraj, Pooja; Ketterlin-Geller, Leanne R. – Practical Assessment, Research & Evaluation, 2020

Stopping rules for fixed-form tests with graduated item difficulty are intended to stop administration of a test at the point where students are sufficiently unlikely to provide a correct response following a pattern of incorrect responses. Although widely employed in fixed-form tests in education, little research has been done to empirically…

Descriptors: Formative Evaluation, Test Format, Test Items, Difficulty Level

The Use of Three-Option Multiple Choice Items for Classroom Assessment

Peer reviewed

Atalmis, Erkan Hasan – International Journal of Assessment Tools in Education, 2018

Although multiple-choice items (MCIs) are widely used for classroom assessment, designing MCIs with sufficient number of plausible distracters is very challenging for teachers. In this regard, previous empirical studies reveal that using three-option MCIs provides various advantages when compared to four-option MCIs due to less preparation and…

Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Test Reliability

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Computer-Based and Paper-and-Pencil Tests: A Study in Calculus for STEM Majors

Peer reviewed

Smolinsky, Lawrence; Marx, Brian D.; Olafsson, Gestur; Ma, Yanxia A. – Journal of Educational Computing Research, 2020

Computer-based testing is an expanding use of technology offering advantages to teachers and students. We studied Calculus II classes for science, technology, engineering, and mathematics majors using different testing modes. Three sections with 324 students employed: paper-and-pencil testing, computer-based testing, and both. Computer tests gave…

Descriptors: Test Format, Computer Assisted Testing, Paper (Material), Calculus

Analysis of Multiple-Choice versus Open-Ended Questions in Language Tests According to Different Cognitive Domain Levels

Peer reviewed

Polat, Murat – Novitas-ROYAL (Research on Youth and Language), 2020

Classroom practices, materials and teaching methods in language classes have changed a lot in the last decades and continue to evolve; however, the commonly used techniques to test students' foreign language skills have not changed much regardless of the recent awareness in Bloom's taxonomy. Testing units at schools rely mostly on multiple choice…

Descriptors: Multiple Choice Tests, Test Format, Test Items, Difficulty Level

Psychometric Report for the Early Fractions Test (Version 2.2) Administered with Third- and Fourth-Grade Students in Spring 2017. Research Report No. 2017-11

Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…

Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions

Validity and Reliability of Scores Obtained on Multiple-Choice Questions: Why Functioning Distractors Matter

Peer reviewed

Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G. – Journal of the Scholarship of Teaching and Learning, 2016

Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on validity and reliability of scores obtained on MCQs. Free-response (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…

Descriptors: Scores, Multiple Choice Tests, Test Reliability, Test Validity

A Comparison of Three Test Formats to Assess Word Difficulty

Peer reviewed

Culligan, Brent – Language Testing, 2015

This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…

Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary

Examination of Test and Item Statistics from Visual and Verbal Mathematics Questions

Peer reviewed

Alpayar, Cagla; Gulleroglu, H. Deniz – Educational Research and Reviews, 2017

The aim of this research is to determine whether students' test performance and approaches to test questions change based on the type of mathematics questions (visual or verbal) administered to them. This research is based on a mixed-design model. The quantitative data are gathered from 297 seventh grade students, attending seven different middle…

Descriptors: Foreign Countries, Middle School Students, Grade 7, Student Evaluation

The Impact of Sub-Skills and Item Content on Students' Skills with Regard to the Control-of-Variables Strategy

Peer reviewed

Schwichow, Martin; Christoph, Simon; Boone, William J.; Härtig, Hendrik – International Journal of Science Education, 2016

The so-called control-of-variables strategy (CVS) incorporates the important scientific reasoning skills of designing controlled experiments and interpreting experimental outcomes. As CVS is a prominent component of science standards appropriate assessment instruments are required to measure these scientific reasoning skills and to evaluate the…

Descriptors: Thinking Skills, Science Instruction, Science Experiments, Science Tests

Reliability and Levels of Difficulty of Objective Test Items in a Mathematics Achievement Test: A Study of Ten Senior Secondary Schools in Five Local Government Areas of Akure, Ondo State

Peer reviewed

Adebule, S. O. – Educational Research and Reviews, 2009

This study examined the reliability and difficulty indices of Multiple Choice (MC) and True or False (TF) types of objective test items in a Mathematics Achievement Test (MAT). The instruments used were two variants of a 50-item Mathematics Achievement Test based on the multiple choice and true or false test formats. A total of five hundred (500)…

Descriptors: Objective Tests, Mathematics Achievement, Achievement Tests, Test Reliability

A Comparison of the Item Difficulty and Item Discrimination of Multiple-Choice Items Using the "None of the Above" and One Correct Response Options.

Peer reviewed

Tollefson, Nona – Educational and Psychological Measurement, 1987

This study compared the item difficulty, item discrimination, and test reliability of three forms of multiple-choice items: (1) one correct answer; (2) "none of the above" as a foil; and (3) "none of the above" as the correct answer. Twelve items in the three formats were administered in a college statistics examination. (BS)

Descriptors: Difficulty Level, Higher Education, Item Analysis, Multiple Choice Tests

Some Advantages of Alternate-Choice Test Items.

Ebel, Robert L. – 1981

An alternate-choice test item is a simple declarative sentence, one portion of which is given with two different wordings. For example, "Foundations like Ford and Carnegie tend to be (1) eager (2) hesitant to support innovative solutions to educational problems." The examinee's task is to choose the alternative that makes the sentence…

Descriptors: Comparative Testing, Difficulty Level, Guessing (Tests), Multiple Choice Tests

Relative Effectiveness of Single and Double Multiple-Choice Questions in Educational Measurement.

Peer reviewed

Weiten, Wayne – Journal of Experimental Education, 1982

A comparison of double as opposed to single multiple-choice questions yielded significant differences in regard to item difficulty, item discrimination, and internal reliability, but not concurrent validity. (Author/PN)

Descriptors: Difficulty Level, Educational Testing, Higher Education, Multiple Choice Tests
