Publication Date
In 2025: 1
Since 2024: 1
Since 2021 (last 5 years): 9
Since 2016 (last 10 years): 18
Since 2006 (last 20 years): 35
Descriptor
Comparative Analysis: 53
Difficulty Level: 53
Test Format: 53
Test Items: 37
Foreign Countries: 18
Item Response Theory: 17
Multiple Choice Tests: 17
Item Analysis: 15
Computer Assisted Testing: 14
Language Tests: 13
Scores: 11
Author
Allen, Nancy L.: 2
DeBoer, George E.: 2
Hardcastle, Joseph: 2
Herrmann-Abell, Cari F.: 2
Kim, Sooyeon: 2
Wainer, Howard: 2
Alpayar, Cagla: 1
Apino, Ezi: 1
Babiar, Tasha Calvert: 1
Baghaei, Purya: 1
Batty, Aaron Olaf: 1
Publication Type
Reports - Research: 44
Journal Articles: 34
Speeches/Meeting Papers: 15
Reports - Evaluative: 5
Tests/Questionnaires: 3
Dissertations/Theses -…: 2
Collected Works - Proceedings: 1
Information Analyses: 1
Education Level
Higher Education: 11
Postsecondary Education: 11
Secondary Education: 5
High Schools: 3
Elementary Education: 2
Elementary Secondary Education: 1
Grade 12: 1
Grade 8: 1
Intermediate Grades: 1
Middle Schools: 1
Location
Indonesia: 2
Japan: 2
Spain: 2
United Kingdom (England): 2
Canada: 1
China: 1
Europe: 1
Germany: 1
India: 1
Iran: 1
Macau: 1
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone for interpreting PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E. – Grantee Submission, 2022
As implementation of the "Next Generation Science Standards" moves forward, there is a need for new assessments that can measure students' integrated three-dimensional science learning. The National Research Council has suggested that these assessments be multicomponent tasks that utilize a combination of item formats including…
Descriptors: Multiple Choice Tests, Conditioning, Test Items, Item Response Theory
Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023
Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents, and the number of ability levels, this study aims to provide a closed formula for adaptive tests of medium difficulty (probability of a correct response p = 1/2) to determine the accuracy of the parameters for each item and in…
Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level
Musa Adekunle Ayanwale – Discover Education, 2023
Examination scores obtained by students from the West African Examinations Council (WAEC) and the National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…
Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify shortening tests whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
Yangqiuting Li; Chandralekha Singh – Physical Review Physics Education Research, 2025
Research-based multiple-choice questions implemented in class with peer instruction have been shown to be an effective tool for improving students' engagement and learning outcomes. Moreover, multiple-choice questions that are carefully sequenced to build on each other can be particularly helpful for students to develop a systematic understanding…
Descriptors: Physics, Science Instruction, Science Tests, Multiple Choice Tests
Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E. – Grantee Submission, 2019
The "Next Generation Science Standards" calls for new assessments that measure students' integrated three-dimensional science learning. The National Research Council has suggested that these assessments utilize a combination of item formats including constructed-response and multiple-choice. In this study, students were randomly assigned…
Descriptors: Science Tests, Multiple Choice Tests, Test Format, Test Items
Yao, Don – English Language Teaching, 2020
Computer-based tests (CBT) and paper-based tests (PBT) are two test modes that have been widely adopted in the field of language testing and assessment over the last few decades. Given the rapid development of science and technology, universities and educational institutions are striving to deliver the…
Descriptors: Language Tests, Computer Assisted Testing, Test Format, Comparative Analysis
Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022
Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to better interact with the test environment compared to traditional multiple-choice items (MCIs). The interactive nature of TEIs offers improved construct coverage compared with MCIs, but little research exists regarding students' performance on…
Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)
Liao, Linyu – English Language Teaching, 2020
As a high-stakes standardized test, IELTS is expected to have comparable forms of test papers so that test takers from different test administrations on different dates receive comparable test scores. Therefore, this study examined the text difficulty and task characteristics of four parallel academic IELTS reading tests to reveal to what extent…
Descriptors: Second Language Learning, English (Second Language), Language Tests, High Stakes Tests
Gu, Lin; Ling, Guangming; Liu, Ou Lydia; Yang, Zhitong; Li, Guirong; Kardanova, Elena; Loyalka, Prashant – Assessment & Evaluation in Higher Education, 2021
We examine the effects of computer-based versus paper-based assessment of critical thinking skills, adapted from English (in the U.S.) to Chinese. Using data collected based on a random assignment between the two modes in multiple Chinese colleges, we investigate mode effects from multiple perspectives: mean scores, measurement precision, item…
Descriptors: Critical Thinking, Tests, Test Format, Computer Assisted Testing
Yeager, Rebecca; Meyer, Zachary – International Journal of Listening, 2022
This study investigates the effects of adding stem preview to an English for Academic Purposes (EAP) multiple-choice listening assessment. In stem preview, listeners may view the item stems, but not response options, before listening. Previous research indicates that adding preview to an exam typically decreases difficulty, but raises concerns…
Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Teaching Methods
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Elfiondri; Kasim, Usman; Mustafa, Faisal; Putra, Tomi Mandala – TESOL International Journal, 2020
Studies have shown that reading comprehension is the most difficult section of the Paper-Based Test (PBT) TOEFL. Therefore, this research aimed to identify which sub-skill in reading comprehension poses the greatest challenge for students and how this sub-skill correlates with other reading comprehension sub-skills. To achieve this purpose,…
Descriptors: Reading Comprehension, Second Language Learning, Language Tests, English (Second Language)
Madya, Suwarsih; Retnawati, Heri; Purnawan, Ari; Putro, Nur Hidayanto Pancoro Setyo; Apino, Ezi – TEFLIN Journal: A publication on the teaching and learning of English, 2019
This explorative-descriptive study set out to examine the equivalence among Test of English Proficiency (TOEP) forms, developed by the Indonesian Testing Service Centre (ITSC), which was co-founded by The Association for The Teaching of English as a Foreign Language in Indonesia (TEFLIN) and The Association of Psychology in Indonesia. Using a…
Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning