Yangqiuting Li; Chandralekha Singh – Physical Review Physics Education Research, 2025
Research-based multiple-choice questions implemented in class with peer instruction have been shown to be an effective tool for improving students' engagement and learning outcomes. Moreover, multiple-choice questions that are carefully sequenced to build on each other can be particularly helpful for students to develop a systematic understanding…
Descriptors: Physics, Science Instruction, Science Tests, Multiple Choice Tests
Nurussaniah Nurussaniah; Punaji Setyosari; Dedi Kuswandi; Saida Ulfa – Journal of Baltic Science Education, 2025
The accurate assessment of analytical thinking in physics, particularly in magnetism, poses substantial challenges due to the limitations of conventional tools in measuring higher-order cognitive skills. This study aimed to validate an analytical skills test in physics, based on Bloom's Revised Taxonomy, with an emphasis on the dimensions of…
Descriptors: Physics, Science Tests, Science Instruction, Thinking Skills
Xin Wei – Journal of Special Education Technology, 2025
This empirical research study investigates the relationship between the utilization of Universal Design (UD) elements and math performance among eighth graders. We analyzed 2017 National Assessment of Educational Progress process data using Poisson Generalized Linear Mixed-Effects Models to examine how the frequency of UD element usage varies…
Descriptors: Mathematics Achievement, Grade 8, Student Diversity, Students with Disabilities
Lee, Sora; Bolt, Daniel M. – Journal of Educational Measurement, 2018
Both the statistical and interpretational shortcomings of the three-parameter logistic (3PL) model in accommodating guessing effects on multiple-choice items are well documented. We consider the use of a residual heteroscedasticity (RH) model as an alternative, and compare its performance to the 3PL with real test data sets and through simulation…
Descriptors: Statistical Analysis, Models, Guessing (Tests), Multiple Choice Tests
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
Fakhoury, Hana M. A.; Fatoum, Hanaa A.; Aldeiry, MHD Amer; Alahmad, Hawazen; Enabi, Joud; Kayali, Sara; Bawahab, Yahya; Masuadi, Emad M.; Obeidat, Akef; Lumsden, Colin James – Biochemistry and Molecular Biology Education, 2021
The flipped classroom has gained prominence in higher education, but little has been written about its application in the Middle East. This study aimed to assess the feasibility, acceptability, and impact of flipping biochemistry classes in comparison to the traditional didactic program. The study was conducted on first-year medical students…
Descriptors: Foreign Countries, Flipped Classroom, Medical Education, Medical Students
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Krell, Moritz; Khan, Samia; van Driel, Jan – Education Sciences, 2021
The development and evaluation of valid assessments of scientific reasoning are an integral part of research in science education. In the present study, we used the linear logistic test model (LLTM) to analyze how item features related to text complexity and the presence of visual representations influence the overall item difficulty of an…
Descriptors: Cognitive Processes, Difficulty Level, Science Tests, Logical Thinking
Kim, Seonghoon; Kolen, Michael J. – Applied Measurement in Education, 2019
In applications of item response theory (IRT), fixed parameter calibration (FPC) has been used to estimate the item parameters of a new test form on the existing ability scale of an item pool. The present paper presents an application of FPC to multiple examinee groups test data that are linked to the item pool via anchor items, and investigates…
Descriptors: Item Response Theory, Item Banks, Test Items, Computation
Wang, Lin – ETS Research Report Series, 2019
Rearranging response options in different versions of a test of multiple-choice items can be an effective strategy against cheating on the test. This study investigated if rearranging response options would affect item performance and test score comparability. A study test was assembled as the base version from which 3 variant versions were…
Descriptors: Multiple Choice Tests, Test Items, Test Format, Scores
Leo, J.; Kurdi, G.; Matentzoglu, N.; Parsia, B.; Sattler, U.; Forge, S.; Donato, G.; Dowling, W. – International Journal of Artificial Intelligence in Education, 2019
Designing good multiple choice questions (MCQs) for education and assessment is time consuming and error-prone. An abundance of structured and semi-structured data has led to the development of automatic MCQ generation methods. Recently, ontologies have emerged as powerful tools to enable the automatic generation of MCQs. However, current question…
Descriptors: Multiple Choice Tests, Test Items, Automation, Test Construction
Lee, Yi-Hsuan; Haberman, Shelby J.; Dorans, Neil J. – Journal of Educational Measurement, 2019
In many educational tests, both multiple-choice (MC) and constructed-response (CR) sections are used to measure different constructs. In many common cases, security concerns lead to the use of form-specific CR items that cannot be used for equating test scores, along with MC sections that can be linked to previous test forms via common items. In…
Descriptors: Scores, Multiple Choice Tests, Test Items, Responses
Çiftçi, Sabahattin – International Electronic Journal of Elementary Education, 2019
Open-ended exams and multiple-choice exams are two types of examinations that are highly preferred in educational sciences. They have several advantages in terms of their characteristics, and they also have some limitations. These advantages and limitations affect the use of these exams both in national exams and in the exams administered by…
Descriptors: Multiple Choice Tests, Test Format, Preservice Teachers, Figurative Language
Pelanek, Radek – IEEE Transactions on Learning Technologies, 2020
Learning systems can utilize many practice exercises, ranging from simple multiple-choice questions to complex problem-solving activities. In this article, we propose a classification framework for such exercises. The framework classifies exercises in three main aspects: (1) the primary type of interaction; (2) the presentation mode; and (3) the…
Descriptors: Integrated Learning Systems, Classification, Multiple Choice Tests, Problem Solving
Cong, Xin; Zhang, Yan; Xu, Hai; Liu, Li-Mei; Zheng, Ming; Xiang, Ruo-Lan; Wang, Jin-Yu; Jia, Shi; Cai, Jing-Yi; Liu, Cheng; Wu, Li-Ling – Advances in Physiology Education, 2020
Current interdisciplinary medical training calls for reforms and innovations in the assessment of pathophysiology education. Formative assessment is used to monitor student learning to provide ongoing feedback that can improve both learning and teaching. Beginning in 2016, we implemented a formative assessment composed of case-based…
Descriptors: Formative Evaluation, Psychophysiology, Student Attitudes, Case Method (Teaching Technique)

