Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 83 |
| Since 2017 (last 10 years) | 219 |
| Since 2007 (last 20 years) | 444 |
Descriptor
| Comparative Analysis | 630 |
| Multiple Choice Tests | 630 |
| Foreign Countries | 218 |
| Teaching Methods | 179 |
| Test Items | 138 |
| Scores | 136 |
| Statistical Analysis | 130 |
| Second Language Learning | 98 |
| Undergraduate Students | 96 |
| Pretests Posttests | 95 |
| English (Second Language) | 94 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 232 |
| Postsecondary Education | 183 |
| Secondary Education | 95 |
| Elementary Education | 62 |
| High Schools | 49 |
| Middle Schools | 41 |
| Junior High Schools | 30 |
| Grade 8 | 19 |
| Grade 5 | 18 |
| Intermediate Grades | 17 |
| Grade 4 | 14 |
| More ▼ | |
Audience
| Practitioners | 6 |
| Researchers | 4 |
| Teachers | 3 |
| Administrators | 1 |
Location
| Turkey | 25 |
| Iran | 19 |
| Indonesia | 17 |
| Taiwan | 15 |
| Australia | 12 |
| Japan | 8 |
| United States | 8 |
| China | 7 |
| Florida | 6 |
| Germany | 6 |
| Belgium | 5 |
| More ▼ | |
Laws, Policies, & Programs
| Every Student Succeeds Act… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 2 |
Lee, Sora; Bolt, Daniel M. – Journal of Educational Measurement, 2018
Both the statistical and interpretational shortcomings of the three-parameter logistic (3PL) model in accommodating guessing effects on multiple-choice items are well documented. We consider the use of a residual heteroscedasticity (RH) model as an alternative, and compare its performance to the 3PL with real test data sets and through simulation…
Descriptors: Statistical Analysis, Models, Guessing (Tests), Multiple Choice Tests
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Yanjie Li; Cindy Brantmeier; Yanming Gao; Mike Strube – Reading in a Foreign Language, 2024
This experimental study examined the effect of answering strategic adjunct questions (AQs) on L2 reading comprehension and strategy use. Participants were 124 Chinese intermediate advanced EFL learners from a large public university in China. Of them, 24 and 100 participated in the pilot study and formal study, respectively. Participants read two…
Descriptors: Second Language Learning, Second Language Instruction, Questioning Techniques, Reading Strategies
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Kaman-Ertürk, Ayse; Gokgoz-Kurt, Burcu – Reading Matrix: An International Online Journal, 2023
Vocabulary learning constitutes an essential component of language learning and teaching. The type of input students receive is one of the factors that affect the pace and range of this learning. In the incidental vocabulary learning process, learners have been shown to benefit from exposure to a variety of input types to varying degrees. In this…
Descriptors: Vocabulary Development, Linguistic Input, Comparative Analysis, Learning Processes
Shakeel Mohammad Cassam Atchia; Dhavini Chummun; Saheel Luckho – Journal of Biological Education, 2024
This study uses a case study methodology to showcase the use of the DTSICM (Design thinking Strategy to Identify and Clear Misconceptions) model, which provides a design thinking approach to identify and clear a common misconception on photosynthesis, held by a sample of 27 A-level students. As a first stage, data collected through the…
Descriptors: Design, Thinking Skills, Scientific Concepts, Misconceptions
Dewey, Jessica; Hicks, Jenna; Schuchardt, Anita – CBE - Life Sciences Education, 2022
When conducting biological investigations, experts constantly integrate their conceptual and quantitative understanding of variation with the design and analysis of the investigation. This process is difficult for students, because curricula often treat these concepts as separate components. This study describes the effect of a curricular…
Descriptors: Biology, Science Instruction, Teaching Methods, Multiple Choice Tests
Kandaiah, Thiruchelvam; Latip, Siti Halijah – Journal of Science and Mathematics Education in Southeast Asia, 2022
Purpose: The aim of this paper is to study the use of FIS Response Analysis for Critical Thinking Assessment (FRACTA) method to assess critical thinking in STEM problem solving. The use of FIS (facts, ideas and solutions) chart as a tool to elicit student critical thinking responses and the method of scoring the responses are investigated. Method:…
Descriptors: Scoring, Critical Thinking, Feedback (Response), Credibility
Frizelle, Pauline; Thompson, Paul; Duta, Mihaela; Bishop, Dorothy V. M. – Language Learning, 2019
We examined the effect of two methods of assessment--multiple-choice sentence-picture matching and an animated sentence-verification task--on typically developing children's understanding of relative clauses. A sample of children between the ages of 3 years 6 months and 4 years 11 months took part in the study (N = 103). Results indicated that (a)…
Descriptors: Preschool Children, Testing, Syntax, Comparative Analysis
Gil Llinás, Julia; Tobaja Márquez, Luis Manuel – International Journal of Educational Methodology, 2023
This paper describes an experience based on the use of an active method in which students of a basic physics course prepare multiple choice questions (MCQs) to prepare for exams in the subject. The objective of the research was to provide the students with a method that would enhance their desire to learn physics, and consequently lead to an…
Descriptors: Teaching Methods, Physics, Science Instruction, Multiple Choice Tests
Subali, Bambang; Kumaidi; Aminah, Nonoh Siti – International Journal of Instruction, 2021
This research aims at comparing item characteristics of instruments for assessing the level of mastery in scientific method for elementary students as they were analyzed using Classical Test Theory (CTT) and Item Response Theory (IRT). The two analyses are usually done separately, for difference object, in this moment it was analyzed…
Descriptors: Test Items, Item Response Theory, Item Analysis, Comparative Analysis
Ladachart, Luecha; Radchanet, Visit; Phothong, Wilawan – Journal of Turkish Science Education, 2022
Design-based learning has been recognized by educational scholars as the key approach to science, technology, engineering, and mathematics (STEM) education at K-12 levels. However, it is unclear whether, and which dimensions of, design thinking mindsets support the conceptual learning of science. This quasi-experimental study aims to explore 37…
Descriptors: Design, Scientific Concepts, Concept Formation, Learning Processes
Polat, Murat – International Online Journal of Education and Teaching, 2022
Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…
Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

Peer reviewed
Direct link
