Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 9 |
| Since 2007 (last 20 years) | 25 |
Descriptor
| Comparative Analysis | 43 |
| Evaluation Methods | 43 |
| Multiple Choice Tests | 43 |
| Foreign Countries | 10 |
| Student Evaluation | 10 |
| College Students | 8 |
| Higher Education | 8 |
| Scores | 7 |
| Teaching Methods | 7 |
| Scoring | 6 |
| Test Construction | 6 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 12 |
| Postsecondary Education | 8 |
| Elementary Education | 6 |
| Secondary Education | 5 |
| High Schools | 3 |
| Middle Schools | 3 |
| Grade 4 | 2 |
| Grade 5 | 2 |
| Grade 6 | 2 |
| Grade 8 | 2 |
| Junior High Schools | 2 |
| More ▼ | |
Audience
| Researchers | 1 |
Location
| Australia | 3 |
| Malaysia | 2 |
| Sweden | 2 |
| Illinois | 1 |
| Illinois (Chicago) | 1 |
| Iran | 1 |
| Israel | 1 |
| Netherlands | 1 |
| New York | 1 |
| South Korea (Seoul) | 1 |
| Spain | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| State of Texas Assessments of… | 1 |
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Walter M. Stroup; Anthony Petrosino; Corey Brady; Karen Duseau – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
Tests of statistical significance often play a decisive role in establishing the empirical warrant of evidence-based research in education. The results from pattern-based assessment items, as introduced in this paper, are categorical and multimodal and do not immediately support the use of measures of central tendency as typically related to…
Descriptors: Statistical Significance, Comparative Analysis, Research Methodology, Evaluation Methods
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Kandaiah, Thiruchelvam; Latip, Siti Halijah – Journal of Science and Mathematics Education in Southeast Asia, 2022
Purpose: The aim of this paper is to study the use of FIS Response Analysis for Critical Thinking Assessment (FRACTA) method to assess critical thinking in STEM problem solving. The use of FIS (facts, ideas and solutions) chart as a tool to elicit student critical thinking responses and the method of scoring the responses are investigated. Method:…
Descriptors: Scoring, Critical Thinking, Feedback (Response), Credibility
Krall, Geoff – Canadian Journal of Science, Mathematics and Technology Education, 2023
In order to identify the potential benefits and challenges of implementing student portfolios as quality mathematics assessment, a pilot study was conducted with teachers in various secondary school settings. The multi-case study consisted of five teacher participants from geographically and demographically differing contexts, four in the USA and…
Descriptors: Portfolio Assessment, Mathematics Instruction, Evaluation Methods, Pilot Projects
Malec, Wojciech; Krzeminska-Adamek, Malgorzata – Practical Assessment, Research & Evaluation, 2020
The main objective of the article is to compare several methods of evaluating multiple-choice options through classical item analysis. The methods subjected to examination include the tabulation of choice distribution, the interpretation of trace lines, the point-biserial correlation, the categorical analysis of trace lines, and the investigation…
Descriptors: Comparative Analysis, Evaluation Methods, Multiple Choice Tests, Item Analysis
Bachelor, Jeremy W. – Language Learning Journal, 2022
The objective of this study was to employ a new tool for assessing pragmatic "proficiency" (over "performance") in L2 students and to determine if pragmatic lessons on compliment sequences had a positive impact on L2 Spanish students' intercultural competence. To this end, students at a Midwestern college engaged in live video…
Descriptors: Pragmatics, Second Language Learning, Second Language Instruction, Teaching Methods
Goncher, Andrea M.; Boles, Wageeh – European Journal of Engineering Education, 2019
Concept inventories (CIs) are assessment instruments designed to measure students' conceptual understanding of fundamental concepts in particular fields. CIs utilise multiple-choice questions (MCQs), and specifically designed response selections, to help identify misconceptions. One shortcoming of this assessment instrument is that it fails to…
Descriptors: Engineering Education, Misconceptions, Concept Formation, Evaluation Methods
Carlie, Johanna; Sahlén, Birgitta; Nirme, Jens; Andersson, Ketty; Rudner, Mary; Johansson, Roger; Gulz, Agneta; Brännström, K. Jonas – Journal of Speech, Language, and Hearing Research, 2021
Purpose: This study reports on the development of an auditory passage comprehension task for Swedish primary school children of cultural and linguistic diversity. It also reports on their performance on the task in quiet and in noise. Method: Eighty-eight children aged 7-9 years and showing normal hearing participated. The children were divided…
Descriptors: Foreign Countries, Swedish, Elementary School Students, Second Language Learning
Won, Mihye; Krabbe, Heiko; Ley, Siv Ling; Treagust, David F.; Fischer, Hans E. – Educational Assessment, 2017
In this study, we investigated the value of a concept map marking guide as an alternative formative assessment tool for science teachers to adopt for the topic of energy. Eight high school science teachers marked students' concept maps using an itemized holistic marking guide. Their marking was compared with the researchers' marking and the scores…
Descriptors: Science Teachers, Science Instruction, Concept Mapping, Formative Evaluation
He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei – Applied Psychological Measurement, 2013
Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…
Descriptors: Regression (Statistics), Item Response Theory, Test Items, Equated Scores
Chen, I-Jung – Computer Assisted Language Learning, 2016
This study compared how three different gloss modes affected college students' L2 reading comprehension and vocabulary acquisition. The study also compared how results on comprehension and vocabulary acquisition may differ depending on the four assessment methods used. A between-subjects design was employed with three groups of Mandarin-speaking…
Descriptors: Reading Comprehension, Multivariate Analysis, Scores, Multiple Choice Tests
Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Applied Psychological Measurement, 2011
Nonequivalent groups with anchor test (NEAT) equating functions that use a single anchor can have accuracy problems when the groups are extremely different and/or when the anchor weakly correlates with the tests being equated. Proposals have been made to address these issues by incorporating more than one anchor into NEAT equating functions. These…
Descriptors: Equated Scores, Tests, Comparative Analysis, Correlation
Nixon, Ryan S.; Smith, Leigh K.; Wimmer, Jennifer J. – School Science and Mathematics, 2015
This quasi-experimental study investigated how explicit instruction about multiple modes of representation (MMR) impacted grades 7 (n = 61) and 8 (n = 141) students' learning and multimodal use on end-of-unit assessments. Half of each teacher's (n = 3) students received an intervention consisting of explicit instruction on MMR in science…
Descriptors: Quasiexperimental Design, Grade 7, Grade 8, Intervention
Heyborne, William H.; Clarke, Jennifer A.; Perrett, Jamis J. – Journal of College Science Teaching, 2011
Enrollment increases at many institutions have forced science faculty to reevaluate assessment decisions in light of increasing demands on time. Some have advocated the replacement of free-response examinations with forced-choice examinations as a time-saving strategy. The existing research literature contains many studies comparing student…
Descriptors: Evidence, Academic Achievement, Tests, Laboratories
Belov, Dmitry I. – Applied Psychological Measurement, 2011
This article presents the Variable Match Index (VM-Index), a new statistic for detecting answer copying. The power of the VM-Index relies on two-dimensional conditioning as well as the structure of the test. The asymptotic distribution of the VM-Index is analyzed by reduction to Poisson trials. A computational study comparing the VM-Index with the…
Descriptors: Cheating, Journal Articles, Computation, Comparative Analysis

Peer reviewed
Direct link
