Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 30 |
| Since 2007 (last 20 years) | 46 |
Descriptor
| Error Patterns | 80 |
| Item Analysis | 80 |
| Test Items | 28 |
| Foreign Countries | 22 |
| Comparative Analysis | 18 |
| Test Construction | 15 |
| Difficulty Level | 12 |
| Item Response Theory | 11 |
| Second Language Learning | 11 |
| Task Analysis | 11 |
| Undergraduate Students | 11 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 22 |
| Postsecondary Education | 21 |
| Elementary Secondary Education | 5 |
| Secondary Education | 5 |
| High Schools | 4 |
| Elementary Education | 2 |
| Grade 7 | 2 |
| Grade 8 | 2 |
| Junior High Schools | 2 |
| Middle Schools | 2 |
| Grade 10 | 1 |
| More ▼ | |
Audience
| Practitioners | 2 |
| Researchers | 1 |
| Students | 1 |
Location
| Canada | 2 |
| Germany | 2 |
| Australia | 1 |
| Austria | 1 |
| China | 1 |
| Czech Republic | 1 |
| Hong Kong | 1 |
| Indiana | 1 |
| Iran | 1 |
| Japan | 1 |
| Mexico | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Zhang, Zhonghua – Journal of Experimental Education, 2022
Reporting standard errors of equating has been advocated as a standard practice when conducting test equating. The two most widely applied procedures for standard errors of equating including the bootstrap method and the delta method are either computationally intensive or confined to the derivations of complicated formulas. In the current study,…
Descriptors: Error of Measurement, Item Response Theory, True Scores, Equated Scores
Jiayi Deng – Large-scale Assessments in Education, 2025
Background: Test score comparability in international large-scale assessments (LSAs) is greatly important to ensure test fairness. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic version of test forms into a common score scale. An example is the multigroup…
Descriptors: Guessing (Tests), Item Response Theory, Error Patterns, Arabic
Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025
This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…
Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory
Kyeng Gea Lee; Mark J. Lee; Soo Jung Lee – International Journal of Technology in Education and Science, 2024
Online assessment is an essential part of online education, and if conducted properly, has been found to effectively gauge student learning. Generally, textbased questions have been the cornerstone of online assessment. Recently, however, the emergence of generative artificial intelligence has added a significant challenge to the integrity of…
Descriptors: Artificial Intelligence, Computer Software, Biology, Science Instruction
Salem, Alexandra C.; Gale, Robert; Casilio, Marianne; Fleegle, Mikala; Fergadiotis, Gerasimos; Bedrick, Steven – Journal of Speech, Language, and Hearing Research, 2023
Purpose: ParAlg (Paraphasia Algorithms) is a software that automatically categorizes a person with aphasia's naming error (paraphasia) in relation to its intended target on a picture-naming test. These classifications (based on lexicality as well as semantic, phonological, and morphological similarity to the target) are important for…
Descriptors: Semantics, Computer Software, Aphasia, Classification
Test of Understanding of Electric Field, Force, and Flux: A Reliable Multiple-Choice Assessment Tool
Eder Hernandez; Esmeralda Campos; Pablo Barniol; Genaro Zavala – Physical Review Physics Education Research, 2025
This study presents the development and validation of a novel multiple-choice test designed to assess university students' conceptual understanding of electric field, force, and flux. The test of understanding of electric field, force, and flux was constructed based on the results of previous studies using a phenomenographic approach to classify…
Descriptors: Physics, Scientific Concepts, Science Tests, Multiple Choice Tests
Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023
Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…
Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models
Zhang, Zhonghua; Zhao, Mingren – Journal of Educational Measurement, 2019
The present study evaluated the multiple imputation method, a procedure that is similar to the one suggested by Li and Lissitz (2004), and compared the performance of this method with that of the bootstrap method and the delta method in obtaining the standard errors for the estimates of the parameter scale transformation coefficients in item…
Descriptors: Item Response Theory, Error Patterns, Item Analysis, Simulation
Yu-Chin, Chiu – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2023
Recent context-control learning studies have shown that switch costs are reduced in a particular context predicting a high probability of switching as compared to another context predicting a low probability of switching. These context-specific switch probability effects suggest that control of task sets, through experience, can become associated…
Descriptors: Learning Processes, Prior Learning, Task Analysis, Cognitive Ability
Fröber, Kerstin; Jurczyk, Vanessa; Dreisbach, Gesine – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2022
Frequent forced switching between tasks has been shown to reduce switch costs and increase voluntary switch rates. So far, however, the boundary conditions of the influence of forced task switching on voluntary task switching are unknown. Thus, the present study was aimed to test different aspects of generalizability (across items, tasks, and…
Descriptors: Cognitive Ability, Attention Control, Task Analysis, Generalization
Son, Gaeun; Oh, Byung-Il; Kang, Min-Suk; Chong, Sang Chul – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2020
We investigated whether clustering based on feature similarity improves the representational quality of visual working memory (VWM). We hypothesized that similar items are organized into clusters, and their recall precision increases with fewer clusters because of reduced memory load. In a series of 6 experiments, participants remembered…
Descriptors: Visual Perception, Short Term Memory, Recall (Psychology), Cognitive Ability
Spinelli, Giacomo; Lupker, Stephen J. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2021
In the Stroop task, congruency effects (i.e., the color-naming latency difference between incongruent stimuli, e.g., the word BLUE written in the color red, and congruent stimuli, e.g., RED in red) are smaller in a list in which incongruent trials are frequent than in a list in which incongruent trials are infrequent. The traditional explanation…
Descriptors: Color, Interference (Learning), Visual Stimuli, Reaction Time
Yurtçu, Meltem; Güzeller, Cem Oktay – International Journal of Assessment Tools in Education, 2018
In this study purposes to indicate the effect of the number of DIF items and the distribution of DIF items in these forms, which be equalized on equating error. Mean-mean, mean-standard deviation, Haebara and Stocking-Lord Methods used in common item design equal groups as equalization methods. The study included six different simulation…
Descriptors: Error Patterns, Test Items, Item Analysis, Simulation
Farhat, Naha J.; Stanford, Courtney; Ruder, Suzanne M. – Journal of Chemical Education, 2019
Assessments can provide instructors and students with valuable information regarding student's level of knowledge and understanding, in order to improve both teaching and learning. In this study, we analyzed departmental assessment quizzes given to students at the start of Organic Chemistry 2, over an eight year period. This assessment quiz was…
Descriptors: Organic Chemistry, Teaching Methods, Science Instruction, Science Tests
Pelánek, Radek; Effenberger, Tomáš; Kukucka, Adam – Journal of Educational Data Mining, 2022
We study the automatic identification of educational items worthy of content authors' attention. Based on the results of such analysis, content authors can revise and improve the content of learning environments. We provide an overview of item properties relevant to this task, including difficulty and complexity measures, item discrimination, and…
Descriptors: Item Analysis, Identification, Difficulty Level, Case Studies

Peer reviewed
Direct link
