Publication Date
In 2025 | 24 |
Since 2024 | 118 |
Since 2021 (last 5 years) | 456 |
Since 2016 (last 10 years) | 861 |
Since 2006 (last 20 years) | 1341 |
Audience
Practitioners | 195 |
Teachers | 159 |
Researchers | 92 |
Administrators | 49 |
Students | 34 |
Policymakers | 14 |
Parents | 12 |
Counselors | 2 |
Community | 1 |
Media Staff | 1 |
Support Staff | 1 |
Location
Canada | 62 |
Turkey | 57 |
Germany | 40 |
Australia | 35 |
United Kingdom | 35 |
Japan | 34 |
China | 32 |
United States | 32 |
California | 25 |
United Kingdom (England) | 25 |
Netherlands | 24 |
Filipe Manuel Vidal Falcão; Daniela S.M. Pereira; José Miguel Pêgo; Patrício Costa – Education and Information Technologies, 2024
Progress tests (PTs) are a popular type of longitudinal assessment used for evaluating clinical knowledge retention and lifelong learning in health professions education. Most PTs consist of multiple-choice questions (MCQs) whose development is costly and time-consuming. Automatic Item Generation (AIG) generates test items through algorithms,…
Descriptors: Automation, Test Items, Progress Monitoring, Medical Education
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine the effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, poststratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Emily Cantillon – ProQuest LLC, 2024
It has been widely recognized that a visual impairment can limit an individual's ability to learn through visual observations. This limited visual access could impact how the skills to access and recognize the world around them develop. However, when the visual impairment was brain-based, such as in Cortical/Cerebral Visual…
Descriptors: Visual Impairments, Children, Intelligence Tests, Scores
Blair Lehman; Jesse R. Sparks; Jonathan Steinberg – ETS Research Report Series, 2024
Over the last 20 years, many methods have been proposed to use process data (e.g., response time) to detect changes in engagement during the test-taking process. However, many of these methods were developed and evaluated in highly similar testing contexts: 30 or more single-select multiple-choice items presented in a linear, fixed sequence in…
Descriptors: National Competency Tests, Secondary School Mathematics, Secondary School Students, Mathematics Tests
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Wagle, Rhea; Dowdy, Erin; Furlong, Michael J.; Nylund-Gibson, Karen; Carter, Delwin; Hinton, Tameisha – Assessment for Effective Intervention, 2022
Schools are an essential setting for mental health supports and services for students. To support student well-being, schools engage in universal mental health screening to identify students in need of support and to provide surveillance data for district-wide or state-wide policy changes. Mental health data have been collected via anonymous and…
Descriptors: Mental Health, Screening Tests, Student Surveys, High School Students
Sen, Sedat – Creativity Research Journal, 2022
The purpose of this study was to estimate the overall reliability values for the scores produced by Runco Ideational Behavior Scale (RIBS) and explore the variability of RIBS score reliability across studies. To achieve this, a reliability generalization meta-analysis was carried out using the 86 Cronbach's alpha estimates obtained from 77 studies…
Descriptors: Generalization, Creativity, Meta Analysis, Higher Education
Ozge Ersan Cinar – ProQuest LLC, 2022
In educational tests, a group of questions related to a shared stimulus is called a testlet (e.g., a reading passage with multiple related questions). Use of testlets is very common in educational tests. Additionally, computerized adaptive testing (CAT) is a mode of testing where the test forms are created in real time tailoring to the test…
Descriptors: Test Items, Computer Assisted Testing, Adaptive Testing, Educational Testing
VanDerHeyden, Amanda M.; Codding, Robin; Solomon, Benjamin G. – Remedial and Special Education, 2023
Computer-based curriculum-based measurement (CBM) is a relatively common practice, but surprisingly few studies have examined the reliability of computer-based CBM. This study sought to examine the reliability of CBM administered via paper/pencil versus the computer. Twenty-one of 25 students in two third-grade classes (N = 21) participated in two…
Descriptors: Curriculum Based Assessment, Computer Assisted Testing, Test Format, Grade 3
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
McGuire, Michael J. – International Journal for the Scholarship of Teaching and Learning, 2023
College students in a lower-division psychology course made metacognitive judgments by predicting and postdicting performance for true-false, multiple-choice, and fill-in-the-blank question sets on each of three exams. This study investigated which question format would result in the most accurate metacognitive judgments. Extending Koriat's (1997)…
Descriptors: Metacognition, Multiple Choice Tests, Accuracy, Test Format
Davis, Robert O.; Park, Taejung; Vincent, Joseph – Journal of Educational Computing Research, 2023
Over the past decade, meta-analyses have been conducted to evaluate the impact of embodied pedagogical agents on learning outcomes. Most review studies have evaluated learning outcomes from the perspective of testing labels such as transfer, retention, recognition, free recall, and various other classifications. This is problematic, because…
Descriptors: Meta Analysis, Test Format, Teaching Methods, Outcomes of Education
Zhao, Wenbo; Li, Jiaojiao; Shanks, David R.; Li, Baike; Hu, Xiao; Yang, Chunliang; Luo, Liang – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2023
Making metamemory judgments reactively changes item memory itself. Here we report the first investigation of reactive influences of making judgments of learning (JOLs) on interitem relational memory--specifically, temporal (serial) order memory. Experiment 1 found that making JOLs impaired order reconstruction. Experiment 2 observed minimal…
Descriptors: Metacognition, Memory, Meta Analysis, Recall (Psychology)
Wee Chun Tan – Discover Education, 2023
Despite the importance of the PhD viva in assessing the quality of doctoral research, how examiners approach the PhD viva remains underexplored in the Global South. This study fills this gap by investigating the conceptions of doctoral examiners in Malaysia, shedding light on how they approach the PhD viva and what they believe its key purposes…
Descriptors: Doctoral Students, Student Evaluation, Oral Language, Test Format
Benjamin Sorenson; Kenneth Hanson – Journal of Chemical Education, 2023
In spring 2020, the chemical education community faced an abrupt transition from in-person to online classes, which also necessitated online assessments. Building upon an existing three-semester study (F17, S19, and F19) using Rasch modeling and classical test theory to improve in-person multiple-choice exams, this study investigates the impact…
Descriptors: Undergraduate Study, Chemistry, COVID-19, Pandemics