Showing 1 to 15 of 32 results
Peer reviewed | PDF on ERIC (full text)
Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025
This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…
Descriptors: Artificial Intelligence, Test Items, Automation, Test Format
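As a rough illustration of the kind of LLM-based automatic item generation (AIG) pipeline this review surveys, the sketch below shows a minimal generate-and-check loop. It is an assumption-laden example: `call_llm` is a hypothetical stand-in for a real chat-completion client, and the prompt wording and JSON schema are invented here, not taken from any of the reviewed studies.

```python
# Illustrative sketch only: a minimal LLM-driven AIG loop.
# `call_llm`, the prompt, and the JSON schema are assumptions for illustration.
import json


def call_llm(prompt: str) -> str:
    # Placeholder: a real implementation would send `prompt` to an LLM API
    # and return the model's raw text response.
    return json.dumps({
        "stem": "Which process converts light energy into chemical energy?",
        "options": ["Respiration", "Photosynthesis", "Fermentation", "Osmosis"],
        "answer_index": 1,
    })


def generate_mcq(topic: str, difficulty: str) -> dict:
    prompt = (
        "Write one multiple-choice question as JSON with keys "
        '"stem", "options" (exactly 4), and "answer_index".\n'
        f"Topic: {topic}\nDifficulty: {difficulty}"
    )
    item = json.loads(call_llm(prompt))
    assert len(item["options"]) == 4  # structural check before human review
    return item


print(generate_mcq("photosynthesis", "easy"))
```

In practice the generated items would still pass through content-expert review, which is a recurring theme in the AIG literature this review covers.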
Peer reviewed | Direct link
Harpreet Auby; Namrata Shivagunde; Vijeta Deshpande; Anna Rumshisky; Milo D. Koretsky – Journal of Engineering Education, 2025
Background: Analyzing student short-answer written justifications to conceptually challenging questions has proven helpful to understand student thinking and improve conceptual understanding. However, qualitative analyses are limited by the burden of analyzing large amounts of text. Purpose: We apply dense and sparse Large Language Models (LLMs)…
Descriptors: Student Evaluation, Thinking Skills, Test Format, Cognitive Processes
Peer reviewed | Direct link
Brian E. Clauser; Victoria Yaneva; Peter Baldwin; Le An Ha; Janet Mee – Applied Measurement in Education, 2024
Multiple-choice questions have become ubiquitous in educational measurement because the format allows for efficient and accurate scoring. Nonetheless, there remains continued interest in constructed-response formats. This interest has driven efforts to develop computer-based scoring procedures that can accurately and efficiently score these items.…
Descriptors: Computer Uses in Education, Artificial Intelligence, Scoring, Responses
Peer reviewed | Direct link
Zesch, Torsten; Horbach, Andrea; Zehner, Fabian – Educational Measurement: Issues and Practice, 2023
In this article, we systematize the factors influencing performance and feasibility of automatic content scoring methods for short text responses. We argue that performance (i.e., how well an automatic system agrees with human judgments) mainly depends on the linguistic variance seen in the responses and that this variance is indirectly influenced…
Descriptors: Influences, Academic Achievement, Feasibility Studies, Automation
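The abstract above argues that scoring performance depends mainly on the linguistic variance of the responses. The sketch below shows two crude proxies one could compute for a response set; the metric names and the toy data are assumptions for illustration, not the authors' method.

```python
# Illustrative sketch only: rough proxies for the "linguistic variance"
# of a set of short responses. Metrics and example data are invented here.
def normalize(response: str) -> str:
    return " ".join(response.lower().split())


def variance_profile(responses: list[str]) -> dict:
    normalized = [normalize(r) for r in responses]
    tokens = [tok for r in normalized for tok in r.split()]
    distinct_ratio = len(set(normalized)) / len(normalized)  # answer diversity
    type_token_ratio = len(set(tokens)) / len(tokens)        # vocabulary spread
    return {
        "distinct_response_ratio": round(distinct_ratio, 3),
        "type_token_ratio": round(type_token_ratio, 3),
    }


answers = [
    "photosynthesis uses light",
    "Light drives photosynthesis",
    "plants use sunlight to make sugar",
    "photosynthesis uses light",
]
print(variance_profile(answers))
```

Higher values on either proxy would suggest more varied responses, which, following the article's argument, tends to make automatic content scoring harder.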
Peer reviewed | Direct link
Ulrike Padó; Yunus Eryilmaz; Larissa Kirschner – International Journal of Artificial Intelligence in Education, 2024
Short-Answer Grading (SAG) is a time-consuming task for teachers that automated SAG models have long promised to make easier. However, there are three challenges for their broad-scale adoption: a technical challenge regarding the need for high-quality models, which is exacerbated for languages with fewer resources than English; a usability…
Descriptors: Grading, Automation, Test Format, Computer Assisted Testing
Peer reviewed | Direct link
Filip Moons; Paola Iannone; Ellen Vandervieren – ZDM: Mathematics Education, 2024
Handwritten tasks are better suited than digital ones to assess higher-order mathematics skills, as students can express themselves more freely. However, maintaining reliability and providing feedback can be challenging when assessing high-stakes, handwritten mathematics exams involving multiple assessors. This paper discusses a new semi-automated…
Descriptors: Grading, Mathematics Tests, Handwriting, Test Format
Peer reviewed | PDF on ERIC (full text)
McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022
This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…
Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing
Peer reviewed | Direct link
Han, Chao – Language Testing, 2022
Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…
Descriptors: Translation, Language Tests, Testing, Evaluation Methods
Peer reviewed | Direct link
Filipe Manuel Vidal Falcão; Daniela S.M. Pereira; José Miguel Pêgo; Patrício Costa – Education and Information Technologies, 2024
Progress tests (PT) are a popular type of longitudinal assessment used for evaluating clinical knowledge retention and lifelong learning in health professions education. Most PTs consist of multiple-choice questions (MCQs) whose development is costly and time-consuming. Automatic Item Generation (AIG) generates test items through algorithms,…
Descriptors: Automation, Test Items, Progress Monitoring, Medical Education
Peer reviewed | Direct link
Ivan D. Mardini G.; Christian G. Quintero M.; César A. Viloria N.; Winston S. Percybrooks B.; Heydy S. Robles N.; Karen Villalba R. – Education and Information Technologies, 2024
Today, reading comprehension is considered an essential skill in modern life; therefore, higher education students require more specific skills to understand, interpret, and evaluate texts effectively. Short answer questions (SAQs) are one of the relevant and proper tools for assessing reading comprehension skills. Unlike multiple-choice questions,…
Descriptors: Reading Comprehension, Reading Tests, Learning Strategies, Grading
Peer reviewed | Direct link
Sahu, Archana; Bhowmick, Plaban Kumar – IEEE Transactions on Learning Technologies, 2020
In this paper, we studied different automatic short answer grading (ASAG) systems to provide a comprehensive view of the feature spaces explored by previous works. While the performance reported in previous works has been encouraging, systematic study of the features is lacking. Apart from providing systematic feature space exploration, we also…
Descriptors: Automation, Grading, Test Format, Artificial Intelligence
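To make the idea of an ASAG "feature space" concrete, the sketch below computes a few simple surface features comparing a student answer with a reference answer. The feature names and the toy scoring setup are assumptions for illustration, not features taken from the survey above.

```python
# Illustrative sketch only: simple surface features of the kind explored in
# automatic short answer grading (ASAG) studies. Names are invented here.
def tokens(text: str) -> set[str]:
    return set(text.lower().split())


def overlap_features(student: str, reference: str) -> dict:
    s, r = tokens(student), tokens(reference)
    jaccard = len(s & r) / len(s | r) if s | r else 0.0  # lexical overlap
    recall = len(s & r) / len(r) if r else 0.0           # reference coverage
    length_ratio = len(student.split()) / max(len(reference.split()), 1)
    return {"jaccard": jaccard, "reference_recall": recall, "length_ratio": length_ratio}


feats = overlap_features(
    student="the mitochondria makes energy for the cell",
    reference="the mitochondrion produces energy (ATP) for the cell",
)
print(feats)  # such features would typically feed a classifier or regressor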
Peer reviewed | Direct link
Paiva, José Carlos; Leal, José Paulo; Figueira, Álvaro – ACM Transactions on Computing Education, 2022
Practical programming competencies are critical to success in computer science (CS) education and to fresh graduates' entry into the job market. Acquiring the required level of skills is a long journey of discovery, trial and error, and optimization seeking through a broad range of programming activities that learners must perform themselves. It is not…
Descriptors: Automation, Computer Assisted Testing, Student Evaluation, Computer Science Education
Peer reviewed | PDF on ERIC (full text)
Ayfer Sayin; Sabiha Bozdag; Mark J. Gierl – International Journal of Assessment Tools in Education, 2023
The purpose of this study is to generate non-verbal items for a visual reasoning test using template-based automatic item generation (AIG). The fundamental research method involved following the three stages of template-based AIG. An item from the 2016 4th-grade entrance exam of the Science and Art Center (known as BILSEM) was chosen as the…
Descriptors: Test Items, Test Format, Nonverbal Tests, Visual Measures
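The core move in template-based AIG is to fill an item model's slots with admissible values to enumerate item variants. The sketch below shows that step with a simple numeric template; the template, slot values, and distractor rule are invented for illustration (the study above works with non-verbal BILSEM content, not this example).

```python
# Illustrative sketch only: filling an item template with slot values to
# enumerate variants, the generation step of template-based AIG.
from itertools import product

TEMPLATE = ("A sequence starts at {start} and increases by {step} each time. "
            "What is the 4th term?")


def generate_items(starts, steps):
    items = []
    for start, step in product(starts, steps):  # enumerate slot combinations
        answer = start + 3 * step               # 4th term of the sequence
        distractors = [answer - step, answer + step, start + 4 * step]
        items.append({
            "stem": TEMPLATE.format(start=start, step=step),
            "key": answer,
            "options": sorted({answer, *distractors}),
        })
    return items


for item in generate_items(starts=[2, 5], steps=[3, 4]):
    print(item["stem"], item["options"], "key:", item["key"])
```

Each slot combination yields a distinct but structurally equivalent item, which is what makes template-based AIG attractive for building large item banks.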
Peer reviewed | Direct link
Shin, Jinnie; Gierl, Mark J. – International Journal of Testing, 2022
Over the last five years, tremendous strides have been made in advancing the AIG methodology required to produce items in diverse content areas. However, the one content area where enormous problems remain unsolved is language arts, generally, and reading comprehension, more specifically. While reading comprehension test items can be created using…
Descriptors: Reading Comprehension, Test Construction, Test Items, Natural Language Processing
Peer reviewed | Direct link
Li, Jie; van der Linden, Wim J. – Journal of Educational Measurement, 2018
The final step of the typical process of developing educational and psychological tests is to place the selected test items in a formatted test form. The step involves the grouping and ordering of the items to meet a variety of formatting constraints. As this activity tends to be time-intensive, the use of mixed-integer programming (MIP) has been…
Descriptors: Programming, Automation, Test Items, Test Format
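To give a flavour of how MIP is used for this kind of assembly problem, the sketch below selects items for a small form subject to length and content constraints. It uses the PuLP library as an assumed solver interface; the item pool, information values, and constraints are invented for illustration and are not the authors' model.

```python
# Illustrative sketch only: a toy mixed-integer programme for assembling a
# test form. Pool, objective, and constraints are invented; PuLP is assumed.
from pulp import LpProblem, LpVariable, LpMaximize, lpSum

pool = {  # item id -> (information value, content area)
    "i1": (0.9, "algebra"), "i2": (0.7, "algebra"),
    "i3": (0.8, "geometry"), "i4": (0.6, "geometry"), "i5": (0.5, "algebra"),
}

x = {i: LpVariable(f"x_{i}", cat="Binary") for i in pool}  # 1 = item selected
prob = LpProblem("form_assembly", LpMaximize)
prob += lpSum(pool[i][0] * x[i] for i in pool)             # maximise information
prob += lpSum(x.values()) == 3                              # fixed form length
prob += lpSum(x[i] for i in pool if pool[i][1] == "algebra") >= 1   # content coverage
prob += lpSum(x[i] for i in pool if pool[i][1] == "geometry") >= 1

prob.solve()
print([i for i in pool if x[i].value() == 1])  # items chosen for the form
```

Real assembly models of the kind the article refers to add many more constraint types (ordering, enemy items, page layout), but the binary-selection structure is the same.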