ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	7
Since 2007 (last 20 years)	18

Descriptor

Computer Software	19
Foreign Countries	19
Interrater Reliability	19
Comparative Analysis	10
Correlation	8
Computer Assisted Testing	7
Second Language Learning	7
English (Second Language)	6
Evaluation Methods	6
Evaluators	6
Scores	6
Artificial Intelligence	5
Educational Technology	5
Models	5
Scoring	5
Second Language Instruction	5
Writing Evaluation	5
College Students	4
Essays	4
Identification	4
Likert Scales	4
Statistical Analysis	4
Undergraduate Students	4
Accuracy	3
Case Studies	3
More ▼

Source

Education and Information…	2
International Association for…	2
ALT-J: Research in Learning…	1
Advances in Physiology…	1
Australasian Journal of…	1
ETS Research Report Series	1
English Language Teaching	1
Interactive Learning…	1
International Educational…	1
International Journal of…	1
Journal of Baltic Science…	1
Journal of Educational…	1
Journal of Interactive Online…	1
Journal of Teaching in…	1
Language Testing	1
ReCALL	1
SAGE Open	1
More ▼

Publication Type

Journal Articles	16
Reports - Research	15
Tests/Questionnaires	4
Reports - Evaluative	3
Speeches/Meeting Papers	2
Collected Works - Proceedings	1

Education Level

Higher Education	11
Postsecondary Education	10
Elementary Secondary Education	5
Secondary Education	5
High Schools	1
Middle Schools	1

Audience

Location

Netherlands	3
Singapore	3
China	2
Egypt	2
Germany	2
Hong Kong	2
Israel	2
Japan	2
Turkey	2
Asia	1
Australia	1
Brazil	1
Canada	1
Connecticut	1
Cuba	1
Denmark	1
Estonia	1
Florida	1
Greece	1
Hawaii	1
India	1
Ireland	1
Italy	1
Kazakhstan	1
Norway	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Torrance Tests of Creative…

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Graders of the Future: Comparing the Consistency and Accuracy of GPT4 and Pre-Service Teachers in Physics Essay Question Assessments

Peer reviewed
PDF on ERIC

Download full text

Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025

As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…

Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy

Revolutionising Essay Evaluation: A Cutting-Edge Rubric for AI-Assisted Writing

Peer reviewed

Direct link

Hassan Saleh Mahdi; Ahmed Alkhateeb – International Journal of Computer-Assisted Language Learning and Teaching, 2025

This study aims to develop a robust rubric for evaluating artificial intelligence (AI)--assisted essay writing in English as a Foreign Language (EFL) contexts. Employing a modified Delphi technique, we conducted a comprehensive literature review and administered Likert scale questionnaires. This process yielded nine key evaluation criteria,…

Descriptors: Scoring Rubrics, Essays, Writing Evaluation, Artificial Intelligence

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

The Use of Semantic Similarity Tools in Automated Content Scoring of Fact-Based Essays Written by EFL Learners

Peer reviewed

Direct link

Wang, Qiao – Education and Information Technologies, 2022

This study searched for open-source semantic similarity tools and evaluated their effectiveness in automated content scoring of fact-based essays written by English-as-a-Foreign-Language (EFL) learners. Fifty writing samples under a fact-based writing task from an academic English course in a Japanese university were collected and a gold standard…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring

Modeling Creativity in Visual Programming: From Theory to Practice

Peer reviewed
PDF on ERIC

Download full text

Kovalkov, Anastasia; Paassen, Benjamin; Segal, Avi; Gal, Kobi; Pinkwart, Niels – International Educational Data Mining Society, 2021

Promoting creativity is considered an important goal of education, but creativity is notoriously hard to define and measure. In this paper, we make the journey from defining a formal creativity and applying the measure in a practical domain. The measure relies on core theoretical concepts in creativity theory, namely fluency, flexibility, and…

Descriptors: Creativity, Theory Practice Relationship, Evaluators, Specialists

Comparison of Automatic and Expert Teachers' Rating of Computerized English Listening-Speaking Test

Peer reviewed
PDF on ERIC

Download full text

Linlin, Cao – English Language Teaching, 2020

Through Many-Facet Rasch analysis, this study explores the rating differences between 1 computer automatic rater and 5 expert teacher raters on scoring 119 students in a computerized English listening-speaking test. Results indicate that both automatic and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…

Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

Communication Skills Training Exploiting Multimodal Emotion Recognition

Peer reviewed

Direct link

Bahreini, Kiavash; Nadolski, Rob; Westera, Wim – Interactive Learning Environments, 2017

The teaching of communication skills is a labour-intensive task because of the detailed feedback that should be given to learners during their prolonged practice. This study investigates to what extent our FILTWAM facial and vocal emotion recognition software can be used for improving a serious game (the Communication Advisor) that delivers a…

Descriptors: Communication Skills, Skill Development, Training Methods, Computer Software

Development of a Rubric to Assess Academic Writing Incorporating Plagiarism Detectors

Peer reviewed

Direct link

Razi, Salim – SAGE Open, 2015

Similarity reports of plagiarism detectors should be approached with caution as they may not be sufficient to support allegations of plagiarism. This study developed a 50-item rubric to simplify and standardize evaluation of academic papers. In the spring semester of 2011-2012 academic year, 161 freshmen's papers at the English Language Teaching…

Descriptors: Foreign Countries, Scoring Rubrics, Writing Evaluation, Writing (Composition)

Towards Real-Time Speech Emotion Recognition for Affective E-Learning

Peer reviewed

Direct link

Bahreini, Kiavash; Nadolski, Rob; Westera, Wim – Education and Information Technologies, 2016

This paper presents the voice emotion recognition part of the FILTWAM framework for real-time emotion recognition in affective e-learning settings. FILTWAM (Framework for Improving Learning Through Webcams And Microphones) intends to offer timely and appropriate online feedback based upon learner's vocal intonations and facial expressions in order…

Descriptors: Affective Behavior, Emotional Response, Electronic Learning, Recognition (Psychology)

SLA Developmental Stages and Teachers' Assessment of Written French: Exploring Direkt Profil as a Diagnostic Assessment Tool

Peer reviewed

Direct link

Granfeldt, Jonas; Ågren, Malin – Language Testing, 2014

One core area of research in Second Language Acquisition is the identification and definition of developmental stages in different L2s. For L2 French, Bartning and Schlyter (2004) presented a model of six morphosyntactic stages of development in the shape of grammatical profiles. The model formed the basis for the computer program Direkt Profil…

Descriptors: Second Language Learning, Language Tests, French, Language Teachers

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

The Impact of ICT as Another Route to Overcome Learning Barriers for Students with SEN: A Case Study in an Egyptian Context

Download full text

Al-Gawhary, Wedad; Kambouri, Maria – International Association for Development of the Information Society, 2012

The purpose of this case study was to measure the impact of using ICT in Individual Learning Programmes of students with learning disabilities. Twenty five students and thirteen teachers took part in the research which was based on classroom observations. The Kappa coefficient was employed as a measure to statistically quantify the students'…

Descriptors: Foreign Countries, Special Needs Students, Down Syndrome, Autism

On the Reliability and Validity of Human and LSA-Based Evaluations of Complex Student-Authored Texts

Peer reviewed

Direct link

Seifried, Eva; Lenhard, Wolfgang; Baier, Herbert; Spinath, Birgit – Journal of Educational Computing Research, 2012

This study investigates the potential of a software tool based on Latent Semantic Analysis (LSA; Landauer, McNamara, Dennis, & Kintsch, 2007) to automatically evaluate complex German texts. A sample of N = 94 German university students provided written answers to questions that involved a high amount of analytical reasoning and evaluation.…

Descriptors: Foreign Countries, Computer Software, Computer Software Evaluation, Computer Uses in Education

Typing Compared with Handwriting for Essay Examinations at University: Letting the Students Choose

Peer reviewed

Direct link

Mogey, Nora; Paterson, Jessie; Burk, John; Purcell, Michael – ALT-J: Research in Learning Technology, 2010

Students at the University of Edinburgh do almost all their work on computers, but at the end of the semester they are examined by handwritten essays. Intuitively it would be appealing to allow students the choice of handwriting or typing, but this raises a concern that perhaps this might not be "fair"--that the choice a student makes,…

Descriptors: Handwriting, Essay Tests, Interrater Reliability, Grading

Experimenting with a Computer Essay-Scoring Program Based on ESL Student Writing Scripts

Peer reviewed

Direct link

Coniam, David – ReCALL, 2009

This paper describes a study of the computer essay-scoring program BETSY. While the use of computers in rating written scripts has been criticised in some quarters for lacking transparency or lack of fit with how human raters rate written scripts, a number of essay rating programs are available commercially, many of which claim to offer comparable…

Descriptors: Writing Tests, Scoring, Foreign Countries, Interrater Reliability

Previous Page | Next Page »

Pages: 1 | 2

Bahreini, Kiavash	2
Nadolski, Rob	2
Westera, Wim	2
Ahmed Alkhateeb	1
Al-Gawhary, Wedad	1
Amanda Huee-Ping Wong	1
Baier, Herbert	1
Breyer, F. Jay	1
Burk, John	1
Cave, Diana	1
Coniam, David	1
Fry, Joan	1
Gal, Kobi	1
Granfeldt, Jonas	1
Guangtian Zhu	1
Hassan Saleh Mahdi	1
Ivan Cherh Chiet Low	1
Jianwen Xiong	1
Kambouri, Maria	1
Kay, Robin H.	1
Knaack, Liesel	1
Kovalkov, Anastasia	1
Lenhard, Wolfgang	1
Lin Liu	1
Linlin, Cao	1
More ▼