ERIC - Search Results

Showing 1 to 15 of 100 results

Artificial Intelligence in International English Language Testing System Writing Assessments: A Comparative Study of Human Ratings and DeepAI

Peer reviewed

Somayeh Fathali; Fatemeh Mohajeri – Technology in Language Teaching & Learning, 2025

The International English Language Testing System (IELTS) is a high-stakes exam where Writing Task 2 significantly influences the overall scores, requiring reliable evaluation. While trained human raters perform this task, concerns about subjectivity and inconsistency have led to growing interest in artificial intelligence (AI)-based assessment…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Artificial Intelligence

ChatGPT4o as an AI Peer Assessor in EFL Speaking Classrooms: Examining Scoring Reliability and Feedback Effectiveness

Peer reviewed

Junfei Li; Jinyan Huang; Thomas Sheeran – SAGE Open, 2025

This study investigated the role of ChatGPT4o as an AI peer assessor in English-as-a-foreign-language (EFL) speaking classrooms, with a focus on its scoring reliability and the effectiveness of its feedback. The research involved 40 first-year English major students from two parallel classes at a Chinese university. Twenty from one class served as…

Descriptors: Artificial Intelligence, Technology Uses in Education, Peer Evaluation, English (Second Language)

Examining AI-Based Accuracy Assessment in L2 Learners' Writing

Peer reviewed

On-Soon Lee – Journal of Pan-Pacific Association of Applied Linguistics, 2024

Despite the increasing interest in using AI tools as assistant agents in instructional settings, the effectiveness of ChatGPT, the generative pretrained AI, for evaluating the accuracy of second language (L2) writing has been largely unexplored in formative assessment. Therefore, the current study aims to examine how ChatGPT, as an evaluator,…

Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning

Examining Rater Reliability When Using an Analytical Rubric for Oral Presentation Assessments

Peer reviewed

Sasithorn Limgomolvilas; Patsawut Sukserm – LEARN Journal: Language Education and Acquisition Research Network, 2025

The assessment of English speaking in EFL environments can be inherently subjective and influenced by various factors beyond linguistic ability, including choice of assessment criteria, and even the rubric type. In classroom assessment, the type of rubric recommended for English speaking tasks is the analytical rubric. Driven by three aims, this…

Descriptors: Oral Language, Speech Communication, English (Second Language), Second Language Learning

Automated Sign Language Vocabulary Assessment: Comparing Human and Machine Ratings and Studying Learner Perceptions

Peer reviewed

Franz Holzknecht; Sandrine Tornay; Alessia Battisti; Aaron Olaf Batty; Katja Tissi; Tobias Haug; Sarah Ebling – Language Assessment Quarterly, 2024

Although automated spoken language assessment is rapidly growing, such systems have not been widely developed for signed languages. This study provides validity evidence for an automated web application that was developed to assess and give feedback on handshape and hand movement of L2 learners' Swiss German Sign Language signs. The study shows…

Descriptors: Sign Language, Vocabulary Development, Educational Assessment, Automation

Impact of Self-Construal on Rater Severity in Peer Assessments of Oral Presentations

Peer reviewed

Tanaka, Mitsuko; Ross, Steven J. – Assessment in Education: Principles, Policy & Practice, 2023

Raters vary from each other in their severity and leniency in rating performance. This study examined the factors affecting rater severity in peer assessments of oral presentations in English as a Foreign Language (EFL), focusing on peer raters' self-construal and presentation abilities. Japanese university students enrolled in EFL classes…

Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Peer Evaluation

Examining Rater Biases of Peer Assessors in Different Assessment Environments

Peer reviewed

Yesilçinar, Sabahattin; Sata, Mehmet – International Journal of Psychology and Educational Studies, 2021

The current study employed many-facet Rasch measurement (MFRM) to explain the rater bias patterns of EFL student teachers (hereafter students) when they rate the teaching performance of their peers in three assessment environments: online, face-to-face, and anonymous. Twenty-four students and two instructors rated 72 micro-teachings performed by…

Descriptors: Peer Evaluation, Preservice Teachers, English (Second Language), Second Language Learning

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

Revolutionising Essay Evaluation: A Cutting-Edge Rubric for AI-Assisted Writing

Peer reviewed

Hassan Saleh Mahdi; Ahmed Alkhateeb – International Journal of Computer-Assisted Language Learning and Teaching, 2025

This study aims to develop a robust rubric for evaluating artificial intelligence (AI)-assisted essay writing in English as a Foreign Language (EFL) contexts. Employing a modified Delphi technique, we conducted a comprehensive literature review and administered Likert scale questionnaires. This process yielded nine key evaluation criteria,…

Descriptors: Scoring Rubrics, Essays, Writing Evaluation, Artificial Intelligence

A Rasch Analysis of Rater Behaviour in Speaking Assessment

Peer reviewed

Polat, Murat – International Online Journal of Education and Teaching, 2020

The assessment of speaking skills in foreign language testing has always had some pros (testing learners' speaking skills doubles the validity of any language test) and cons (many test-relevant/irrelevant variables interfere) since it is a multi-dimensional process. In the meantime, exploring grader behaviours while scoring learners' speaking…

Descriptors: Item Response Theory, Interrater Reliability, Speech Skills, Second Language Learning

Applying Generalizability Theory in Language Testing: Comparing Nested and Crossed Scoring Designs in the Assessment of Speaking Skills

Peer reviewed

Polat, Murat; Turhan, Nihan Sölpük – International Journal of Curriculum and Instruction, 2021

Scoring language learners' speaking skills is open to a number of measurement errors since raters' personal judgements could be involved in the process. Different grading designs in which raters score a student's whole speaking skills or a specific dimension of the speaking performance could be set up to control and minimize the amount of the error…

Descriptors: Language Tests, Scoring, Speech Communication, State Universities

The Longitudinal Stability of Rating Characteristics in an EFL Examination: Methodological and Substantive Considerations

Peer reviewed

Lamprianou, Iasonas; Tsagari, Dina; Kyriakou, Nansia – Language Testing, 2021

This longitudinal study (2002-2014) investigates the stability of rating characteristics of a large group of raters over time in the context of the writing paper of a national high-stakes examination. The study uses one measure of rater severity and two measures of rater consistency. The results suggest that the rating characteristics of…

Descriptors: Longitudinal Studies, Evaluators, High Stakes Tests, Writing Evaluation

Rater Judgments and Word Difficulty: Conceptualizing the Substantive Validity of the VST

Peer reviewed

Derek N. Canning; Stuart McLean; Joseph P. Vitta – Vocabulary Learning and Instruction, 2022

The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…

Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills

Developing and Validating a Computerized Oral Proficiency Test of English as a Foreign Language (COPTEFL)

Peer reviewed

Isler, Cemre; Aydin, Belgin – International Journal of Assessment Tools in Education, 2021

This study is about the development and validation process of the Computerized Oral Proficiency Test of English as a Foreign Language (COPTEFL). The test aims at assessing the speaking proficiency levels of students in Anadolu University School of Foreign Languages (AUSFL). For this purpose, three monologic tasks were developed based on the Global…

Descriptors: Test Construction, Construct Validity, Interrater Reliability, Scores

An Empirical Study of English Language Teachers' Methodology on the Career Growth of Saudi Students

Peer reviewed

Alkhanani, Badriah – International Journal of Language Education, 2022

The purpose of this study was to find the effect of English Language Teachers' Methodology (ELTM) on the Career Growth (CG) of the Saudi students. In order to provide a solid basis for this research study, a cross-sectional-descriptive research design was employed. For scale development and tool standardization, inter-class correlation…

Descriptors: Career Development, English (Second Language), Second Language Learning, Second Language Instruction
