Showing 1 to 15 of 204 results
Peer reviewed
Direct link
Danwei Cai; Ben Naismith; Maria Kostromitina; Zhongwei Teng; Kevin P. Yancey; Geoffrey T. LaFlair – Language Learning, 2025
Globalization and increases in the numbers of English language learners have led to a growing demand for English proficiency assessments of spoken language. In this paper, we describe the development of an automatic pronunciation scorer built on state-of-the-art deep neural network models. The model is trained on a bespoke human-rated dataset that…
Descriptors: Automation, Scoring, Pronunciation, Speech Tests
Peer reviewed
PDF on ERIC Download full text
Yang Yang – Shanlax International Journal of Education, 2024
This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…
Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods
Peer reviewed
PDF on ERIC Download full text
Ramy Shabara; Khaled ElEbyary; Deena Boraie – Teaching English with Technology, 2024
Although there are claims that ChatGPT, an AI-based language model, is capable of assessing the writing of L2 learners accurately and consistently in the classroom, a number of recent studies have shown discrepancies between AI and human raters. Furthermore, there is a lack of studies investigating the intrareliability of ChatGPT scores.…
Descriptors: Foreign Countries, Artificial Intelligence, Scoring Rubrics, Student Evaluation
Peer reviewed
Direct link
On-Soon Lee – Journal of Pan-Pacific Association of Applied Linguistics, 2024
Despite the increasing interest in using AI tools as assistant agents in instructional settings, the effectiveness of ChatGPT, the generative pretrained AI, for evaluating the accuracy of second language (L2) writing has been largely unexplored in formative assessment. Therefore, the current study aims to examine how ChatGPT, as an evaluator,…
Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning
Peer reviewed
Direct link
Seedhouse, Paul; Satar, Müge – Classroom Discourse, 2023
The same L2 speaking performance may be analysed and evaluated in very different ways by different teachers or raters. We present a new, technology-assisted research design which opens up to investigation the trajectories of convergence and divergence between raters. We tracked and recorded what different raters noticed when, whilst grading a…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Oral Language
Peer reviewed
PDF on ERIC Download full text
Sasithorn Limgomolvilas; Patsawut Sukserm – LEARN Journal: Language Education and Acquisition Research Network, 2025
The assessment of English speaking in EFL environments can be inherently subjective and influenced by various factors beyond linguistic ability, including choice of assessment criteria, and even the rubric type. In classroom assessment, the type of rubric recommended for English speaking tasks is the analytical rubric. Driven by three aims, this…
Descriptors: Oral Language, Speech Communication, English (Second Language), Second Language Learning
Peer reviewed
Direct link
Tanaka, Mitsuko; Ross, Steven J. – Assessment in Education: Principles, Policy & Practice, 2023
Raters vary from each other in their severity and leniency in rating performance. This study examined the factors affecting rater severity in peer assessments of oral presentations in English as a Foreign Language (EFL), focusing on peer raters' self-construal and presentation abilities. Japanese university students enrolled in EFL classes…
Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Peer Evaluation
Peer reviewed
PDF on ERIC Download full text
Yesilçinar, Sabahattin; Sata, Mehmet – International Journal of Psychology and Educational Studies, 2021
The current study employed many-facet Rasch measurement (MFRM) to explain the rater bias patterns of EFL student teachers (hereafter students) when they rate the teaching performance of their peers in three assessment environments: online, face-to-face, and anonymous. Twenty-four students and two instructors rated 72 micro-teachings performed by…
Descriptors: Peer Evaluation, Preservice Teachers, English (Second Language), Second Language Learning
Peer reviewed
Direct link
Yan, Xun; Chuang, Ping-Lin – Language Testing, 2023
This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program.…
Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Certification
Jiyeo Yun – English Teaching, 2023
Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…
Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring
Peer reviewed
Direct link
Shabani, Enayat A.; Panahi, Jaleh – Language Testing in Asia, 2020
The literature on using scoring rubrics in writing assessment denotes the significance of rubrics as practical and useful means to assess the quality of writing tasks. This study tries to investigate the agreement among rubrics endorsed and used for assessing the essay writing tasks by the internationally recognized tests of English language…
Descriptors: Writing Evaluation, Scoring Rubrics, Scores, Interrater Reliability
Peer reviewed
Direct link
Han, Chao; Zhao, Xiao – Assessment & Evaluation in Higher Education, 2021
The accuracy of peer ratings on students' performance has attracted much attention from higher education researchers. In this study, we attempted to explore the accuracy of peer ratings on the quality of spoken-language interpreting in the context of tertiary-level interpreter training. We sought to understand how different types of peer raters…
Descriptors: Accuracy, Peer Evaluation, Oral Language, Interpretive Skills
Peer reviewed
Direct link
Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024
This paper proposes to use depth perception to represent raters' decisions in holistic evaluation of ESL essays, as an alternative medium to the conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…
Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy
Peer reviewed
Direct link
Hassan Saleh Mahdi; Ahmed Alkhateeb – International Journal of Computer-Assisted Language Learning and Teaching, 2025
This study aims to develop a robust rubric for evaluating artificial intelligence (AI)--assisted essay writing in English as a Foreign Language (EFL) contexts. Employing a modified Delphi technique, we conducted a comprehensive literature review and administered Likert scale questionnaires. This process yielded nine key evaluation criteria,…
Descriptors: Scoring Rubrics, Essays, Writing Evaluation, Artificial Intelligence
Peer reviewed
PDF on ERIC Download full text
Polat, Murat – International Online Journal of Education and Teaching, 2020
The assessment of speaking skills in foreign language testing has always had some pros (testing learners' speaking skills strengthens the validity of any language test) and cons (many test-relevant/irrelevant variables interfere) since it is a multi-dimensional process. In the meantime, exploring grader behaviours while scoring learners' speaking…
Descriptors: Item Response Theory, Interrater Reliability, Speech Skills, Second Language Learning