ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	8
Since 2017 (last 10 years)	13
Since 2007 (last 20 years)	21

Descriptor

Interrater Reliability	22
Scoring Rubrics	22
Second Language Learning	22
English (Second Language)	18
Evaluators	11
Second Language Instruction	11
Foreign Countries	10
Language Tests	10
Scores	10
Oral Language	7
Undergraduate Students	7
Writing Evaluation	7
Evaluation Criteria	6
Language Proficiency	5
Statistical Analysis	5
Computer Assisted Testing	4
Correlation	4
Essays	4
Language Teachers	4
Speech Communication	4
Chinese	3
Comparative Analysis	3
Language Usage	3
Native Language	3
Pronunciation	3
More ▼

Publication Type

Journal Articles	20
Reports - Research	20
Tests/Questionnaires	7
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Reports - Evaluative	1

Education Level

Higher Education	11
Postsecondary Education	11
Adult Education	2
Early Childhood Education	1
Elementary Education	1
Grade 3	1
Grade 5	1
Intermediate Grades	1
Middle Schools	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Practitioners

Location

China	2
Georgia	1
India	1
Japan	1
Malaysia	1
New York (New York)	1
Saudi Arabia	1
South Korea	1
Thailand	1
Turkey	1
Turkey (Istanbul)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Flesch Kincaid Grade Level…	1
International English…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 22 results Save | Export

Examining AI-Based Accuracy Assessment in L2 Learners' Writing

Peer reviewed

Direct link

On-Soon Lee – Journal of Pan-Pacific Association of Applied Linguistics, 2024

Despite the increasing interest in using AI tools as assistant agents in instructional settings, the effectiveness of ChatGPT, the generative pretrained AI, for evaluating the accuracy of second language (L2) writing has been largely unexplored in formative assessment. Therefore, the current study aims to examine how ChatGPT, as an evaluator,…

Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning

Examining Rater Reliability When Using an Analytical Rubric for Oral Presentation Assessments

Peer reviewed
PDF on ERIC

Download full text

Sasithorn Limgomolvilas; Patsawut Sukserm – LEARN Journal: Language Education and Acquisition Research Network, 2025

The assessment of English speaking in EFL environments can be inherently subjective and influenced by various factors beyond linguistic ability, including choice of assessment criteria, and even the rubric type. In classroom assessment, the type of rubric recommended for English speaking tasks is the analytical rubric. Driven by three aims, this…

Descriptors: Oral Language, Speech Communication, English (Second Language), Second Language Learning

How Many Raters Can Be Enough: G Theory Applied to Assessment and Measurement of L2 Speech Perception

Peer reviewed
PDF on ERIC

Download full text

Kevin Hirschi; Okim Kang – Language Teaching Research Quarterly, 2023

This paper extends the use of Generalizability Theory to the measurement of extemporaneous L2 speech through the lens of speech perception. Using six datasets of previous studies, it reports on "G studies"--a method of breaking down measurement variance--and "D studies"--a predictive study of the impact on reliability when…

Descriptors: Evaluators, Generalization, Evaluation Methods, Speech Communication

Examining Consistency among Different Rubrics for Assessing Writing

Peer reviewed

Direct link

Shabani, Enayat A.; Panahi, Jaleh – Language Testing in Asia, 2020

The literature on using scoring rubrics in writing assessment denotes the significance of rubrics as practical and useful means to assess the quality of writing tasks. This study tries to investigate the agreement among rubrics endorsed and used for assessing the essay writing tasks by the internationally recognized tests of English language…

Descriptors: Writing Evaluation, Scoring Rubrics, Scores, Interrater Reliability

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Direct link

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

Revolutionising Essay Evaluation: A Cutting-Edge Rubric for AI-Assisted Writing

Peer reviewed

Direct link

Hassan Saleh Mahdi; Ahmed Alkhateeb – International Journal of Computer-Assisted Language Learning and Teaching, 2025

This study aims to develop a robust rubric for evaluating artificial intelligence (AI)--assisted essay writing in English as a Foreign Language (EFL) contexts. Employing a modified Delphi technique, we conducted a comprehensive literature review and administered Likert scale questionnaires. This process yielded nine key evaluation criteria,…

Descriptors: Scoring Rubrics, Essays, Writing Evaluation, Artificial Intelligence

Scoring Rubric Reliability and Internal Validity in Rater-Mediated EFL Writing Assessment: Insights from Many-Facet Rasch Measurement

Peer reviewed

Direct link

Li, Wentao – Reading and Writing: An Interdisciplinary Journal, 2022

Scoring rubrics are known to be effective for assessing writing for both testing and classroom teaching purposes. How raters interpret the descriptors in a rubric can significantly impact the subsequent final score, and further, the descriptors may also color a rater's judgment of a student's writing quality. Little is known, however, about how…

Descriptors: Scoring Rubrics, Interrater Reliability, Writing Evaluation, Teaching Methods

A Genre-Based Approach in Teaching Writing to Student Teachers of English Language Teaching in a Digital Context

Peer reviewed

Direct link

Kinik, Betul; Genc, Bilal – Reading Matrix: An International Online Journal, 2022

The current study presents the findings of a pre-test/post-test design to explore the efficacy of a genre-based approach to teaching argumentative essay writing during synchronous classes. The study is conducted with the participation of a group of freshman and junior year student teachers of English Language Teaching enrolled at the course of…

Descriptors: Literary Genres, English (Second Language), Second Language Learning, Second Language Instruction

Learning an L2 and L3 at the Same Time: Help or Hinder?

Peer reviewed

Direct link

Huang, Ting; Steinkrauss, Rasmus; Verspoor, Marjolijn – International Journal of Multilingualism, 2022

There is quite a bit of evidence showing that the experience of learning an L2 will help in learning an L3, but as far as we know, very little research has investigated the possible impact of L3 learning on the already existing and still developing L2 system within the learner. According to Complex Dynamic Systems Theory (CDST), language…

Descriptors: Multilingualism, Second Language Learning, Second Language Instruction, Transfer of Training

Can Subject Matter Experts Rate the English Language Skills of Customer Services Representatives (CSRs) at Work in Indian Contact Centre?

Peer reviewed

Direct link

Lockwood, Jane; Raquel, Michelle – Language Assessment Quarterly, 2019

Millions of customer services representatives are assessed each year by subject matter experts (e.g., recruiters, team leaders) in Asian contact centres to ensure good spoken communication skills when serving customers on the phones. In other workplace contexts, language experts are employed to do this work but in Asian contact centres, a…

Descriptors: English (Second Language), Second Language Learning, Language Skills, Telecommunications

Malaysian Speaking Proficiency Assessment Effectiveness for Undergraduates Suffering from Minimal Descriptors

Peer reviewed
PDF on ERIC

Download full text

Saeed, Karwan Mustafa; Ismail, Shaik Abdul Malik Mohamad; Eng, Lin Siew – International Journal of Instruction, 2019

This study was primarily aimed at developing an English-speaking proficiency test and analytic rubrics designed to measure speaking proficiency of Malaysian undergraduates. On the basis of Littlewood's Methodological Framework and Long's Interaction Hypothesis, the researchers derived three speaking tasks from four sources: (a) syllabus of the…

Descriptors: Foreign Countries, Undergraduate Students, Second Language Learning, English (Second Language)

Assessing Individual and Group Oral Exams: Scoring Criteria and Rater Interaction

Peer reviewed
PDF on ERIC

Download full text

Yalçin-Çolakoglu, Özlem; Selçuk, Merve – Advances in Language and Literary Studies, 2019

Criterion referenced tests of second language speaking performance are administered in different institutions using different procedures. The present study reports raters' practices of second language speaking tests, in particular the correspondence between test-takers' grades when assessed individually and in groups. Data derived from…

Descriptors: Oral Language, Language Tests, Test Validity, Inferences

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Native and Non-Native Raters of L2 Speaking Performance: Accent Familiarity and Cognitive Processes

Direct link

Bogorevich, Valeriia – ProQuest LLC, 2018

Rater variation in performance assessment can impact test-takers' scores and compromise assessments' fairness and validity (Crooks, Kane, & Cohen, 1996). Rater variation can also undermine a test's validity and fairness; therefore, it is important to investigate raters' scoring patterns in order to inform rater training. Substantial work has…

Descriptors: Pronunciation, Familiarity, English (Second Language), Second Language Learning

Analysis of Rater Severity on Written Expression Exam Using Many Faceted Rasch Measurement

Peer reviewed
PDF on ERIC

Download full text

Prieto, Gerardo; Nieto, Eloísa – Psicologica: International Journal of Methodology and Experimental Psychology, 2014

This paper describes how a Many Faceted Rasch Measurement (MFRM) approach can be applied to performance assessment focusing on rater analysis. The article provides an introduction to MFRM, a description of MFRM analysis procedures, and an example to illustrate how to examine the effects of various sources of variability on test takers' performance…

Descriptors: Item Response Theory, Interrater Reliability, Rating Scales, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2

ETS Research Report Series	2
Language Assessment Quarterly	2
Language Testing	2
Advances in Language and…	1
English Language Teaching	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Pan-Pacific…	1
LEARN Journal: Language…	1
Language Teaching Research…	1
Language Testing in Asia	1
ProQuest LLC	1
Psicologica: International…	1
Reading & Writing Quarterly	1
Reading Matrix: An…	1
Reading and Writing: An…	1
Society for Research on…	1
Working Papers in TESOL &…	1
More ▼

Ahmed Alkhateeb	1
Barkhuizen, Gary	1
Beltrán, Jorge	1
Bogorevich, Valeriia	1
Davis, Larry	1
Elder, Catherine	1
Eng, Lin Siew	1
Genc, Bilal	1
Gonzalez Canche, Manuel S.	1
Hassan Saleh Mahdi	1
Hijikata-Someya, Yuko	1
Huang, Ting	1
Ismail, Shaik Abdul Malik…	1
Jamieson, Joan	1
Jiehui Hu	1
Kevin Hirschi	1
Kim, Hyun Jung	1
Kinik, Betul	1
Knoch, Ute	1
Li, Wentao	1
Lian Li	1
Lockwood, Jane	1
Mellon, Paula J.	1
Mollaun, Pam	1
Nieto, Eloísa	1
More ▼