ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	18
Since 2017 (last 10 years)	45
Since 2007 (last 20 years)	75

Descriptor

Interrater Reliability	104
Second Language Instruction	104
Second Language Learning	83
English (Second Language)	78
Foreign Countries	65
Language Tests	47
Comparative Analysis	30
Language Proficiency	27
Writing Evaluation	25
Oral Language	24
College Students	23
Language Teachers	23
Statistical Analysis	23
Teaching Methods	22
Scores	21
Correlation	19
Evaluators	19
Undergraduate Students	17
Scoring	16
Higher Education	15
Student Attitudes	15
Essays	14
Interviews	14
Writing Skills	14
Pretests Posttests	13

Publication Type

Journal Articles	87
Reports - Research	80
Tests/Questionnaires	19
Reports - Evaluative	7
Speeches/Meeting Papers	6
Dissertations/Theses -…	5
Information Analyses	5
Reports - Descriptive	5
Opinion Papers	2
Books	1
Collected Works - General	1
Collected Works - Proceedings	1
Collected Works - Serials	1
Guides - Non-Classroom	1
Reports - General	1

Education Level

Higher Education	44
Postsecondary Education	39
Adult Education	4
Secondary Education	4
Elementary Education	3
Early Childhood Education	2
Elementary Secondary Education	2
Grade 5	2
Primary Education	2
Grade 3	1
Grade 4	1
Intermediate Grades	1
Kindergarten	1
Middle Schools	1

Audience

Practitioners	3
Teachers	2

Location

Japan	11
Iran	10
China	7
Turkey	7
Saudi Arabia	4
Hong Kong	3
Philippines	3
Spain	3
Europe	2
Netherlands	2
Thailand	2
Turkey (Istanbul)	2
Asia	1
Australia	1
Brazil	1
Canada	1
Connecticut	1
Cyprus	1
Denmark	1
Egypt	1
Estonia	1
Florida	1
Georgia	1
Germany	1
Greece	1

Assessments and Surveys

Test of English as a Foreign…	4
Flesch Kincaid Grade Level…	1
International English…	1
Modern Language Aptitude Test	1
Peabody Picture Vocabulary…	1
Test of English for…	1

Showing 1 to 15 of 104 results

Scoring Rubric Reliability and Internal Validity in Rater-Mediated EFL Writing Assessment: Insights from Many-Facet Rasch Measurement

Peer reviewed

Li, Wentao – Reading and Writing: An Interdisciplinary Journal, 2022

Scoring rubrics are known to be effective for assessing writing for both testing and classroom teaching purposes. How raters interpret the descriptors in a rubric can significantly impact the subsequent final score, and further, the descriptors may also color a rater's judgment of a student's writing quality. Little is known, however, about how…

Descriptors: Scoring Rubrics, Interrater Reliability, Writing Evaluation, Teaching Methods

The Reliability of Using ChatGPT in Rating EFL Writings

Peer reviewed

Yang Yang – Shanlax International Journal of Education, 2024

This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…

Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods

An Empirical Study of English Language Teachers' Methodology on the Career Growth of Saudi Students

Peer reviewed

Alkhanani, Badriah – International Journal of Language Education, 2022

The purpose of this study was to find the effect of English Language Teachers' Methodology (ELTM) on the Career Growth (CG) of the Saudi students. In order to provide a solid basis for this research study, a cross-sectional-descriptive research design was employed. For scale development and tool standardization, inter-class correlation…

Descriptors: Career Development, English (Second Language), Second Language Learning, Second Language Instruction

Examining Rater Reliability When Using an Analytical Rubric for Oral Presentation Assessments

Peer reviewed

Sasithorn Limgomolvilas; Patsawut Sukserm – LEARN Journal: Language Education and Acquisition Research Network, 2025

The assessment of English speaking in EFL environments can be inherently subjective and influenced by various factors beyond linguistic ability, including choice of assessment criteria, and even the rubric type. In classroom assessment, the type of rubric recommended for English speaking tasks is the analytical rubric. Driven by three aims, this…

Descriptors: Oral Language, Speech Communication, English (Second Language), Second Language Learning

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

Revolutionising Essay Evaluation: A Cutting-Edge Rubric for AI-Assisted Writing

Peer reviewed

Hassan Saleh Mahdi; Ahmed Alkhateeb – International Journal of Computer-Assisted Language Learning and Teaching, 2025

This study aims to develop a robust rubric for evaluating artificial intelligence (AI)--assisted essay writing in English as a Foreign Language (EFL) contexts. Employing a modified Delphi technique, we conducted a comprehensive literature review and administered Likert scale questionnaires. This process yielded nine key evaluation criteria,…

Descriptors: Scoring Rubrics, Essays, Writing Evaluation, Artificial Intelligence

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

Impacts of ChatGPT-Assisted Writing for EFL English Majors: Feasibility and Challenges

Peer reviewed

Chung-You Tsai; Yi-Ti Lin; Iain Kelsall Brown – Education and Information Technologies, 2024

This study sought to determine the impact of using ChatGPT to assist English as a foreign language (EFL) English majors in revising essays, and whether it could lead to higher scores and potential unfairness. A prospective, double-blinded, paired-comparison study was conducted in Feb. 2023. A total of 44 students provided 44 original essays…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, English (Second Language)

Applying Generalizability Theory in Language Testing: Comparing Nested and Crossed Scoring Designs in the Assessment of Speaking Skills

Peer reviewed

Polat, Murat; Turhan, Nihan Sölpük – International Journal of Curriculum and Instruction, 2021

Scoring language learners' speaking skills is open to a number of measurement errors since raters' personal judgements could be involved in the process. Different grading designs in which raters score a student's whole speaking skills or a specific dimension of the speaking performance could be set up to control and minimize the amount of the error…

Descriptors: Language Tests, Scoring, Speech Communication, State Universities

The Rater Performance Categorization System (RPCS) for Intensive English Programs

Peer reviewed

Sahin, Alper – Shanlax International Journal of Education, 2021

There are several student performances assessed in Intensive English Programs (IEPs) worldwide in each academic year. These student performances are mostly graded by human raters with a certain degree of error. However, the accuracy of these performance assessments is of utmost importance because they feed data into some high stakes decisions…

Descriptors: Intensive Language Courses, Second Language Instruction, Second Language Learning, English (Second Language)

Fairness in Oral Language Assessment: Training Raters and Considering Examinees' Expectations

Peer reviewed

Doosti, Mehdi; Ahmadi Safa, Mohammad – International Journal of Language Testing, 2021

This study examined the effect of rater training on promoting inter-rater reliability in oral language assessment. It also investigated whether rater training and the consideration of the examinees' expectations by the examiners have any effect on test-takers' perceptions of being fairly evaluated. To this end, four raters scored 31 Iranian…

Descriptors: Oral Language, Language Tests, Interrater Reliability, Training

Rater Judgments and Word Difficulty: Conceptualizing the Substantive Validity of the VST

Peer reviewed

Derek N. Canning; Stuart McLean; Joseph P. Vitta – Vocabulary Learning and Instruction, 2022

The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…

Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills

The Significance of Instructional Design: Analysis of Content in Language MOOC Forums

Peer reviewed

Díez-Arcón, Paz – JALT CALL Journal, 2023

Language MOOC research has evolved over the last three years to a more mature stage in which researchers have gained a deeper comprehension of the theories that enable effective language learning in this format. The application of these theoretical advances should be reflected in the instructional design of the courses. This study is based on this…

Descriptors: MOOCs, Second Language Learning, Second Language Instruction, Learning Theories

Investigation of Interrater Reliability in the Evaluation of Foreign Language Writing Skills with Multigroup Confirmatory Factor Analysis

Peer reviewed

Önen, Emine; Yayvak, Melike Kübra Tasdelen – Journal of Education and Training Studies, 2019

In this study, it was aimed to examine the interrater reliability of the scoring of paragraph writing skills on foreign languages with the measurement invariance tests. The study group consists of 267 students studying English at the Preparatory School at Gazi University. In the study, where students write a paragraph on the same topic, the…

Descriptors: Second Language Learning, Second Language Instruction, Factor Analysis, English (Second Language)

Source

English Language Teaching	5
ProQuest LLC	5
Language Testing	4
Foreign Language Annals	3
Iranian Journal of Language…	3
Language Assessment Quarterly	3
Online Submission	3
Advances in Language and…	2
Canadian Modern Language…	2
ELT Journal	2
Education and Information…	2
JALT CALL Journal	2
Journal of Communication…	2
Language Learning Journal	2
Language Testing in Asia	2
Reading Matrix: An…	2
Shanlax International Journal…	2
Studies in Second Language…	2
System	2
Annual Review of Applied…	1
Applied Language Learning	1
Assessing Writing	1
Cogent Education	1
Cross Currents	1
Edinburgh Working Papers in…	1

Author

Nakamura, Yuji	3
Saito, Kazuya	2
Afzali, Katayoon	1
Ahmadi Safa, Mohammad	1
Ahmadi, Alireza	1
Ahmed Alkhateeb	1
Ahour, Touran	1
Alhaisoni, Eid	1
Alharthi, Saleh	1
Alkhanani, Badriah	1
Arikan, Arda	1
Bagherkazemi, Marzieh	1
Barnwell, David	1
Beh-Afarin, Seyed Reza	1
Bijani, Houman	1
Birjandi, Parviz	1
Bogorevich, Valeriia	1
Brown, Annie	1
Chalhoub-Deville, Micheline	1
Chambers, Francine	1
Chan, Stephanie W. Y.	1
Chang, Lei	1
Cheung, Wai Ming	1
Chung-You Tsai	1
Coniam, David	1