ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	16
Since 2007 (last 20 years)	35

Descriptor

Comparative Analysis	42
Interrater Reliability	42
Second Language Learning	42
Second Language Instruction	28
English (Second Language)	27
Foreign Countries	22
Language Tests	19
Evaluators	14
Correlation	11
Statistical Analysis	11
Writing Evaluation	11
Computer Assisted Testing	10
Essays	10
Language Proficiency	10
Scores	10
College Students	9
Language Teachers	9
Scoring	9
Computer Software	8
Oral Language	8
Undergraduate Students	8
Questionnaires	7
Teaching Methods	7
Evaluation Methods	6
Native Speakers	6

Publication Type

Journal Articles	36
Reports - Research	33
Tests/Questionnaires	8
Reports - Evaluative	4
Information Analyses	3
Speeches/Meeting Papers	3
Collected Works - Proceedings	1
Guides - Non-Classroom	1
Reports - Descriptive	1

Education Level

Higher Education	20
Postsecondary Education	17
Secondary Education	4
Adult Education	2
Elementary Secondary Education	2
Elementary Education	1
Grade 11	1
High Schools	1
Preschool Education	1

Audience

Practitioners

Location

Iran	5
China	3
Turkey	3
Hong Kong	2
Japan	2
Netherlands	2
Philippines	2
Saudi Arabia	2
Arizona	1
Asia	1
Australia	1
Brazil	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Europe	1
Florida	1
Germany	1
Greece	1
Hawaii	1
Ireland	1
Israel	1
Italy	1
Kazakhstan	1

Assessments and Surveys

Test of English as a Foreign…	3
Expressive One Word Picture…	1
Mean Length of Utterance	1
Peabody Picture Vocabulary…	1

Showing 1 to 15 of 42 results

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes using depth perception to represent raters' decisions in the holistic evaluation of ESL essays, as an alternative to the conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

Impacts of ChatGPT-Assisted Writing for EFL English Majors: Feasibility and Challenges

Peer reviewed

Chung-You Tsai; Yi-Ti Lin; Iain Kelsall Brown – Education and Information Technologies, 2024

This study sought to determine the impacts of using ChatGPT to assist English as a foreign language (EFL) college English majors in revising essays, and whether doing so could lead to higher scores and potentially cause unfairness. A prospective, double-blinded, paired-comparison study was conducted in Feb. 2023. A total of 44 students provided 44 original essays…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, English (Second Language)

Applying Generalizability Theory in Language Testing: Comparing Nested and Crossed Scoring Designs in the Assessment of Speaking Skills

Peer reviewed

Polat, Murat; Turhan, Nihan Sölpük – International Journal of Curriculum and Instruction, 2021

Scoring language learners' speaking skills is open to a number of measurement errors, since raters' personal judgements can be involved in the process. Different grading designs, in which raters score a student's overall speaking skills or a specific dimension of the speaking performance, could be set up to control and minimize the amount of the error…

Descriptors: Language Tests, Scoring, Speech Communication, State Universities

Comparative Judgement: Assess Student Production without Absolute Judgements

Peer reviewed

Sumner, Josh – Research-publishing.net, 2021

Comparative Judgement (CJ) has emerged as a technique that typically makes use of holistic judgement to assess difficult-to-specify constructs such as production (speaking and writing) in Modern Foreign Languages (MFL). In traditional approaches, markers assess candidates' work one-by-one in an absolute manner, assigning scores to different…

Descriptors: Holistic Approach, Student Evaluation, Comparative Analysis, Decision Making

Writing Scale Effects on Raters: An Exploratory Study

Peer reviewed

Jeong, Heejeong – Language Testing in Asia, 2019

In writing assessment, finding a valid, reliable, and efficient scale is critical. Appropriate scales increase rater reliability and can also save time and money. This exploratory study compared the effects of a binary scale and an analytic scale across teacher raters and expert raters. The purpose of the study is to find out how different scale…

Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction

Rater Effects on L2 Oral Assessment: Focusing on Accent Familiarity of L2 Teachers

Peer reviewed

Park, Mi Sun – Language Assessment Quarterly, 2020

In the present study, I examined the effects of rater characteristics, in particular, raters' familiarity with a foreign accent, on the assessment of second language (L2) pronunciation. Forty-three native English-speaking teachers were divided into three groups according to their reported types of familiarity with Korean accents: heritage,…

Descriptors: Evaluators, Familiarity, Second Language Learning, English (Second Language)

Comparison of Automatic and Expert Teachers' Rating of Computerized English Listening-Speaking Test

Peer reviewed

Linlin, Cao – English Language Teaching, 2020

Through Many-Facet Rasch analysis, this study explores the rating differences between 1 automatic computer rater and 5 expert teacher raters in scoring 119 students on a computerized English listening-speaking test. Results indicate that both the automatic and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…

Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

The Effects of Proficiency and Study-Abroad on Chinese EFL Learners' Refusals

Peer reviewed

Wang, Yuqi; Ren, Wei – Language Learning Journal, 2022

L2 pragmatics research has explored the effects of different factors on different aspects of learners' pragmatic performance, but often not simultaneously. In addition, syntactic complexity is rarely examined in L2 pragmatics. This cross-sectional study aimed to conduct a multidimensional analysis to explore the effects of proficiency and study-abroad…

Descriptors: Pragmatics, Second Language Learning, Second Language Instruction, English (Second Language)

Rater Cognition in L2 Speaking Assessment: A Review of the Literature

Peer reviewed

Han, Qie – Working Papers in TESOL & Applied Linguistics, 2016

This literature review attempts to survey representative studies within the context of L2 speaking assessment that have contributed to the conceptualization of rater cognition. Two types of studies are looked at: 1) studies that examine "how" raters differ (and sometimes agree) in their cognitive processes and rating behaviors, in terms…

Descriptors: Second Language Learning, Student Evaluation, Evaluators, Speech Tests

Assessing Kahoot's Impact on EFL Students' Learning Outcomes

Peer reviewed

Alharthi, Saleh – TESOL International Journal, 2020

Technology use in the classroom to improve students' learning has gained significant attention over the past few years. Technology has metamorphosed from CALL to MALL and the use of gamification. Teachers are more concerned with methodologies that can improve students' motivation and engagement, particularly in EFL classrooms. This mixed method…

Descriptors: Teaching Methods, Second Language Learning, Second Language Instruction, English (Second Language)

Evaluating CEFR Rater Performance through the Analysis of Spoken Learner Corpora

Peer reviewed

Huang, Lan-fen; Kubelec, Simon; Keng, Nicole; Hsu, Lung-hsun – Language Testing in Asia, 2018

Background: Although teachers of English are required to assess students' speaking proficiency in the Common European Framework of Reference for Languages (CEFR), their ability to rate is seldom evaluated. The application of descriptors in the assessment of English speaking on CEFR in the context of English as a foreign language has not often been…

Descriptors: Evaluators, Second Language Learning, Second Language Instruction, English (Second Language)

The Effect of Training and Rater Differences on Oral Proficiency Assessment

Peer reviewed

Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019

Because judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error arising from rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…

Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)

A Comparative Analysis of Face to Face Instruction vs. Telegram Mobile Instruction in Terms of Narrative Writing

Peer reviewed

Heidari, Jamshid; Khodabandeh, Farzaneh; Soleimani, Hassan – JALT CALL Journal, 2018

The emergence of computer technology in English language teaching has paved the way for teachers' application of Mobile Assisted Language Learning (MALL) and its advantages in teaching. This study aimed to compare the effectiveness of face-to-face instruction with Telegram mobile instruction. Based on a TOEFL test, 60 English foreign language…

Descriptors: Comparative Analysis, Conventional Instruction, Teaching Methods, Computer Assisted Instruction

Source

Language Testing	4
English Language Teaching	3
Language Assessment Quarterly	3
Language Testing in Asia	2
Online Submission	2
Working Papers in TESOL &…	2
Assessing Writing	1
ETS Research Report Series	1
Education and Information…	1
Educational Research and…	1
English Teaching	1
Foreign Language Annals	1
International Association for…	1
International Journal of…	1
Iranian Journal of Language…	1
JALT CALL Journal	1
Journal of Speech, Language,…	1
Language Learning Journal	1
Language, Speech, and Hearing…	1
ReCALL	1
Reading & Writing Quarterly	1
Reading Matrix: An…	1
Research-publishing.net	1
Second Language Research	1
Studies in Second Language…	1

Author

Coniam, David	2
Adams, R. J.	1
Ahmadi, Alireza	1
Ahour, Touran	1
Alhaisoni, Eid	1
Alharthi, Saleh	1
Alt, Mary	1
Beltrán, Jorge	1
Bilginer, Hayriye	1
Breyer, F. Jay	1
Chalhoub-Deville, Micheline	1
Chung-You Tsai	1
Dubasik, Virginia L.	1
Elder, Catherine	1
Entezari Maleki, Saeideh	1
Ferroli, Lou	1
Figueroa, Cecilia	1
Gilbers, Steven	1
Granfeldt, Jonas	1
Gustilo, Leah E.	1
Halleck, Gene B.	1
Han, Qie	1
Heidari, Jamshid	1
Hsu, Lung-hsun	1
Huang, Lan-fen	1