ERIC - Search Results

Showing 1 to 15 of 58 results

Do Source Use Features Impact Raters' Judgment of Argumentation? An Experimental Study

Peer reviewed

Ping-Lin Chuang – Language Testing, 2025

This experimental study explores how source use features impact raters' judgment of argumentation in a second language (L2) integrated writing test. One hundred four experienced and novice raters were recruited to complete a rating task that simulated the scoring assignment of a local English Placement Test (EPT). Sixty written responses were…

Descriptors: Interrater Reliability, Evaluators, Information Sources, Primary Sources

Revisiting Raters' Accent Familiarity in Speaking Tests: Evidence That Presentation Mode Interacts with Accent Familiarity to Variably Affect Comprehensibility Ratings

Peer reviewed

Michael D. Carey; Stefan Szocs – Language Testing, 2024

This controlled experimental study investigated the interaction of variables associated with rating the pronunciation component of high-stakes English-language-speaking tests such as IELTS and TOEFL iBT. One hundred experienced raters who were all either familiar or unfamiliar with Brazilian-accented English or Papua New Guinean Tok Pisin-accented…

Descriptors: Dialects, Pronunciation, Suprasegmentals, Familiarity

Assessing the Content Quality of Essays in Content and Language Integrated Learning: Exploring the Construct from Subject Specialists' Perspectives

Peer reviewed

Takanori Sato – Language Testing, 2024

Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…

Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests

Making Each Point Count: Revising a Local Adaptation of the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE Rubric

Peer reviewed

Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024

In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…

Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)

"How Do Raters Learn to Rate?" Many-Facet Rasch Modeling of Rater Performance over the Course of a Rater Certification Program

Peer reviewed

Yan, Xun; Chuang, Ping-Lin – Language Testing, 2023

This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program.…

Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Certification

Challenges in Rating Signed Production: A Mixed-Methods Study of a Swiss German Sign Language Form-Recall Vocabulary Test

Peer reviewed

Batty, Aaron Olaf; Haug, Tobias; Ebling, Sarah; Tissi, Katja; Sidler-Miserez, Sandra – Language Testing, 2023

Sign languages present particular challenges to language assessors in relation to variation in signs, weakly defined citation forms, and a general lack of standard-setting work even in long-established measures of productive sign proficiency. The present article addresses and explores these issues via a mixed-methods study of a human-rated…

Descriptors: Sign Language, Language Tests, Standard Setting, Barriers

The Longitudinal Stability of Rating Characteristics in an EFL Examination: Methodological and Substantive Considerations

Peer reviewed

Lamprianou, Iasonas; Tsagari, Dina; Kyriakou, Nansia – Language Testing, 2021

This longitudinal study (2002-2014) investigates the stability of rating characteristics of a large group of raters over time in the context of the writing paper of a national high-stakes examination. The study uses one measure of rater severity and two measures of rater consistency. The results suggest that the rating characteristics of…

Descriptors: Longitudinal Studies, Evaluators, High Stakes Tests, Writing Evaluation

Temporal Fluency and Floor/Ceiling Scoring of Intermediate and Advanced Speech on the ACTFL Spanish Oral Proficiency Interview--Computer

Peer reviewed

Cox, Troy L.; Brown, Alan V.; Thompson, Gregory L. – Language Testing, 2023

The rating of proficiency tests that use the Interagency Language Roundtable (ILR) and American Council on the Teaching of Foreign Languages (ACTFL) guidelines claims that each major level is based on hierarchical linguistic functions that require mastery of multidimensional traits in such a way that each level subsumes the levels beneath it. These…

Descriptors: Oral Language, Language Fluency, Scoring, Cues

Evaluating the Impact of Nonverbal Behavior on Language Ability Ratings

Peer reviewed

J. Dylan Burton – Language Testing, 2024

Nonverbal behavior can impact language proficiency scores in speaking tests, but there is little empirical information of the size or consistency of its effects or whether language proficiency may be a moderating variable. In this study, 100 novice raters watched and scored 30 recordings of test takers taking an international, high stakes…

Descriptors: Nonverbal Ability, Language Fluency, Second Language Learning, Language Proficiency

Comparing Holistic and Analytic Marking Methods in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023

This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…

Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese

A Comparison of Holistic, Analytic, and Part Marking Models in Speaking Assessment

Peer reviewed

Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020

This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…

Descriptors: Holistic Approach, Classification, Grading, Language Tests

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

Speaking in Turns and Sequences: Interactional Competence as a Target Construct in Testing Speaking

Peer reviewed

Roever, Carsten; Kasper, Gabriele – Language Testing, 2018

In the assessment of speaking, a psycholinguistically based speaking construct has predominated. In this paper, we argue for the integration of the construct of interactional competence (IC) in speaking assessments to broaden the range of defensible inferences from speaking tests. IC emphasizes the co-constructed nature of interaction and enables…

Descriptors: Language Tests, Testing, Second Language Learning, Language Proficiency

What the Analytic versus Holistic Scoring of International Teaching Assistants Can Reveal: Lexical Grammar Matters

Peer reviewed

Ma, Wenyue – Language Testing, 2022

Second-language (L2) testing researchers have explored the relationship between speakers' overall speaking ability, reflected by holistic scores, and the speakers' performance on speaking subcomponents, reflected by analytic scores (e.g., McNamara, 1990; Sato, 2011). These research studies have advanced applied linguists' understanding of how…

Descriptors: Language Tests, Teaching Assistants, Second Language Learning, Second Language Instruction

Developing Tools for Learning Oriented Assessment of Interactional Competence: Bridging Theory and Practice

Peer reviewed

May, Lyn; Nakatsuhara, Fumiyo; Lam, Daniel; Galaczi, Evelina – Language Testing, 2020

In this paper we report on a project in which we developed tools to support the classroom assessment of learners' interactional competence (IC) and provided learning oriented feedback in the context of preparation for a high-stakes face-to-face speaking test. Six trained examiners provided stimulated verbal reports (n = 72) on 12 paired…

Descriptors: Intercultural Communication, High Stakes Tests, Feedback (Response), Evaluators
