ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	9
Since 2007 (last 20 years)	16

Descriptor

Rating Scales	19
Language Tests	14
Second Language Learning	11
English (Second Language)	9
Test Validity	8
Speech Communication	6
Validity	6
Foreign Countries	5
Interrater Reliability	5
Language Proficiency	5
Language Teachers	4
Oral Language	4
Test Construction	4
College Students	3
Construct Validity	3
Correlation	3
Discourse Analysis	3
Evaluators	3
Factor Analysis	3
Grammar	3
Role Playing	3
Scores	3
Second Language Instruction	3
Spanish	3
Testing	3
More ▼

Source

Language Testing

Publication Type

Journal Articles	19
Reports - Research	15
Reports - Evaluative	3
Information Analyses	1
Reports - Descriptive	1

Education Level

Higher Education	2
Elementary Secondary Education	1
Postsecondary Education	1

Audience

Location

Arizona	1
Europe	1
Finland	1
Iran	1
Netherlands	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Comparing Two Formats of Data-Driven Rating Scales for Classroom Assessment of Pragmatic Performance with Roleplays

Peer reviewed

Direct link

Yunwen Su; Sun-Young Shin – Language Testing, 2024

Rating scales that language testers design should be tailored to the specific test purpose and score use as well as reflect the target construct. Researchers have long argued for the value of data-driven scales for classroom performance assessment, because they are specific to pedagogical tasks and objectives, have rich descriptors to offer useful…

Descriptors: Rating Scales, Language Tests, Test Construction, Performance Based Assessment

Assessing Speaking through Multimodal Oral Presentations: The Case of Construct Underrepresentation in EAP Contexts

Peer reviewed

Direct link

Louise Palmour – Language Testing, 2024

This article explores the nature of the construct underlying classroom-based English for academic purpose (EAP) oral presentation assessments, which are used, in part, to determine admission to programmes of study at UK universities. Through analysis of qualitative data (from questionnaires, interviews, rating discussions, and fieldnotes), the…

Descriptors: English for Academic Purposes, Public Speaking, College Students, Foreign Countries

Setting Standards for a Diagnostic Test of Aviation English for Student Pilots

Peer reviewed

Direct link

Maria Treadaway; John Read – Language Testing, 2024

Standard-setting is an essential component of test development, supporting the meaningfulness and appropriate interpretation of test scores. However, in the high-stakes testing environment of aviation, standard-setting studies are underexplored. To address this gap, we document two stages in the standard-setting procedures for the Overseas Flight…

Descriptors: Standard Setting, Diagnostic Tests, High Stakes Tests, English for Special Purposes

Validation of Rating Processes within an Argument-Based Framework

Peer reviewed

Direct link

Knoch, Ute; Chapelle, Carol A. – Language Testing, 2018

Argument-based validation requires test developers and researchers to specify what is entailed in test interpretation and use. Doing so has been shown to yield advantages (Chapelle, Enright, & Jamieson, 2010), but it also requires an analysis of how the concerns of language testers can be conceptualized in the terms used to construct a…

Descriptors: Test Validity, Language Tests, Evaluation Research, Rating Scales

Towards More Valid Scoring Criteria for Integrated Reading-Writing and Listening-Writing Summary Tasks

Peer reviewed

Direct link

Chan, Sathena; May, Lyn – Language Testing, 2023

Despite the increased use of integrated tasks in high-stakes academic writing assessment, research on rating criteria which reflect the unique construct of integrated summary writing skills is comparatively rare. Using a mixed-method approach of expert judgement, text analysis, and statistical analysis, this study examines writing features that…

Descriptors: Scoring, Writing Evaluation, Reading Tests, Listening Skills

Developing a Level-Specific Checklist for Assessing EFL Writing

Peer reviewed

Direct link

Lukácsi, Zoltán – Language Testing, 2021

In second language writing assessment, rating scales and scores from human-mediated assessment have been criticized for a number of shortcomings including problems with adequacy, relevance, and reliability (Hamp-Lyons, 1990; McNamara, 1996; Weigle, 2002). In its testing practice, Euroexam International also detected that the rating scales for…

Descriptors: Test Construction, Test Validity, Test Items, Check Lists

A Comparison of Holistic, Analytic, and Part Marking Models in Speaking Assessment

Peer reviewed

Direct link

Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020

This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…

Descriptors: Holistic Approach, Classification, Grading, Language Tests

Dimensionality of Speech Fluency: Examining the Relationships among Complexity, Accuracy, and Fluency (CAF) Features of Speaking Performances on the Aptis Test

Peer reviewed

Direct link

Yan, Xun; Kim, Ha Ram; Kim, Ji Young – Language Testing, 2021

Speech fluency has been extensively researched as a core construct for second language (L2) speaking assessment. Despite the broad consensus on its multifaceted nature, few researchers have empirically explored the dimensionality of this construct. Operationalizations of fluency vary across research and practice, using both holistic and…

Descriptors: Language Fluency, Language Tests, Accuracy, Speech Communication

Critical Language Assessment Literacy of EFL Teachers: Scale Construction and Validation

Peer reviewed

Direct link

Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022

Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

Determining the Scoring Validity of a Co-Constructed CEFR-Based Rating Scale

Peer reviewed

Direct link

Deygers, Bart; Van Gorp, Koen – Language Testing, 2015

Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…

Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability

Standard Setting in Specific-Purpose Language Testing: What Can a Qualitative Study Add?

Peer reviewed

Direct link

Manias, Elizabeth; McNamara, Tim – Language Testing, 2016

This paper explores the views of nursing and medical domain experts in considering the standards for a specific-purpose English language screening test, the Occupational English Test (OET), for professional registration for immigrant health professionals. Since individuals who score performances in the test setting are often language experts…

Descriptors: Standard Setting, Academic Standards, English for Special Purposes, Language Tests

Assessing Learners' Writing Skills in a SLA Study: Validating the Rating Process across Tasks, Scales and Languages

Peer reviewed

Direct link

Huhta, Ari; Alanen, Riikka; Tarnanen, Mirja; Martin, Maisa; Hirvelä, Tuija – Language Testing, 2014

There is still relatively little research on how well the CEFR and similar holistic scales work when they are used to rate L2 texts. Using both multifaceted Rasch analyses and qualitative data from rater comments and interviews, the ratings obtained by using a CEFR-based writing scale and the Finnish National Core Curriculum scale for L2 writing…

Descriptors: Foreign Countries, Writing Skills, Second Language Learning, Finno Ugric Languages

Investigating the Validity of an Integrated Listening-Speaking Task: A Discourse-Based Analysis of Test Takers' Oral Performances

Peer reviewed

Direct link

Frost, Kellie; Elder, Catherine; Wigglesworth, Gillian – Language Testing, 2012

Performance on integrated tasks requires candidates to engage skills and strategies beyond language proficiency alone, in ways that can be difficult to define and measure for testing purposes. While it has been widely recognized that stimulus materials impact test performance, our understanding of the way in which test takers make use of these…

Descriptors: Speech Communication, Language Tests, Testing, Rating Scales

Effective Rating Scale Development for Speaking Tests: Performance Decision Trees

Peer reviewed

Direct link

Fulcher, Glenn; Davidson, Fred; Kemp, Jenny – Language Testing, 2011

Rating scale design and development for testing speaking is generally conducted using one of two approaches: the measurement-driven approach or the performance data-driven approach. The measurement-driven approach prioritizes the ordering of descriptors onto a single scale. Meaning is derived from the scaling methodology and the agreement of…

Descriptors: Speech Communication, Rating Scales, Inferences, English (Second Language)

Assessing Paired Orals: Raters' Orientation to Interaction

Peer reviewed

Direct link

Ducasse, Ana Maria; Brown, Annie – Language Testing, 2009

Speaking tasks involving peer-to-peer candidate interaction are increasingly being incorporated into language proficiency assessments, in both large-scale international testing contexts, and in smaller-scale, for example course-related, ones. This growth in the popularity and use of paired and group orals has stimulated research, particularly into…

Descriptors: Oral Language, Interpersonal Communication, Second Language Learning, Language Tests

Previous Page | Next Page »

Pages: 1 | 2

Fulcher, Glenn	2
Alanen, Riikka	1
Brown, Annie	1
Chan, Sathena	1
Chapelle, Carol A.	1
Davidson, Fred	1
Deygers, Bart	1
Ducasse, Ana Maria	1
Elder, Catherine	1
Frost, Kellie	1
Galaczi, Evelina D.	1
Grant, Leslie	1
Henning, Grant	1
Hirvelä, Tuija	1
Huhta, Ari	1
John Read	1
Kemp, Jenny	1
Khabbazbashi, Nahal	1
Khatib, Mohammad	1
Kim, Ha Ram	1
Kim, Ji Young	1
Knoch, Ute	1
Louise Palmour	1
Lukácsi, Zoltán	1
Mahdavi, Mohsen	1
More ▼