Showing 16 to 30 of 39 results
Peer reviewed
Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019
Because judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error arising from rater background and social attitudes. The present study seeks first to estimate the variance attributable to rater background and attitudinal variables in novice raters' assessments of L2…
Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)
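The abstract above does not specify the statistical model used. As a rough, hypothetical sketch of how variance attributable to rater background and attitudinal variables might be quantified, the snippet below reports the R² of a linear model fitted to simulated rating data; all variable names and values are invented for illustration and are not from the study.

```python
# Hypothetical illustration (not the authors' analysis): estimating how much
# variance in oral proficiency ratings is attributable to rater-background
# variables, using the R^2 of an ordinary least-squares fit on simulated data.
import numpy as np

rng = np.random.default_rng(0)
n = 200  # simulated novice raters

# Simulated background predictors: years of teaching experience and an attitude score.
experience = rng.normal(5, 2, n)
attitude = rng.normal(0, 1, n)
rating = 3.0 + 0.10 * experience + 0.30 * attitude + rng.normal(0, 0.5, n)

def r_squared(X, y):
    """R^2 of an ordinary least-squares fit of y on X (with intercept)."""
    X = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return 1 - resid.var() / y.var()

r2_background = r_squared(np.column_stack([experience, attitude]), rating)
print(f"Variance attributable to rater background/attitude: {r2_background:.2f}")
```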
Peer reviewed
Davis, Larry – Language Testing, 2016
This study investigated two factors thought to contribute to consistency in rater scoring judgments: rater training and scoring experience. It also considered the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Descriptors: Evaluators, Oral Language, Scores, Language Tests
Peer reviewed
Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017
Several authors have observed the importance of functional adequacy as a component of L2 proficiency (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that writing proficiency in an L2 cannot be fully assessed without taking into account the…
Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse
Peer reviewed
Attali, Yigal – Language Testing, 2016
A short training program for evaluating responses to an essay writing task consisted of scoring 20 training essays with immediate feedback about the correct score. The same scoring session also served as a certification test for trainees. Participants with little or no previous rating experience completed this session, and 14 trainees who passed an…
Descriptors: Writing Evaluation, Writing Tests, Standardized Tests, Evaluators
Peer reviewed
Yan, Xun – Language Testing, 2014
This paper reports on a mixed-methods approach to evaluate rater performance on a local oral English proficiency test. Three types of reliability estimates were reported to examine rater performance from different perspectives. Quantitative results were also triangulated with qualitative rater comments to arrive at a more representative picture of…
Descriptors: Mixed Methods Research, Language Tests, Oral Language, Language Proficiency
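The truncated abstract does not name the three reliability estimates reported. As an illustration only, the sketch below computes three indices commonly used for paired rater scores: exact agreement, Pearson correlation, and Cohen's kappa. These may differ from the estimates in the study, and the scores are invented.

```python
# Illustrative rater reliability indices for two raters scoring the same responses.
import numpy as np

rater_a = np.array([3, 4, 4, 2, 5, 3, 4, 3, 2, 5])
rater_b = np.array([3, 4, 3, 2, 5, 3, 4, 4, 2, 4])

# 1) Exact agreement rate
exact_agreement = np.mean(rater_a == rater_b)

# 2) Pearson correlation between the two raters' scores (consistency)
pearson_r = np.corrcoef(rater_a, rater_b)[0, 1]

# 3) Cohen's kappa (agreement corrected for chance)
categories = np.union1d(rater_a, rater_b)
p_o = exact_agreement
p_e = sum(np.mean(rater_a == c) * np.mean(rater_b == c) for c in categories)
kappa = (p_o - p_e) / (1 - p_e)

print(f"exact agreement={exact_agreement:.2f}, r={pearson_r:.2f}, kappa={kappa:.2f}")
```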
Peer reviewed
Attali, Yigal; Lewis, Will; Steier, Michael – Language Testing, 2013
Automated essay scoring can produce reliable scores that are highly correlated with human scores, but is limited in its evaluation of content and other higher-order aspects of writing. The increased use of automated essay scoring in high-stakes testing underscores the need for human scoring that is focused on higher-order aspects of writing. This…
Descriptors: Scoring, Essay Tests, Reliability, High Stakes Tests
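As background to the claim that automated essay scores can be "highly correlated with human scores," the sketch below shows two agreement statistics often reported for human-machine score comparisons: Pearson correlation and quadratic weighted kappa. This is a generic illustration, not the analysis used in the article, and the example scores are invented.

```python
# Generic human-machine agreement statistics for essay scores on a 1-5 scale.
import numpy as np

human = np.array([2, 3, 4, 3, 5, 1, 4, 2, 3, 4])
machine = np.array([2, 3, 3, 3, 5, 2, 4, 2, 4, 4])

# Pearson correlation between human and automated scores
r = np.corrcoef(human, machine)[0, 1]

def quadratic_weighted_kappa(a, b, min_score, max_score):
    """Quadratic weighted kappa for integer scores on a fixed scale."""
    scores = np.arange(min_score, max_score + 1)
    k = len(scores)
    # Observed joint distribution and expected distribution from the marginals
    observed = np.zeros((k, k))
    for x, y in zip(a, b):
        observed[x - min_score, y - min_score] += 1
    observed /= observed.sum()
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0))
    # Quadratic disagreement weights
    weights = (scores[:, None] - scores[None, :]) ** 2 / (k - 1) ** 2
    return 1 - (weights * observed).sum() / (weights * expected).sum()

qwk = quadratic_weighted_kappa(human, machine, 1, 5)
print(f"Pearson r={r:.2f}, quadratic weighted kappa={qwk:.2f}")
```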
Peer reviewed
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions addressed are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability
Peer reviewed
Jarvis, Scott – Language Testing, 2017
The present study discusses the relevance of measures of lexical diversity (LD) to the assessment of learner corpora. It also argues that existing measures of LD, many of which have become specialized for use with language corpora, are fundamentally measures of lexical repetition, are based on an etic perspective of language, and lack construct…
Descriptors: Computational Linguistics, English (Second Language), Second Language Learning, Native Speakers
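To illustrate the point that many lexical diversity indices are "fundamentally measures of lexical repetition," the snippet below computes two generic indices: the type-token ratio and a moving-window variant. These are standard textbook measures, not the measures proposed or critiqued in the article.

```python
# Illustrative only: two generic lexical diversity indices, both of which
# quantify how often word types are repeated within a text.
def type_token_ratio(tokens):
    """Proportion of distinct word types among all tokens."""
    return len(set(tokens)) / len(tokens)

def moving_average_ttr(tokens, window=50):
    """Mean TTR over fixed-size windows; less sensitive to text length."""
    if len(tokens) < window:
        return type_token_ratio(tokens)
    ttrs = [type_token_ratio(tokens[i:i + window])
            for i in range(len(tokens) - window + 1)]
    return sum(ttrs) / len(ttrs)

text = "the cat sat on the mat and the dog sat on the rug".split()
print(type_token_ratio(text), moving_average_ttr(text, window=5))
```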
Peer reviewed
Hsieh, Mingchuan – Language Testing, 2013
Two major concerns arise when implementing standard-setting procedures: variance between panelists and the efficiency of conducting multiple rounds of judgments. Regarding the former, the concern is whether the cut scores set by different panelists are consistent. If the cut scores show an inordinately wide range, then further rounds…
Descriptors: Item Response Theory, Standard Setting (Scoring), Language Tests, English (Second Language)
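As a hypothetical illustration of the consistency concern described above, the sketch below summarizes a set of invented panelist cut scores and applies an assumed spread threshold to decide whether a further round of judgments is warranted. Neither the data nor the threshold comes from the study.

```python
# Hypothetical panelist cut scores (one per panelist) after a round of judgments.
import numpy as np

cut_scores = np.array([62, 65, 58, 70, 64, 61, 66, 63])

mean_cut = cut_scores.mean()
spread = cut_scores.std(ddof=1)                      # between-panelist variability
score_range = cut_scores.max() - cut_scores.min()    # widest disagreement

# Hypothetical decision rule: if spread is wide, run another round of judgments.
MAX_ACCEPTABLE_SD = 3.0
needs_another_round = spread > MAX_ACCEPTABLE_SD

print(f"mean cut={mean_cut:.1f}, SD={spread:.1f}, range={score_range}, "
      f"another round needed: {needs_another_round}")
```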
Peer reviewed
Granfeldt, Jonas; Ågren, Malin – Language Testing, 2014
One core area of research in Second Language Acquisition is the identification and definition of developmental stages in different L2s. For L2 French, Bartning and Schlyter (2004) presented a model of six morphosyntactic stages of development in the form of grammatical profiles. The model formed the basis for the computer program Direkt Profil…
Descriptors: Second Language Learning, Language Tests, French, Language Teachers
Peer reviewed
Malone, Margaret E. – Language Testing, 2013
Language assessment literacy refers to language instructors' familiarity with testing definitions and the application of this knowledge to classroom practice in general, and specifically to issues of language assessment. While it is widely agreed that classroom teachers need to assess student progress, many teachers and other test…
Descriptors: Literacy, Language Tests, Interviews, Feedback (Response)
Peer reviewed
Huhta, Ari; Alanen, Riikka; Tarnanen, Mirja; Martin, Maisa; Hirvelä, Tuija – Language Testing, 2014
There is still relatively little research on how well the CEFR and similar holistic scales work when they are used to rate L2 texts. Using both multifaceted Rasch analyses and qualitative data from rater comments and interviews, the ratings obtained by using a CEFR-based writing scale and the Finnish National Core Curriculum scale for L2 writing…
Descriptors: Foreign Countries, Writing Skills, Second Language Learning, Finno Ugric Languages
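For readers unfamiliar with multifaceted Rasch analysis, a standard formulation of the many-facet Rasch model (after Linacre) is given below as background; the exact facets and parameterization used in the study may differ.

```latex
% Many-facet Rasch model (standard formulation; the facets shown are illustrative):
% log-odds of examinee n receiving category k rather than k-1 on task i from rater j.
\[
  \log\!\left(\frac{P_{nijk}}{P_{nij(k-1)}}\right) = B_n - D_i - C_j - F_k
\]
% B_n: writer ability, D_i: task difficulty, C_j: rater severity,
% F_k: difficulty of scale step k relative to step k-1.
```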
Peer reviewed
Katzenberger, Irit; Meilijson, Sara – Language Testing, 2014
The Katzenberger Hebrew Language Assessment for Preschool Children (henceforth the KHLA) is the first comprehensive, standardized language assessment tool developed in Hebrew specifically for older preschoolers (4;0-5;11 years). The KHLA is a norm-referenced, Hebrew-specific assessment, based on well-established psycholinguistic principles, as…
Descriptors: Semitic Languages, Preschool Children, Language Impairments, Language Tests
Peer reviewed
Pinget, Anne-France; Bosker, Hans Rutger; Quené, Hugo; de Jong, Nivja H. – Language Testing, 2014
Oral fluency and foreign accent distinguish L2 from L1 speech production. In language testing practices, both fluency and accent are usually assessed by raters. This study investigates what exactly native raters of fluency and accent take into account when judging L2. Our aim is to explore the relationship between objectively measured temporal,…
Descriptors: Native Speakers, Language Fluency, Suprasegmentals, Second Language Learning
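The truncated abstract refers to "objectively measured temporal, …" properties without enumerating them. As a hedged illustration of temporal measures commonly used in fluency research (not necessarily those analyzed in this study), the sketch below computes speech rate, articulation rate, and mean pause duration from assumed syllable counts and pause durations.

```python
# Hypothetical illustration of common temporal fluency measures; the specific
# measures analyzed in the study are not listed in the truncated abstract.
def temporal_measures(n_syllables, total_time_s, pause_durations_s):
    """Speech rate, articulation rate, and mean pause duration."""
    total_pause = sum(pause_durations_s)
    phonation_time = total_time_s - total_pause
    return {
        "speech_rate": n_syllables / total_time_s,          # syllables/s, pauses included
        "articulation_rate": n_syllables / phonation_time,  # syllables/s, pauses excluded
        "mean_pause_duration": (total_pause / len(pause_durations_s)
                                if pause_durations_s else 0.0),
    }

print(temporal_measures(n_syllables=95, total_time_s=40.0,
                        pause_durations_s=[0.6, 0.8, 1.1, 0.5]))
```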
Peer reviewed
Carey, Michael D.; Mannell, Robert H.; Dunn, Peter K. – Language Testing, 2011
This study investigated factors that could affect inter-examiner reliability in the pronunciation assessment component of speaking tests. We hypothesized that pronunciation ratings are susceptible to variation depending on the amount of exposure examiners have had to nonnative English accents. An inter-rater variability analysis was…
Descriptors: Oral Language, Pronunciation, Phonology, Interlanguage