ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	6
Since 2017 (last 10 years)	18
Since 2007 (last 20 years)	32

Source

Language Testing

Publication Type

Journal Articles	39
Reports - Research	32
Reports - Evaluative	6
Information Analyses	2
Tests/Questionnaires	2
Reports - Descriptive	1

Education Level

Higher Education	9
Postsecondary Education	6
Secondary Education	3
Elementary Education	2
Adult Education	1
Early Childhood Education	1
Elementary Secondary Education	1
Grade 6	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Location

Netherlands	5
Finland	3
Japan	2
Arizona	1
China	1
Europe	1
Georgia	1
Hong Kong	1
Illinois (Urbana)	1
India	1
Japan (Tokyo)	1
Ohio	1
South Korea	1
Sweden	1
Taiwan	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Graduate Record Examinations	1
Peabody Picture Vocabulary…	1

What Works Clearinghouse Rating

Language Testing X

Showing 31 to 39 of 39 results Save | Export

Rater Bias Patterns in an EFL Writing Assessment

Peer reviewed

Direct link

Schaefer, Edward – Language Testing, 2008

The present study employed multi-faceted Rasch measurement (MFRM) to explore the rater bias patterns of native English-speaker (NES) raters when they rate EFL essays. Forty NES raters rated 40 essays written by female Japanese university students on a single topic adapted from the TOEFL Test of Written English (TWE). The essays were assessed using…

Descriptors: Writing Evaluation, Writing Tests, Program Effectiveness, Essays

The Assessment of Writing Ability: Expert Readers versus Lay Readers.

Peer reviewed

Schoonen, Rob; And Others – Language Testing, 1997

Reports on three studies conducted in the Netherlands about the reading reliability of lay and expert readers in rating content and language usage of students' writing performances in three kinds of writing assignments. Findings reveal that expert readers are more reliable in rating usage, whereas both lay and expert readers are reliable raters of…

Descriptors: Foreign Countries, Interrater Reliability, Language Usage, Models

Interviewer Variation and the Co-construction of Speaking Proficiency.

Peer reviewed

Brown, Annie – Language Testing, 2003

Examines the question of variation among interviewers of oral language proficiency interviews in the ways that they elicit demonstrations of communicative ability and the impact of this variation on candidate performance and raters' perceptions of candidate ability. A discourse analysis of two interviews involving the same candidate with two…

Descriptors: Discourse Analysis, Interrater Reliability, Interviews, Language Proficiency

Accounting for Nonsystematic Error in Performance Ratings.

Peer reviewed

Henning, Grant – Language Testing, 1996

Analyzes simulated performance ratings on a six-point scale by two independent raters to account for nonsystematic error in performance ratings. Results suggest that rater agreement or covariance is not always a dependable estimate of score reliability and that the practice of seeking additional raters for adjudication of discrepant ratings is not…

Descriptors: Correlation, Error Patterns, Interrater Reliability, Language Tests

Using GENOVA and FACETS to Set Multiple Standards on Performance Assessment for Certification in Medical Translation from Japanese into English

Peer reviewed

Direct link

Kozaki ,Y. – Language Testing, 2004

This article presents a standard-setting procedure for performance assessment in a foreign language, through which some of the major problems facing performance assessment in criterion-referenced testing can be addressed. The procedure, which was geared to revealing and accommodating inter-judge variability, employed the synergy of multiple…

Descriptors: Data Analysis, Testing, Performance Tests, Generalizability Theory

Evaluating Rater Responses to an Online Training Program for L2 Writing Assessment

Peer reviewed

Direct link

Elder, Catherine; Barkhuizen, Gary; Knoch, Ute; von Randow, Janet – Language Testing, 2007

The use of online rater self-training is growing in popularity and has obvious practical benefits, facilitating access to training materials and rating samples and allowing raters to reorient themselves to the rating scale and self monitor their behaviour at their own convenience. However there has thus far been little research into rater…

Descriptors: Writing Evaluation, Writing Tests, Scoring Rubrics, Rating Scales

Testing the Language Proficiency of Bilingual Teachers: Arizona's Spanish Proficiency Test.

Peer reviewed

Grant, Leslie – Language Testing, 1997

Describes current procedures used for testing bilingual teachers in the United States and focuses on one means of assessment used in Arizona. Examinee questionnaire responses, teacher questionnaire responses and test section analysis all contributed evidence for validity. (33 references) (Author/CK)

Descriptors: Bilingualism, Criterion Referenced Tests, Interrater Reliability, Language Teachers

Validity Evidence in a University Group Oral Test

Peer reviewed

Direct link

Van Moere, Alistair – Language Testing, 2006

This article investigates a group oral test as administered at a university in Japan to find if it is appropriate to use scores for higher stakes decision making. It is one component of an in-house English proficiency test used for placing students, evaluating their progress, and making informed decisions for the development of the English…

Descriptors: Foreign Countries, Generalizability Theory, Achievement Tests, English (Second Language)

An Investigation of Planning Time and Proficiency Level on Oral Test Discourse.

Peer reviewed

Wigglesworth, Gillian – Language Testing, 1997

In this study, planning time was manipulated as a variable in a trial administration of a semi-direct oral interaction test. Discourse analytic techniques were used to determine the nature and/or significance of difference in the elicited discourse across two conditions in terms of complexity and accuracy. Findings suggest that planning time may…

Descriptors: Cognitive Development, Communicative Competence (Languages), Comparative Analysis, Discourse Analysis

« Previous Page | Next Page

Pages: 1 | 2 | 3

Attali, Yigal	2
Iasonas Lamprianou	2
Knoch, Ute	2
Reeta Neittaanmäki	2
Schoonen, Rob	2
Wind, Stefanie A.	2
Yan, Xun	2
de Jong, Nivja H.	2
Alanen, Riikka	1
Barkhuizen, Gary	1
Bosker, Hans Rutger	1
Brown, Annie	1
Carey, Michael D.	1
Chan, Stephanie W. Y.	1
Chapelle, Carol A.	1
Cheung, Wai Ming	1
Chuang, Ping-Lin	1
Davis, Larry	1
Deygers, Bart	1
Duijm, Klaartje	1
Dunn, Peter K.	1
Elder, Catherine	1
Erik Voss	1
Granfeldt, Jonas	1
More ▼

Interrater Reliability	39
Language Tests	23
Second Language Learning	21
English (Second Language)	15
Evaluators	15
Foreign Countries	15
Correlation	10
Rating Scales	9
Scoring	9
Writing Evaluation	9
Language Proficiency	8
Item Response Theory	7
Oral Language	7
Statistical Analysis	7
Writing Tests	7
Comparative Analysis	6
Language Teachers	5
Scores	5
Testing	5
Accuracy	4
Evaluation Methods	4
Expertise	4
Feedback (Response)	4
Generalizability Theory	4
High Stakes Tests	4
More ▼