Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 3 |
Descriptor
| Evaluation Methods | 6 |
| Interrater Reliability | 6 |
| Language Proficiency | 6 |
| Language Tests | 5 |
| Evaluators | 4 |
| Second Language Learning | 4 |
| English (Second Language) | 3 |
| Oral Language | 3 |
| Correlation | 2 |
| Foreign Countries | 2 |
| Rating Scales | 2 |
| More ▼ | |
Source
| Canadian Modern Language… | 2 |
| ETS Research Report Series | 1 |
| Language Education &… | 1 |
| Language Testing | 1 |
| System | 1 |
Author
| Magnan, Sally Sieloff | 2 |
| Bejar, Isaac I. | 1 |
| Hemat, Ramin | 1 |
| Jafarpur, Abdoljavad | 1 |
| Kuiken, Folkert | 1 |
| Sheehan, Susan | 1 |
| Thai, Thuy | 1 |
| Vedder, Ineke | 1 |
| Zechner, Klaus | 1 |
Publication Type
| Journal Articles | 6 |
| Reports - Research | 4 |
| Reports - Evaluative | 2 |
| Tests/Questionnaires | 1 |
Education Level
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022
In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…
Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction
Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017
The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…
Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse
Peer reviewedJafarpur, Abdoljavad – System, 1988
Investigation of non-native English speakers' ratings of other non-native English learners' oral proficiency. Results indicate that the judges' ratings significantly differed, and the average of three judges' ratings was a better appraisal of the testee's true ability than that of any single rating or pair of ratings. (Author/CB)
Descriptors: English (Second Language), Evaluation Methods, Foreign Countries, Interrater Reliability
Peer reviewedMagnan, Sally Sieloff – Canadian Modern Language Review, 1987
Differences between the academic (American Council on the Teaching of Foreign Languages) and government (Foreign Service Institute) versions of the oral proficiency interview test are examined, and data from two studies of interrater reliability are presented and discussed. (MSE)
Descriptors: Evaluation Methods, Interrater Reliability, Language Proficiency, Language Tests
Peer reviewedMagnan, Sally Sieloff – Canadian Modern Language Review, 1987
Differences in procedures used by academic institutions and government agencies in administering the American Council on the Teaching of Foreign Languages' Oral Proficiency Interview test are examined, and results and implications of two studies of interrater reliability are discussed. (MSE)
Descriptors: Comparative Analysis, Correlation, Evaluation Methods, Evaluators
Zechner, Klaus; Bejar, Isaac I.; Hemat, Ramin – ETS Research Report Series, 2007
The increasing availability and performance of computer-based testing has prompted more research on the automatic assessment of language and speaking proficiency. In this investigation, we evaluated the feasibility of using an off-the-shelf speech-recognition system for scoring speaking prompts from the LanguEdge field test of 2002. We first…
Descriptors: Role, Computer Assisted Testing, Language Proficiency, Oral Language

Direct link
