Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 16 |
Descriptor
| Interrater Reliability | 19 |
| Language Tests | 19 |
| Statistical Analysis | 19 |
| Second Language Learning | 14 |
| English (Second Language) | 12 |
| Second Language Instruction | 11 |
| Foreign Countries | 10 |
| Oral Language | 9 |
| Evaluators | 7 |
| Language Proficiency | 6 |
| Computer Assisted Testing | 5 |
| More ▼ | |
Source
Author
| Ahmadi, Alireza | 1 |
| Ahour, Touran | 1 |
| Chambers, Francine | 1 |
| Clevinger, Amanda | 1 |
| Coniam, David | 1 |
| Cordier, Deborah | 1 |
| Crossley, Scott | 1 |
| Davis, Larry | 1 |
| Entezari Maleki, Saeideh | 1 |
| Granfeldt, Jonas | 1 |
| Hou, Leijuan | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 17 |
| Reports - Research | 16 |
| Tests/Questionnaires | 3 |
| Books | 1 |
| Collected Works - General | 1 |
| Dissertations/Theses -… | 1 |
| Information Analyses | 1 |
| Reports - Descriptive | 1 |
Education Level
| Higher Education | 8 |
| Postsecondary Education | 7 |
| Secondary Education | 2 |
Audience
| Practitioners | 1 |
| Teachers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Test of English as a Foreign… | 5 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Wind, Stefanie A.; Peterson, Meghan E. – Language Testing, 2018
The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on…
Descriptors: Language Tests, Evaluators, Evaluation Methods, Interrater Reliability
Davis, Larry – Language Testing, 2016
Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Descriptors: Evaluators, Oral Language, Scores, Language Tests
Hubert, Michael D.; Vigil, Donny – Applied Language Learning, 2017
Adult second/foreign language acquisition is extremely challenging for many learners, with pronunciation often the one aspect in which otherwise very proficient language users fall short. Although a great deal of research has found that formal instruction in phonetics/phonology may improve learner pronunciation, no study has yet addressed the…
Descriptors: Teaching Methods, Second Language Learning, Second Language Instruction, Pronunciation Instruction
Li, Hui – English Language Teaching, 2016
The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making
Syllabification of Final Consonant Clusters: A Salient Pronunciation Problem of Kurdish EFL Learners
Keshavarz, Mohammad Hossein – Iranian Journal of Language Teaching Research, 2017
While there is a plethora of research on pronunciation problems of EFL learners with different L1 backgrounds, published empirical studies on syllabification errors of Iraqi Kurdish EFL learners are scarce. Therefore, to contribute to this line of research, the present study set out to investigate difficulties of this group of learners in the…
Descriptors: Phonemes, English (Second Language), Second Language Learning, Second Language Instruction
Ahmadi, Alireza; Sadeghi, Elham – Language Assessment Quarterly, 2016
In the present study we investigated the effect of test format on oral performance in terms of test scores and discourse features (accuracy, fluency, and complexity). Moreover, we explored how the scores obtained on different test formats relate to such features. To this end, 23 Iranian EFL learners participated in three test formats of monologue,…
Descriptors: Oral Language, Comparative Analysis, Language Fluency, Accuracy
Prieto, Gerardo; Nieto, Eloísa – Psicologica: International Journal of Methodology and Experimental Psychology, 2014
This paper describes how a Many Faceted Rasch Measurement (MFRM) approach can be applied to performance assessment focusing on rater analysis. The article provides an introduction to MFRM, a description of MFRM analysis procedures, and an example to illustrate how to examine the effects of various sources of variability on test takers' performance…
Descriptors: Item Response Theory, Interrater Reliability, Rating Scales, Error of Measurement
Tabari, Mahmoud Abdi – TESL-EJ, 2017
Much research has investigated the role of planning time in second language writing; however, the results show that there are inconsistent findings about the effects of planning time conditions on the complexity of the EFL learners' textual output. The current study attempted to consider the differential effects of planning time conditions in…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Persuasive Discourse
Granfeldt, Jonas; Ågren, Malin – Language Testing, 2014
One core area of research in Second Language Acquisition is the identification and definition of developmental stages in different L2s. For L2 French, Bartning and Schlyter (2004) presented a model of six morphosyntactic stages of development in the shape of grammatical profiles. The model formed the basis for the computer program Direkt Profil…
Descriptors: Second Language Learning, Language Tests, French, Language Teachers
Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014
There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…
Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)
Ahour, Touran; Entezari Maleki, Saeideh – English Language Teaching, 2014
This study attempted to unveil the effect of metadiscourse instruction on the improvement of the speaking ability of Iranian EFL learners. After the administration of a language proficiency test, 34 homogeneous participants were assigned into the experimental and control groups. Then, the two groups were compared on their speaking ability. After…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction
Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013
Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…
Descriptors: Oral Language, Language Proficiency, Scaling, Scores
Coniam, David – New Horizons in Education, 2011
Background: This article reports a study into the double marking of Liberal Studies in Hong Kong. This is now a compulsory subject in Hong Kong's Years 10-12 curriculum which, when first examined in the new Hong Kong Diploma of Secondary Education in 2012, will increase its candidature from its current 3,300 to 80,000. Aims: To examine the…
Descriptors: Tests, Foreign Countries, English (Second Language), Second Language Learning
Wang, Ping – English Language Teaching, 2009
This paper makes a study of the rater reliability in scoring composition in the test of English as a foreign language (EFL) and focuses on the inter-rater reliability as well as several interactions between raters and the other facets involved (that is examinees, rating criteria and rating methods). Results showed that raters were fairly…
Descriptors: Interrater Reliability, Scoring, Writing (Composition), English (Second Language)
Cordier, Deborah – ProQuest LLC, 2009
A renewed focus on foreign language (FL) learning and speech for communication has resulted in computer-assisted language learning (CALL) software developed with Automatic Speech Recognition (ASR). ASR features for FL pronunciation (Lafford, 2004) are functional components of CALL designs used for FL teaching and learning. The ASR features…
Descriptors: Feedback (Response), Computer Assisted Instruction, Validity, Computer Software
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
