Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 68 |
| Since 2017 (last 10 years) | 147 |
| Since 2007 (last 20 years) | 255 |
Descriptor
Source
| Language Testing | 361 |
Author
| Cho, Yeonsuk | 5 |
| McNamara, Tim | 5 |
| Pill, John | 5 |
| Wigglesworth, Gillian | 5 |
| Yan, Xun | 5 |
| Brunfaut, Tineke | 4 |
| Crossley, Scott A. | 4 |
| Elder, Catherine | 4 |
| Ginther, April | 4 |
| Kormos, Judit | 4 |
| Ockey, Gary J. | 4 |
| More ▼ | |
Publication Type
| Journal Articles | 361 |
| Reports - Research | 361 |
| Tests/Questionnaires | 27 |
| Reports - Descriptive | 3 |
| Information Analyses | 2 |
| Speeches/Meeting Papers | 2 |
| Opinion Papers | 1 |
Education Level
Audience
Location
| China | 24 |
| Japan | 24 |
| Australia | 18 |
| Canada | 10 |
| South Korea | 10 |
| United Kingdom | 8 |
| Europe | 6 |
| Netherlands | 5 |
| Turkey | 5 |
| United States | 5 |
| Germany | 4 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Erik Voss – Language Testing, 2025
An increasing number of language testing companies are developing and deploying deep learning-based automated essay scoring systems (AES) to replace traditional approaches that rely on handcrafted feature extraction. However, there is hesitation to accept neural network approaches to automated essay scoring because the features are automatically…
Descriptors: Artificial Intelligence, Automation, Scoring, English (Second Language)
Jianmin Gao; Peijian Paul Sun; Chenxin Li – Language Testing, 2025
Second language (L2) utterance fluency is crucial for speaking proficiency assessment. The measurement of L2 utterance fluency relies heavily on silent pause identification. However, empirical studies establishing specific silent pause thresholds for L2 monologic speaking are scarce, and even fewer exist for L2 dialogic speaking. This study thus…
Descriptors: Second Language Learning, Language Fluency, Communicative Competence (Languages), Oral Language
Emma Bruce; Karen Dunn; Tony Clark – Language Testing, 2025
Several high-stakes English proficiency tests including but not limited to IELTS, PTE Academic, and TOEFL iBT recommend a 2-year time limit on validity for score usage. Although this timeframe provides a useful rule-of-thumb for the recency of testing, it can have far-reaching consequences. In response to stakeholder queries around IELTS validity…
Descriptors: High Stakes Tests, Language Tests, Test Validity, Scores
Haiwei Zhang; Peng Sun; Yaowaluk Bianglae; Winda Widiawati – Language Testing, 2024
In order to address the needs of the continually growing number of Chinese language learners, the present study developed and presented initial validation of a 100-item Chinese vocabulary proficiency test (CVPT) for learners of Chinese as a second/foreign language (CS/FL) using Item Response Theory among 170 CS/FL learners from Indonesia and 354…
Descriptors: Test Construction, Vocabulary, Language Proficiency, Language Tests
Suh Keong Kwon; Guoxing Yu – Language Testing, 2024
In this study, we examined the effect of visual cues in a second language listening test on test takers' viewing behaviours and their test performance. Fifty-seven learners of English in Korea took a video-based listening test, with their eye movements recorded, and 23 of them were interviewed individually after the test. The participants viewed…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Eye Movements
Yunwen Su; Sun-Young Shin – Language Testing, 2024
Rating scales that language testers design should be tailored to the specific test purpose and score use as well as reflect the target construct. Researchers have long argued for the value of data-driven scales for classroom performance assessment, because they are specific to pedagogical tasks and objectives, have rich descriptors to offer useful…
Descriptors: Rating Scales, Language Tests, Test Construction, Performance Based Assessment
Ping-Lin Chuang – Language Testing, 2025
This experimental study explores how source use features impact raters' judgment of argumentation in a second language (L2) integrated writing test. One hundred four experienced and novice raters were recruited to complete a rating task that simulated the scoring assignment of a local English Placement Test (EPT). Sixty written responses were…
Descriptors: Interrater Reliability, Evaluators, Information Sources, Primary Sources
Okim Kang; Xun Yan; Maria Kostromitina; Ron Thomson; Talia Isaacs – Language Testing, 2024
This study aimed to answer an ongoing validity question related to the use of nonstandard English accents in international tests of English proficiency and associated issues of test fairness. More specifically, we examined (1) the extent to which different or shared English accents had an impact on listeners' performances on the Duolingo listening…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Nonstandard Dialects
Tavakoli, Parvaneh; Kendon, Gill; Mazhurnaya, Svetlana; Ziomek, Anna – Language Testing, 2023
The main aim of this study was to investigate how oral fluency is assessed across different levels of proficiency in the Test of English for Educational Purposes (TEEP). Working with data from 56 test-takers performing a monologic task at a range of proficiency levels (equivalent to approximately levels 5.0, 5.5, 6.5, and 7.5 in the IELTS scoring…
Descriptors: Language Fluency, Language Tests, English (Second Language), Second Language Learning
Khaled Barkaoui – Language Testing, 2025
English-medium universities often accept scores from various English language proficiency (ELP) tests as evidence of ELP from non-English background students. This practice raises the question of how these tests compare in terms of their ability to predict academic achievement. This longitudinal study addresses this question by examining the…
Descriptors: English Learners, English (Second Language), Language Proficiency, Language Tests
Monteiro, Kátia; Crossley, Scott; Botarleanu, Robert-Mihai; Dascalu, Mihai – Language Testing, 2023
Lexical frequency benchmarks have been extensively used to investigate second language (L2) lexical sophistication, especially in language assessment studies. However, indices based on semantic co-occurrence, which may be a better representation of the experience language users have with lexical items, have not been sufficiently tested as…
Descriptors: Second Language Learning, Second Languages, Native Language, Semantics
Ramsey L. Cardwell; Steven W. Nydick; J.R. Lockwood; Alina A. von Davier – Language Testing, 2024
Applicants must often demonstrate adequate English proficiency when applying to postsecondary institutions by taking an English language proficiency test, such as the TOEFL iBT, IELTS Academic, or Duolingo English Test (DET). Concordance tables aim to provide equivalent scores across multiple assessments, helping admissions officers to make fair…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Language Proficiency
Thi My Hang Nguyen; Peter Gu; Averil Coxhead – Language Testing, 2024
Despite extensive research on assessing collocational knowledge, valid measures of academic collocations remain elusive. With the present study, we employ an argument-based approach to validate two Academic Collocation Tests (ACTs) that assess the ability to recognize and produce academic collocations (i.e., two-word units such as "key…
Descriptors: Foreign Countries, College Students, College Entrance Examinations, English (Second Language)
Shungo Suzuki; Hiroaki Takatsu; Ryuki Matsuura; Miina Koyama; Mao Saeki; Yoichi Matsuyama – Language Testing, 2025
The current study proposes a new approach to weakness identification in diagnostic language assessment (DLA) for speaking skills. We also propose to design actionable and contextualised diagnostic feedback through the systematic integration of feedback and remedial learning activities. Focusing on lexical use in second language speaking, the…
Descriptors: English (Second Language), Speech Skills, Artificial Intelligence, Second Language Learning
Michael D. Carey; Stefan Szocs – Language Testing, 2024
This controlled experimental study investigated the interaction of variables associated with rating the pronunciation component of high-stakes English-language-speaking tests such as IELTS and TOEFL iBT. One hundred experienced raters who were all either familiar or unfamiliar with Brazilian-accented English or Papua New Guinean Tok Pisin-accented…
Descriptors: Dialects, Pronunciation, Suprasegmentals, Familiarity

Peer reviewed
Direct link
