Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 8 |
| Since 2007 (last 20 years) | 27 |
Descriptor
| Validity | 34 |
| Language Tests | 24 |
| English (Second Language) | 19 |
| Foreign Countries | 19 |
| Second Language Learning | 17 |
| Scores | 12 |
| Language Proficiency | 11 |
| Reliability | 11 |
| Evaluation | 8 |
| Second Language Instruction | 6 |
| Guidelines | 5 |
| More ▼ | |
Source
| Language Assessment Quarterly | 34 |
Author
| Lazaraton, Anne | 2 |
| Leung, Constant | 2 |
| McNamara, Tim | 2 |
| Bachman, Lyle F. | 1 |
| Barry O'Sullivan | 1 |
| Benjamin Kremmel | 1 |
| Butler, Yuko Goto | 1 |
| Chen, Huilin | 1 |
| Chen, Jinsong | 1 |
| Chen, Yuan-shan | 1 |
| Cho, Yeonsuk | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 34 |
| Reports - Research | 19 |
| Reports - Descriptive | 10 |
| Reports - Evaluative | 5 |
| Tests/Questionnaires | 4 |
Education Level
Audience
| Policymakers | 1 |
| Researchers | 1 |
Location
| Taiwan | 4 |
| Australia | 3 |
| Belgium | 2 |
| Hong Kong | 2 |
| Netherlands | 2 |
| Afghanistan | 1 |
| Austria | 1 |
| California | 1 |
| Canada | 1 |
| China (Beijing) | 1 |
| Denmark | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Test of English as a Foreign… | 2 |
| ACTFL Oral Proficiency… | 1 |
| English Proficiency Test | 1 |
| National Assessment of… | 1 |
| Program for International… | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Barry O'Sullivan – Language Assessment Quarterly, 2023
This paper highlights as issues of concern the rapid changes in technology and the tendency to report on partial validation efforts where the work is not identified as forming part of a larger validation project. With close human supervision emerging technologies can have a significant and positive impact on language testing. While technology…
Descriptors: Technology Uses in Education, Computer Assisted Testing, Language Tests, Supervision
Daniel R. Isbell; Benjamin Kremmel; Jieun Kim – Language Assessment Quarterly, 2023
In the wake of the COVID-19 boom in remote administration of language tests, it appears likely that remote administration will be a permanent fixture in the language testing landscape. Accordingly, language test providers, stakeholders, and researchers must grapple with the implications of remote proctoring on valid, fair, and just uses of tests.…
Descriptors: Distance Education, Supervision, Language Tests, Culture Fair Tests
Huang, Becky H.; Butler, Yuko Goto – Language Assessment Quarterly, 2020
The population of young language minority (LM) students is rapidly growing worldwide due to global migration and immigration trends. The increasing representation of young LM students in school settings creates high demand for the language assessment of LM students in order to meet the needs of stakeholders, such as governments and language…
Descriptors: Validity, Student Evaluation, Language Proficiency, Language Minorities
Leung, Constant; Evans, Michael; Liu, Yongcan – Language Assessment Quarterly, 2021
The language assessment issues discussed in this paper are set against the backdrop of the English as an additional language (EAL) provision for students from ethnic and linguistic minority communities in the publicly funded school education system in England. We will first provide a background description of the educational response to linguistic…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Foreign Countries
L. Hannah; E. E. Jang; M. Shah; V. Gupta – Language Assessment Quarterly, 2023
Machines have a long-demonstrated ability to find statistical relationships between qualities of texts and surface-level linguistic indicators of writing. More recently, unlocked by artificial intelligence, the potential of using machines to identify content-related writing trait criteria has been uncovered. This development is significant,…
Descriptors: Validity, Automation, Scoring, Writing Assignments
Wagner, Elvis; Liao, Yen-Fen; Wagner, Santoi – Language Assessment Quarterly, 2021
L2 test developers often use scripted spoken texts in their L2 listening tests, because it is efficient and practical to create scripted spoken texts that meet predetermined test specifications. But because scripted spoken texts differ in a number of fundamental ways from unscripted spoken language, there are potential threats to validity when…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Grammar
Harsch, Claudia; Kanistra, Voula Paraskevi – Language Assessment Quarterly, 2020
We report on a standard-setting project in which the Item-Descriptor-Matching Method (IDM) and a complementary benchmarking approach were employed to align a suite of English language proficiency exams to the "Common European Framework of Reference" (CEFR), with a particular focus on the integrated and independent writing exams. Judges'…
Descriptors: Standard Setting, Guidelines, Rating Scales, Definitions
Xie, Qin – Language Assessment Quarterly, 2020
This article describes the steps we went through in designing and validating an item bank to diagnose linguistic problems in the English academic writing of university students in Hong Kong. Test items adopt traditional item formats (e.g., MCQ, grammatical judgment tasks, and error correction) but are based on authentic language materials…
Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Item Analysis
Papageorgiou, Spiros; Xi, Xiaoming; Morgan, Rick; So, Youngsoon – Language Assessment Quarterly, 2015
This study presents the development and empirical validation of score levels and descriptors specifically designed for reporting purposes to provide test takers with more than just a number on a score scale. In the context of a test primarily intended for 11- to 15-year-old students learning English as a second/foreign language, the study examined…
Descriptors: Scores, Validity, Scaling, Classification
Chen, Huilin; Chen, Jinsong – Language Assessment Quarterly, 2016
Cognitive diagnosis models (CDMs) are psychometric models developed mainly to assess examinees' specific strengths and weaknesses in a set of skills or attributes within a domain. By adopting the Generalized-DINA model framework, the recently developed general modeling framework, we attempted to retrofit the PISA reading assessments, a…
Descriptors: Reading Tests, Diagnostic Tests, Models, Test Items
Tannenbaum, Richard J.; Cho, Yeonsuk – Language Assessment Quarterly, 2014
In this article, we consolidate and present in one place what is known about quality indicators for setting standards so that stakeholders may be able to recognize the signs of standard-setting quality. We use the context of setting standards to associate English language test scores with language proficiency descriptions such as those presented…
Descriptors: Standard Setting, Language Tests, Scores, English (Second Language)
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Chen, Yuan-shan; Liu, Jianda – Language Assessment Quarterly, 2016
This study reports the development of a scale to evaluate the speech act performance by intermediate-level Chinese learners of English. A qualitative analysis of the American raters' comments was conducted on learner scripts in response to a total of 16 apology and request written discourse completion task (WDCT) situations. The results showed…
Descriptors: Speech Acts, North Americans, Grammar, Electronic Mail
Yang, Hui-Chun – Language Assessment Quarterly, 2014
This study explores the construct of a summarization test task by means of single-group and multigroup structural equation modeling (SEM). It examines the interrelationships between strategy use and performance, drawing on data from 298 Taiwanese undergraduates' summary essays and their self-reported strategy use. Single-group SEM analyses…
Descriptors: Foreign Countries, Structural Equation Models, Writing Skills, Language Tests

Peer reviewed
Direct link
