Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 8 |
Descriptor
| Reliability | 12 |
| Validity | 11 |
| English (Second Language) | 7 |
| Language Tests | 7 |
| Scores | 7 |
| Evaluation | 6 |
| Foreign Countries | 6 |
| Language Proficiency | 3 |
| Second Language Learning | 3 |
| Testing | 3 |
| English | 2 |
| More ▼ | |
Source
| Language Assessment Quarterly | 12 |
Author
| Bachman, Lyle F. | 1 |
| Chen, Yuan-shan | 1 |
| Davis, Larry | 1 |
| Downey, Ryan | 1 |
| Farhady, Hossein | 1 |
| Fisher, Steven P. | 1 |
| Gordon, Belita | 1 |
| Han, Chao | 1 |
| Johnson, Robert L. | 1 |
| Lazaraton, Anne | 1 |
| Lee, Young-Ju | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 12 |
| Reports - Research | 7 |
| Tests/Questionnaires | 3 |
| Reports - Descriptive | 2 |
| Reports - Evaluative | 2 |
| Opinion Papers | 1 |
Education Level
| Higher Education | 4 |
| Postsecondary Education | 3 |
Audience
Location
| Belgium | 2 |
| Netherlands | 2 |
| Afghanistan | 1 |
| Australia | 1 |
| Austria | 1 |
| China (Beijing) | 1 |
| Denmark | 1 |
| France | 1 |
| Germany | 1 |
| Ghana | 1 |
| Hong Kong | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| English Proficiency Test | 1 |
| Test of English as a Foreign… | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Xie, Qin – Language Assessment Quarterly, 2020
This article describes the steps we went through in designing and validating an item bank to diagnose linguistic problems in the English academic writing of university students in Hong Kong. Test items adopt traditional item formats (e.g., MCQ, grammatical judgment tasks, and error correction) but are based on authentic language materials…
Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Item Analysis
Papageorgiou, Spiros; Xi, Xiaoming; Morgan, Rick; So, Youngsoon – Language Assessment Quarterly, 2015
This study presents the development and empirical validation of score levels and descriptors specifically designed for reporting purposes to provide test takers with more than just a number on a score scale. In the context of a test primarily intended for 11- to 15-year-old students learning English as a second/foreign language, the study examined…
Descriptors: Scores, Validity, Scaling, Classification
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Chen, Yuan-shan; Liu, Jianda – Language Assessment Quarterly, 2016
This study reports the development of a scale to evaluate the speech act performance by intermediate-level Chinese learners of English. A qualitative analysis of the American raters' comments was conducted on learner scripts in response to a total of 16 apology and request written discourse completion task (WDCT) situations. The results showed…
Descriptors: Speech Acts, North Americans, Grammar, Electronic Mail
Downey, Ryan; Farhady, Hossein; Present-Thomas, Rebecca; Suzuki, Masanori; Van Moere, Alistair – Language Assessment Quarterly, 2008
This article responds to a critique of Ordinate Corporation's "Versant for English" test (formerly PhonePass and SET-10). The critique (Chun, 2006) purports to apply the language task framework of Bachman and Palmer (1996) and the Test Fairness framework of Kunnan (2004) as guides for an analysis. However, the analysis fails to apply either…
Descriptors: Evaluation, Construct Validity, English (Second Language), Testing
Sedgwick, Carole – Language Assessment Quarterly, 2007
From June to November 2003 an exploratory survey was made of the assessment of written English in a small sample of English language degrees in Europe. All programmes had similar components, the study of English language and literature, cultural or area studies, and English language development. Some involved the study of another language.…
Descriptors: Test Validity, Foreign Countries, English (Second Language), English
Lazaraton, Anne; Davis, Larry – Language Assessment Quarterly, 2008
The increasing popularity of paired format in oral testing has engendered legitimate scrutiny of its reliability and validity as compared with the more traditional interviewer-interviewee arrangement. Although characteristics such as the gender, cultural/L1 background, and language proficiency of one's interlocutor likely affects the discourse…
Descriptors: Language Tests, Interviews, Language Proficiency, Second Languages
Lee, Young-Ju – Language Assessment Quarterly, 2007
The Multimedia Assisted Test of English Speaking was designed to assess global speaking competence of Korean speakers of English at Sookmyung Women's University. The test was developed with the help of the American Council on the Teaching of Foreign Languages for their proficiency guidelines and of the Center for Applied Linguistics for their…
Descriptors: Speech Communication, Applied Linguistics, Criterion Referenced Tests, Guidelines
Bachman, Lyle F. – Language Assessment Quarterly, 2005
The fields of language testing and educational and psychological measurement have not, as yet, developed a set of principles and procedures for linking test scores and score-based inferences to test use and the consequences of test use. Although Messick (1989) discusses test use and consequences, his framework provides virtually no guidance on how…
Descriptors: Test Use, Testing, Language Tests, Validity
Johnson, Robert L.; Penny, James; Gordon, Belita; Shumate, Steven R.; Fisher, Steven P. – Language Assessment Quarterly, 2005
Many studies have indicated that at least 2 raters should score writing assessments to improve interrater reliability. However, even for assessments that characteristically demonstrate high levels of rater agreement, 2 raters of the same essay can occasionally report different, or discrepant, scores. If a single score, typically referred to as an…
Descriptors: Interrater Reliability, Scores, Evaluation, Reliability
Reath, Anne – Language Assessment Quarterly, 2004
In 1993, the language section of the Swedish Migration Board initiated the production of documents they called "language analyses" to aid in the processing of asylum seekers. Today, 11 years later, 2 privately owned companies in Stockholm produce these documents. These companies have produced language analyses not only for the Swedish…
Descriptors: Language Tests, Police, Foreign Countries, Immigration
Leung, Constant – Language Assessment Quarterly, 2004
Classroom-based formative assessment by teachers has received a good deal of renewed scholarly and policy interest. The overall aim of this article is to foreground some of the key constitutive issues in this approach to teacher assessment and to suggest possible ways of conceptualizing key epistemological and empirical questions. This discussion…
Descriptors: Research and Development, Formative Evaluation, English (Second Language), Teaching Methods

Peer reviewed
Direct link
