Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 83 |
| Since 2017 (last 10 years) | 173 |
| Since 2007 (last 20 years) | 360 |
Source
| Language Testing | 539 |
Author
| Davies, Alan | 8 |
| Bachman, Lyle F. | 7 |
| Elder, Catherine | 7 |
| Cheng, Liying | 6 |
| Xi, Xiaoming | 6 |
| Yan, Xun | 6 |
| Alderson, J. Charles | 5 |
| Aryadoust, Vahid | 5 |
| Cho, Yeonsuk | 5 |
| Ginther, April | 5 |
| Knoch, Ute | 5 |
Location
| Japan | 33 |
| China | 30 |
| Australia | 23 |
| United Kingdom | 15 |
| Canada | 14 |
| South Korea | 13 |
| Europe | 7 |
| Germany | 6 |
| Hong Kong | 6 |
| Netherlands | 6 |
| New Zealand | 5 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 1 |
| Lau v Nichols | 1 |
| Race to the Top | 1 |
Peer reviewed: Sasaki, Miyuki – Language Testing, 2000
Investigates how schemata activated by culturally familiar words might have influenced students' cloze test-taking processes. Subjects were Japanese English-as-a-foreign-language students. Results demonstrate that students who read culturally familiar cloze texts tried to solve more items and generally understood the text better, which resulted in…
Descriptors: Cloze Procedure, College Students, Cultural Awareness, English (Second Language)
Peer reviewed: Jarvis, Scott – Language Testing, 2002
Compares accuracy of five formulae in terms of their ability to model the type-token curves of written texts produced by learners and native speakers. The most accurate models are then used to consider unresolved issues of past research on lexical diversity: the relationship between lexical diversity and age, second language instruction (L2), L2…
Descriptors: Age, Comparative Analysis, Language Tests, Native Speakers
Mochida, Akira; Harrington, Michael – Language Testing, 2006
Performance on the Yes/No test (Huibregtse et al., 2002) was assessed as a predictor of scores on the Vocabulary Levels Test (VLT), a standard test of receptive second language (L2) vocabulary knowledge (Nation, 1990). The use of identical items on both tests allowed a direct comparison of test performance, with alternative methods for scoring the…
Descriptors: Scoring, Questioning Techniques, Vocabulary Development, Language Tests
Lee, Yong-Won – Language Testing, 2006
A multitask speaking measure consisting of both integrated and independent tasks is expected to be an important component of a new version of the TOEFL test. This study considered two critical issues concerning score dependability of the new speaking measure: How much would the score dependability be impacted by (1) combining scores on different…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Generalizability Theory
McCarthy, Philip M.; Jarvis, Scott – Language Testing, 2007
A reliable index of lexical diversity (LD) has remained stubbornly elusive for over 60 years. Meanwhile, researchers in fields as varied as "stylistics," "neuropathology," "language acquisition," and even "forensics" continue to use flawed LD indices--often ignorant that their results are questionable and in…
Descriptors: Second Language Learning, English (Second Language), Foreign Countries, Adolescents
Liu, Jianda – Language Testing, 2007
Pragmatic proficiency has been incorporated in the EFL teaching and testing syllabi in China, but the corresponding tests still focus on linguistic competence. The gap between the teaching and testing is mainly due to the lack of generally accepted measures of communicative abilities such as pragmatic competence. This study developed a…
Descriptors: Linguistic Competence, Speech Acts, Testing, Foreign Countries
Abbott, Marilyn L. – Language Testing, 2007
In this article, I describe a practical application of the Roussos and Stout (1996) multidimensional analysis framework for interpreting group performance differences on an ESL reading proficiency test. Although a variety of statistical methods have been developed for flagging test items that function differentially for equal ability examinees…
Descriptors: Test Bias, Test Items, English (Second Language), Second Language Learning
Elder, Catherine; Barkhuizen, Gary; Knoch, Ute; von Randow, Janet – Language Testing, 2007
The use of online rater self-training is growing in popularity and has obvious practical benefits, facilitating access to training materials and rating samples and allowing raters to reorient themselves to the rating scale and self-monitor their behaviour at their own convenience. However, there has thus far been little research into rater…
Descriptors: Writing Evaluation, Writing Tests, Scoring Rubrics, Rating Scales
Peer reviewed: Olshtain, Elite; Blum-Kulka, Shoshana – Language Testing, 1985
Describes several elicitation techniques used for speech act data collection and analysis and discusses their suitability in testing a learner's acquisition of the rules of language use. Argues that the development of testing instruments for communicative competence cannot be divorced from the field of cross-cultural pragmatics. (SED)
Descriptors: Communicative Competence (Languages), Cross Cultural Studies, Language Tests, Language Usage
Peer reviewed: Davidson, Fred; Henning, Grant – Language Testing, 1985
Presents a study of how well a set of language proficiency self-ratings fit the predictions of a probabilistic measurement model known as the Rasch Model. Applies the principles of the model to scalar rather than binary item response data. Concludes that scalar analysis of this kind is feasible with self-rating data. (Author/SED)
Descriptors: English (Second Language), Goodness of Fit, Higher Education, Language Proficiency
Peer reviewed: Shohamy, Elana – Language Testing, 1984
Describes a study of multiple-choice and open-ended questions, each presented in the native language and in the second language, on the same second language text. Different methods resulted in different scores. Some methods were found to be more difficult than others and to have a greater effect on students of low-level proficiency. (SED)
Descriptors: English (Second Language), Language Tests, Multiple Choice Tests, Reading Comprehension
Peer reviewed: Shohamy, Elana – Language Testing, 1997
Argues that language tests employing methods not fair to all test takers are unethical. Ways of reducing sources of unfairness in language testing are discussed. (15 references) (Author/CK)
Descriptors: Academic Achievement, Change Strategies, Ethics, Language Proficiency
Peer reviewed: Brindley, Geoff; Slatyer, Helen – Language Testing, 2002
Reports on an exploratory study that investigated the comparability of listening assessment tasks used to assess and report learning outcomes of adult English-as-a-Second-Language learners in Australia. Focused on the effects of task characteristics and task conditions on learners' performance in competency-based listening assessment tasks that…
Descriptors: Adults, English (Second Language), Language Tests, Listening Skills
Peer reviewed: Stansfield, Charles W.; Ross, Jacqueline – Language Testing, 1988
Outlines research necessary for determining the validity and reliability of Test of Written English, an essay test that directly measures writing ability and complements Test of English-as-a-Foreign-Language's (TOEFL) indirect assessment of writing skills. Research should cover such aspects as construct, criterion-related, concurrent, content, and…
Descriptors: English (Second Language), Essay Tests, Language Research, Language Tests
Peer reviewed: Lynch, Brian; And Others – Language Testing, 1988
An investigation of person dimensionality attempted to identify student clusters relating to demographic variables and their score on a university English-as-a-Second-Language placement examination. The two genuine clusters for students with scores in the top or bottom 27 percent could not be accounted for solely on an ability basis. (CB)
Descriptors: Academic Ability, English (Second Language), Higher Education, Language Tests

