Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 7 |
| Since 2007 (last 20 years) | 8 |
Descriptor
Source
| Language Testing | 14 |
Author
| Brown, James Dean | 2 |
| Henning, Grant | 2 |
| Bachman, Lyle F. | 1 |
| Bax, Stephen | 1 |
| Brunfaut, Tineke | 1 |
| Guiberson, Mark | 1 |
| Janssen, Gerriet | 1 |
| Jason Fan | 1 |
| Kalender, Ilker | 1 |
| Kaya, Elif | 1 |
| Khatib, Mohammad | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 14 |
| Reports - Research | 12 |
| Information Analyses | 1 |
| Reports - Evaluative | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 4 |
| Postsecondary Education | 3 |
| Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Clinical Evaluation of… | 1 |
| International English… | 1 |
What Works Clearinghouse Rating
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Guiberson, Mark – Language Testing, 2019
This study will demonstrate that group differences on a morphosyntactic measure used for the identification of specific language impairment (SLI) do not guarantee validity for diagnosis and tracking, and will exemplify this with a case study of the Spanish version of the "Clinical Evaluation of Preschool Language-2 Estructura de…
Descriptors: Test Validity, Content Validity, Language Impairments, Morphology (Languages)
Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022
Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests
Kaya, Elif; O'Grady, Stefan; Kalender, Ilker – Language Testing, 2022
Language proficiency testing serves an important function of classifying examinees into different categories of ability. However, misclassification is to some extent inevitable and may have important consequences for stakeholders. Recent research suggests that classification efficacy may be enhanced substantially using computerized adaptive…
Descriptors: Item Response Theory, Test Items, Language Tests, Classification
McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018
This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…
Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements
Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017
Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…
Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis
Bax, Stephen – Language Testing, 2013
The research described in this article investigates test takers' cognitive processing while completing onscreen IELTS (International English Language Testing System) reading test items. The research aims, among other things, to contribute to our ability to evaluate the cognitive validity of reading test items (Glaser, 1991; Field, in press). The…
Descriptors: Reading Tests, Eye Movements, Cognitive Processes, Language Tests
Peer reviewedHenning, Grant – Language Testing, 1988
Violations of item unidimensionality on language tests produced distorted estimates of person ability, and violations of person unidimensionality produced distorted estimates of item difficulty. The Bejar Method was sensitive to such distortions. (Author)
Descriptors: Construct Validity, Content Validity, Difficulty Level, Item Analysis
Peer reviewedRaatz, Ulrich – Language Testing, 1985
Argues that classical test theory cannot be used at the item level on "authentic" language tests. However, if the total score is derived by adding the scores of a number of different and independent parts, test reliability can be estimated. Suggests using the Classical Latent Additives model to examine test-part homogeneity. (Author/SED)
Descriptors: Item Analysis, Latent Trait Theory, Models, Second Language Learning
Pae, Tae-Il; Park, Gi-Pyo – Language Testing, 2006
The present study utilized both the IRT-LR (item response theory likelihood ratio) and a series of CFA (confirmatory factor analysis) multi-sample analyses to systematically examine the relationships between DIF (differential item functioning) and DTF (differential test functioning) with a random sample of 15 000 Korean examinees. Specifically,…
Descriptors: Item Response Theory, Factor Analysis, Test Bias, Test Validity
Peer reviewedBrown, James Dean – Language Testing, 1988
The reliability and validity of a cloze procedure used as an English-as-a-second-language (ESL) test in China were improved by applying traditional item analysis and selection techniques. The 'best' test items were chosen on the basis of item facility and discrimination indices, and were administered as a 'tailored cloze.' 29 references listed.…
Descriptors: Adaptive Testing, Cloze Procedure, English (Second Language), Foreign Countries
Peer reviewedBachman, Lyle F.; And Others – Language Testing, 1996
Discusses the value of content considerations in the design of language tests and the implications of the findings of various investigations of content analysis. The article argues that content analysis can be viewed as the application of a model of test design to a particular measurement instrument, using judgments of trained analysts. (26…
Descriptors: College Students, Content Analysis, English (Second Language), Item Analysis
Peer reviewedHenning, Grant; And Others – Language Testing, 1994
Examines the effectiveness of an automated language proficiency test assembly system at an air force base English Language Center. The study focuses on the equivalence of mean score difficulty, total score variance, and intercorrelation covariance across test norms and finds a high level of test-form equivalence and internal consistency. (nine…
Descriptors: Computer Assisted Testing, English (Second Language), Foreign Nationals, Item Analysis

Direct link
