ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	7
Since 2007 (last 20 years)	8

Source

Language Testing

Publication Type

Journal Articles	14
Reports - Research	12
Information Analyses	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Higher Education	4
Postsecondary Education	3
Secondary Education	1

Audience

Location

Australia	1
China (Guangzhou)	1
Iran	1
Japan	1
Malaysia	1
Russia	1
Turkey (Ankara)	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Clinical Evaluation of…	1
International English…	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Test Score Comparison Tables: How Well are They Serving Test Users?

Peer reviewed

Direct link

Ute Knoch; Jason Fan – Language Testing, 2024

While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…

Descriptors: Language Tests, English, Test Validity, Item Analysis

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Direct link

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

Test Validity in Morphosyntactic Measures for Typical and SLI Incipient Spanish-English Bilinguals

Peer reviewed

Direct link

Guiberson, Mark – Language Testing, 2019

This study will demonstrate that group differences on a morphosyntactic measure used for the identification of specific language impairment (SLI) do not guarantee validity for diagnosis and tracking, and will exemplify this with a case study of the Spanish version of the "Clinical Evaluation of Preschool Language-2 Estructura de…

Descriptors: Test Validity, Content Validity, Language Impairments, Morphology (Languages)

Critical Language Assessment Literacy of EFL Teachers: Scale Construction and Validation

Peer reviewed

Direct link

Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022

Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

IRT-Based Classification Analysis of an English Language Reading Proficiency Subtest

Peer reviewed

Direct link

Kaya, Elif; O'Grady, Stefan; Kalender, Ilker – Language Testing, 2022

Language proficiency testing serves an important function of classifying examinees into different categories of ability. However, misclassification is to some extent inevitable and may have important consequences for stakeholders. Recent research suggests that classification efficacy may be enhanced substantially using computerized adaptive…

Descriptors: Item Response Theory, Test Items, Language Tests, Classification

Investigating the Construct Measured by Banked Gap-Fill Items: Evidence from Eye-Tracking

Peer reviewed

Direct link

McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018

This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…

Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements

Determining Cloze Item Difficulty from Item and Passage Characteristics across Different Learner Backgrounds

Peer reviewed

Direct link

Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017

Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…

Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis

The Cognitive Processing of Candidates during Reading Tests: Evidence from Eye-Tracking

Peer reviewed

Direct link

Bax, Stephen – Language Testing, 2013

The research described in this article investigates test takers' cognitive processing while completing onscreen IELTS (International English Language Testing System) reading test items. The research aims, among other things, to contribute to our ability to evaluate the cognitive validity of reading test items (Glaser, 1991; Field, in press). The…

Descriptors: Reading Tests, Eye Movements, Cognitive Processes, Language Tests

The Influence of Test and Sample Dimensionality on Latent Trait Person Ability and Item Difficulty Calibrations.

Peer reviewed

Henning, Grant – Language Testing, 1988

Violations of item unidimensionality on language tests produced distorted estimates of person ability, and violations of person unidimensionality produced distorted estimates of item difficulty. The Bejar Method was sensitive to such distortions. (Author)

Descriptors: Construct Validity, Content Validity, Difficulty Level, Item Analysis

Better Theory for Better Tests?

Peer reviewed

Raatz, Ulrich – Language Testing, 1985

Argues that classical test theory cannot be used at the item level on "authentic" language tests. However, if the total score is derived by adding the scores of a number of different and independent parts, test reliability can be estimated. Suggests using the Classical Latent Additives model to examine test-part homogeneity. (Author/SED)

Descriptors: Item Analysis, Latent Trait Theory, Models, Second Language Learning

Examining the Relationship between Differential Item Functioning and Differential Test Functioning

Peer reviewed

Direct link

Pae, Tae-Il; Park, Gi-Pyo – Language Testing, 2006

The present study utilized both the IRT-LR (item response theory likelihood ratio) and a series of CFA (confirmatory factor analysis) multi-sample analyses to systematically examine the relationships between DIF (differential item functioning) and DTF (differential test functioning) with a random sample of 15 000 Korean examinees. Specifically,…

Descriptors: Item Response Theory, Factor Analysis, Test Bias, Test Validity

Tailored Cloze: Improved with Classical Item Analysis Techniques.

Peer reviewed

Brown, James Dean – Language Testing, 1988

The reliability and validity of a cloze procedure used as an English-as-a-second-language (ESL) test in China were improved by applying traditional item analysis and selection techniques. The 'best' test items were chosen on the basis of item facility and discrimination indices, and were administered as a 'tailored cloze.' 29 references listed.…

Descriptors: Adaptive Testing, Cloze Procedure, English (Second Language), Foreign Countries

The Use of Test Method Characteristics in the Content Analysis and Design of EFL Proficiency Tests.

Peer reviewed

Bachman, Lyle F.; And Others – Language Testing, 1996

Discusses the value of content considerations in the design of language tests and the implications of the findings of various investigations of content analysis. The article argues that content analysis can be viewed as the application of a model of test design to a particular measurement instrument, using judgments of trained analysts. (26…

Descriptors: College Students, Content Analysis, English (Second Language), Item Analysis

Automated Assembly of Pre-equated Language Proficiency Tests.

Peer reviewed

Henning, Grant; And Others – Language Testing, 1994

Examines the effectiveness of an automated language proficiency test assembly system at an air force base English Language Center. The study focuses on the equivalence of mean score difficulty, total score variance, and intercorrelation covariance across test norms and finds a high level of test-form equivalence and internal consistency. (nine…

Descriptors: Computer Assisted Testing, English (Second Language), Foreign Nationals, Item Analysis

Item Analysis	14
Language Tests	12
English (Second Language)	9
Test Validity	9
Second Language Learning	8
Foreign Countries	7
Test Items	7
Language Proficiency	5
Comparative Analysis	3
Computer Assisted Testing	3
Construct Validity	3
Factor Analysis	3
Graduate Students	3
Item Response Theory	3
Reading Tests	3
Statistical Analysis	3
Test Construction	3
Undergraduate Students	3
Cloze Procedure	2
Cognitive Processes	2
College Students	2
Content Analysis	2
Content Validity	2
Culture Fair Tests	2
Difficulty Level	2
More ▼

Brown, James Dean	2
Henning, Grant	2
Bachman, Lyle F.	1
Bax, Stephen	1
Brunfaut, Tineke	1
Guiberson, Mark	1
Janssen, Gerriet	1
Jason Fan	1
Kalender, Ilker	1
Kaya, Elif	1
Khatib, Mohammad	1
Kozhevnikova, Liudmila	1
Mahdavi, Mohsen	1
McCray, Gareth	1
O'Grady, Stefan	1
Pae, Tae-Il	1
Park, Gi-Pyo	1
Raatz, Ulrich	1
Tajeddin, Zia	1
Trace, Jonathan	1
Ute Knoch	1
Vahid Aryadoust	1
Wenxin Zhang	1
Xueliang Chen	1
More ▼