ERIC - Search Results

Showing 1 to 15 of 106 results

Test Score Validity Periods for High-Stakes Language Tests: Applications in Higher Education and Medical Sectors

Peer reviewed

Emma Bruce; Karen Dunn; Tony Clark – Language Testing, 2025

Several high-stakes English proficiency tests including but not limited to IELTS, PTE Academic, and TOEFL iBT recommend a 2-year time limit on validity for score usage. Although this timeframe provides a useful rule-of-thumb for the recency of testing, it can have far-reaching consequences. In response to stakeholder queries around IELTS validity…

Descriptors: High Stakes Tests, Language Tests, Test Validity, Scores

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

Comparing Two Formats of Data-Driven Rating Scales for Classroom Assessment of Pragmatic Performance with Roleplays

Peer reviewed

Yunwen Su; Sun-Young Shin – Language Testing, 2024

Rating scales that language testers design should be tailored to the specific test purpose and score use as well as reflect the target construct. Researchers have long argued for the value of data-driven scales for classroom performance assessment, because they are specific to pedagogical tasks and objectives, have rich descriptors to offer useful…

Descriptors: Rating Scales, Language Tests, Test Construction, Performance Based Assessment

Practical Considerations When Building Concordances between English Tests

Peer reviewed

Ramsey L. Cardwell; Steven W. Nydick; J.R. Lockwood; Alina A. von Davier – Language Testing, 2024

Applicants must often demonstrate adequate English proficiency when applying to postsecondary institutions by taking an English language proficiency test, such as the TOEFL iBT, IELTS Academic, or Duolingo English Test (DET). Concordance tables aim to provide equivalent scores across multiple assessments, helping admissions officers to make fair…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Language Proficiency

Argument-Based Validation of Academic Collocation Tests

Peer reviewed

Thi My Hang Nguyen; Peter Gu; Averil Coxhead – Language Testing, 2024

Despite extensive research on assessing collocational knowledge, valid measures of academic collocations remain elusive. With the present study, we employ an argument-based approach to validate two Academic Collocation Tests (ACTs) that assess the ability to recognize and produce academic collocations (i.e., two-word units such as "key…

Descriptors: Foreign Countries, College Students, College Entrance Examinations, English (Second Language)

Developing Internet-Based "Tests of Aptitude for Language Learning (TALL)": An Open Research Endeavour

Peer reviewed

Junlan Pan; Emma Marsden – Language Testing, 2024

"Tests of Aptitude for Language Learning" (TALL) is an openly accessible internet-based battery to measure the multifaceted construct of foreign language aptitude, using language domain-specific instruments and L1-sensitive instructions and stimuli. This brief report introduces the components of this theory-informed battery and…

Descriptors: Language Tests, Aptitude Tests, Second Language Learning, Test Construction

Establishing Meaning Recall and Meaning Recognition Vocabulary Knowledge as Distinct Psychometric Constructs in Relation to Reading Proficiency

Peer reviewed

Jeffrey Stewart; Henrik Gyllstad; Christopher Nicklin; Stuart McLean – Language Testing, 2024

The purpose of this paper is to (a) establish whether meaning recall and meaning recognition item formats test psychometrically distinct constructs of vocabulary knowledge which measure separate skills, and, if so, (b) determine whether each construct possesses unique properties predictive of L2 reading proficiency. Factor analyses and…

Descriptors: Vocabulary Development, Psychometrics, Language Tests, Recall (Psychology)

Critical Discursive Approaches to Evaluating Policy-Driven Testing: Social Impact as a Target for Validation

Peer reviewed

Dongil Shin – Language Testing, 2024

This paper addresses the intersection of testing and policy, situating test-driven impact and validation within the context of policy-led educational reform in Korea. I will briefly review the existing validation models. Then, arguing for an expansion of the conventional conceptualization of consequential validity research, I use Fairclough's…

Descriptors: Educational Policy, Discourse Analysis, Test Validity, Educational Change

Towards a New Sophistication in Vocabulary Assessment

Peer reviewed

Read, John – Language Testing, 2023

Published work on vocabulary assessment has grown substantially in the last 10 years, but it is still somewhat outside the mainstream of the field. There has been a recent call for those developing vocabulary tests to apply professional standards to their work, especially in validating their instruments for specified purposes before releasing them…

Descriptors: Language Tests, Vocabulary Development, Second Language Learning, Test Format

Korean Syntactic Complexity Analyzer (KOSCA): An NLP Application for the Analysis of Syntactic Complexity in Second Language Production

Peer reviewed

Haerim Hwang; Hyunwoo Kim – Language Testing, 2024

Given the lack of computational tools available for assessing second language (L2) production in Korean, this study introduces a novel automated tool called the Korean Syntactic Complexity Analyzer (KOSCA) for measuring syntactic complexity in L2 Korean production. As an open-source graphic user interface (GUI) developed in Python, KOSCA provides…

Descriptors: Korean, Natural Language Processing, Syntax, Computer Graphics

Test Design and Validity Evidence of Interactive Speaking Assessment in the Era of Emerging Technologies

Peer reviewed

Youn, Soo Jung – Language Testing, 2023

As access to smartphones and emerging technologies has become ubiquitous in our daily lives and in language learning, technology-mediated social interaction has become common in teaching and assessing L2 speaking. The changing ecology of L2 spoken interaction provides language educators and testers with opportunities for renewed test design and…

Descriptors: Test Construction, Test Validity, Second Language Learning, Telecommunications

Examining the Factor Structure and Its Replicability across Multiple Listening Test Forms: Validity Evidence for the Michigan English Test

Peer reviewed

Liu, Tingting; Aryadoust, Vahid; Foo, Stacy – Language Testing, 2022

This study evaluated the validity of the Michigan English Test (MET) Listening Section by investigating its underlying factor structure and the replicability of its factor structure across multiple test forms. Data from 3255 test takers across four forms of the MET Listening Section were used. To investigate the factor structure, each form was…

Descriptors: Factor Structure, Language Tests, Second Language Learning, Second Language Instruction

Strategy Use in a Spoken Dialog System-Delivered Paired Discussion Task: A Stimulated Recall Study

Peer reviewed

Gokturk, Nazlinur; Chukharev-Hudilainen, Evgeny – Language Testing, 2023

With recent technological advances, researchers have begun to explore the potential use of spoken dialog systems (SDSs) for L2 oral communication assessment. While several studies support the feasibility of building these systems for various types of oral tasks, research on the construct validity of SDS-delivered tasks is still limited. Thus, this…

Descriptors: Oral Language, Dialogs (Language), Second Language Learning, Second Language Instruction

Hanyu Shuiping Kaoshi (HSK): A Multi-Level, Multi-Purpose Proficiency Test

Peer reviewed

Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021

This test review focuses on the current version (2009) of [Chinese characters omitted] (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…

Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making

Managing Proposal Sequences in Role-Play Assessment: Validity Evidence of Interactional Competence across Levels

Peer reviewed

Youn, Soo Jung – Language Testing, 2020

This qualitative study reports an investigation of the nature of interactional competence at various levels of achievement in the context of role-play speaking assessment. The focal point of this study is on how examinees jointly accomplish the interactional work involved in proposal sequences in role-play interaction. Based on a conversation…

Descriptors: Role Playing, Interaction, Test Validity, Communicative Competence (Languages)
