ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	11
Since 2007 (last 20 years)	21

Descriptor

Graduate Students	24
English (Second Language)	19
Language Tests	19
Second Language Learning	17
Language Proficiency	13
Foreign Countries	10
Scores	8
Undergraduate Students	8
Correlation	7
Foreign Students	6
Second Language Instruction	6
Computer Assisted Testing	5
Evaluators	5
Teaching Assistants	5
Test Items	5
Test Validity	5
College Entrance Examinations	4
Construct Validity	4
Language Teachers	4
Native Language	4
Native Speakers	4
Oral Language	4
Pronunciation	4
Statistical Analysis	4
Writing Evaluation	4
More ▼

Source

Language Testing

Publication Type

Journal Articles	24
Reports - Research	20
Reports - Evaluative	3

Education Level

Higher Education	22
Postsecondary Education	12
Secondary Education	2

Audience

Researchers	1
Teachers	1

Location

United Kingdom	3
California (Los Angeles)	1
Canada	1
China	1
China (Guangzhou)	1
Europe	1
Iowa	1
Iran (Tehran)	1
Japan	1
Michigan	1
Ohio	1
Pennsylvania (Pittsburgh)	1
Turkey (Ankara)	1
United Kingdom (England)	1
United Kingdom (London)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	6
International English…	2
Graduate Record Examinations	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Examining the Predictive Validity of the Duolingo English Test: Evidence from a Major UK University

Peer reviewed

Direct link

Isaacs, Talia; Hu, Ruolin; Trenkic, Danijela; Varga, Julia – Language Testing, 2023

The COVID-19 pandemic has changed the university admissions and proficiency testing landscape. One change has been the meteoric rise in use of the fully automated Duolingo English Test (DET) for university entrance purposes, offering test-takers a cheaper, shorter, accessible alternative. This rapid response study is the first to investigate the…

Descriptors: Predictive Validity, Educational Technology, Handheld Devices, Language Tests

What the Analytic versus Holistic Scoring of International Teaching Assistants Can Reveal: Lexical Grammar Matters

Peer reviewed

Direct link

Ma, Wenyue – Language Testing, 2022

Second-language (L2) testing researchers have explored the relationship between speakers' overall speaking ability, reflected by holistic scores, and the speakers' performance on speaking subcomponents, reflected by analytic scores (e.g., McNamara, 1990; Sato, 2011). These research studies have advanced applied linguists' understanding of how…

Descriptors: Language Tests, Teaching Assistants, Second Language Learning, Second Language Instruction

Examining the Effects of Different English Speech Varieties on an L2 Academic Listening Comprehension Test at the Item Level

Peer reviewed

Direct link

Shin, Sun-Young; Lee, Senyung; Lidster, Ryan – Language Testing, 2021

In this study we investigated the potential for a shared-first-language (shared-L1) effect on second language (L2) listening test scores using differential item functioning (DIF) analyses. We did this in order to understand how accented speech may influence performance at the item level, while controlling for key variables including listening…

Descriptors: Listening Comprehension Tests, Language Tests, Native Language, Scores

"Am I Qualified to Be a Language Tester?": Understanding the Development of Language Assessment Literacy across Three Stakeholder Groups

Peer reviewed

Direct link

Yan, Xun; Fan, Jason – Language Testing, 2021

Recent investigations into language assessment literacy (LAL) suggest that stakeholder groups might differ in interests, needs, and expectations in assessment practice, resulting in different LAL profiles. This qualitative study furthers this line of research by examining the effect of contextual and experiential factors on the LAL profiles and…

Descriptors: Evaluators, Language Tests, Language Teachers, Second Language Learning

IRT-Based Classification Analysis of an English Language Reading Proficiency Subtest

Peer reviewed

Direct link

Kaya, Elif; O'Grady, Stefan; Kalender, Ilker – Language Testing, 2022

Language proficiency testing serves an important function of classifying examinees into different categories of ability. However, misclassification is to some extent inevitable and may have important consequences for stakeholders. Recent research suggests that classification efficacy may be enhanced substantially using computerized adaptive…

Descriptors: Item Response Theory, Test Items, Language Tests, Classification

Empirical Profiles of Academic Oral English Proficiency from an International Teaching Assistant Screening Test

Peer reviewed

Direct link

Choi, Ikkyu – Language Testing, 2017

Language proficiency constitutes a crucial barrier for prospective international teaching assistants (ITAs). Many US universities administer screening tests to ensure that ITAs possess the required academic oral English proficiency for their TA duties. Such ITA screening tests often elicit a sample of spoken English, which is evaluated in terms of…

Descriptors: Oral English, Academic Discourse, Language Proficiency, Screening Tests

The Relationship among Accent Familiarity, Shared L1, and Comprehensibility: A Path Analysis Perspective

Peer reviewed

Direct link

Miao, Yongzhi – Language Testing, 2023

Scholars have argued for the inclusion of different spoken varieties of English in high-stakes listening tests to better represent the global use of English. However, doing so may introduce additional construct-irrelevant variance due to accent familiarity and the shared first language (L1) advantage, which could threaten test fairness. However,…

Descriptors: Pronunciation, Metalinguistics, Native Language, Intelligibility

Factor Analysis for Fairness: Examining the Impact of Task Type and Examinee L1 Background on Scores of an ITA Speaking Test

Peer reviewed

Direct link

Yan, Xun; Cheng, Lixia; Ginther, April – Language Testing, 2019

This study investigated the construct validity of a local speaking test for international teaching assistants (ITAs) from a fairness perspective, by employing a multi-group confirmatory factor analysis (CFA) to examine the impact of task type and examinee first language (L1) background on the internal structure of the test. The test consists of…

Descriptors: Scores, Language Tests, Teaching Assistants, Culture Fair Tests

Investigating the Construct Measured by Banked Gap-Fill Items: Evidence from Eye-Tracking

Peer reviewed

Direct link

McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018

This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…

Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements

Advancing the Validity Argument for Standardized Writing Tests Using Quantitative Rhetorical Analysis

Peer reviewed

Direct link

Beigman Klebanov, Beata; Ramineni, Chaitanya; Kaufer, David; Yeoh, Paul; Ishizaki, Suguru – Language Testing, 2019

Essay writing is a common type of constructed-response task used frequently in standardized writing assessments. However, the impromptu timed nature of the essay writing tests has drawn increasing criticism for the lack of authenticity for real-world writing in classroom and workplace settings. The goal of this paper is to contribute evidence to a…

Descriptors: Test Validity, Writing Tests, Writing Skills, Persuasive Discourse

Strategies for Testing Statistical and Practical Significance in Detecting DIF with Logistic Regression Models

Peer reviewed

Direct link

Fidalgo, Angel M.; Alavi, Seyed Mohammad; Amirian, Seyed Mohammad Reza – Language Testing, 2014

This study examines three controversial aspects in differential item functioning (DIF) detection by logistic regression (LR) models: first, the relative effectiveness of different analytical strategies for detecting DIF; second, the suitability of the Wald statistic for determining the statistical significance of the parameters of interest; and…

Descriptors: Test Bias, Regression (Statistics), Statistical Significance, Language Tests

Validity Arguments for Diagnostic Assessment Using Automated Writing Evaluation

Peer reviewed

Direct link

Chapelle, Carol A.; Cotos, Elena; Lee, Jooyoung – Language Testing, 2015

Two examples demonstrate an argument-based approach to validation of diagnostic assessment using automated writing evaluation (AWE). "Criterion"®, was developed by Educational Testing Service to analyze students' papers grammatically, providing sentence-level error feedback. An interpretive argument was developed for its use as part of…

Descriptors: Diagnostic Tests, Writing Evaluation, Automation, Test Validity

Grounding Lexical Diversity in Human Judgments

Peer reviewed

Direct link

Jarvis, Scott – Language Testing, 2017

The present study discusses the relevance of measures of lexical diversity (LD) to the assessment of learner corpora. It also argues that existing measures of LD, many of which have become specialized for use with language corpora, are fundamentally measures of lexical repetition, are based on an etic perspective of language, and lack construct…

Descriptors: Computational Linguistics, English (Second Language), Second Language Learning, Native Speakers

Examining the Impact of L2 Proficiency and Keyboarding Skills on Scores on TOEFL-iBT Writing Tasks

Peer reviewed

Direct link

Barkaoui, Khaled – Language Testing, 2014

A major concern with computer-based (CB) tests of second-language (L2) writing is that performance on such tests may be influenced by test-taker keyboarding skills. Poor keyboarding skills may force test-takers to focus their attention and cognitive resources on motor activities (i.e., keyboarding) and, consequently, other processes and aspects of…

Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

Peer reviewed

Direct link

Pellicer-Sanchez, Ana; Schmitt, Norbert – Language Testing, 2012

Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…

Descriptors: Language Tests, Scoring, Reaction Time, Vocabulary Development

Previous Page | Next Page »

Pages: 1 | 2

Barkaoui, Khaled	2
Yan, Xun	2
Alavi, Seyed Mohammad	1
Amirian, Seyed Mohammad Reza	1
Beglar, David	1
Beigman Klebanov, Beata	1
Bridgeman, Brent	1
Briggs, Sarah L.	1
Brown, James Dean	1
Brunfaut, Tineke	1
Chapelle, Carol A.	1
Cheng, Lixia	1
Cho, Yeonsuk	1
Choi, Ikkyu	1
Cotos, Elena	1
Fairclough, Marta	1
Fan, Jason	1
Fidalgo, Angel M.	1
Ginther, April	1
Hale, Gordon A.	1
Hu, Ruolin	1
Isaacs, Talia	1
Ishizaki, Suguru	1
Jarvis, Scott	1
Kalender, Ilker	1
More ▼