ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	40

Descriptor

Second Language Learning	43
Statistical Analysis	43
Language Tests	32
English (Second Language)	29
Foreign Countries	17
Correlation	15
Language Proficiency	15
Comparative Analysis	14
Scores	13
Oral Language	11
Second Language Instruction	10
Scoring	9
Test Items	8
Testing	8
Evaluators	7
Native Speakers	7
College Students	6
Item Analysis	6
Reading Comprehension	6
Task Analysis	6
Test Validity	6
Writing Evaluation	6
Computational Linguistics	5
Difficulty Level	5
Elementary School Students	5
More ▼

Source

Language Testing

Publication Type

Journal Articles	43
Reports - Research	30
Reports - Evaluative	11
Tests/Questionnaires	7
Information Analyses	2
Opinion Papers	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Higher Education	14
Postsecondary Education	7
Elementary Education	4
Secondary Education	3
Elementary Secondary Education	1
Grade 6	1
High Schools	1

Audience

Location

Japan	4
Australia	2
China	2
United Kingdom	2
Canada	1
Chile	1
Colombia	1
Ecuador	1
Iran	1
Israel	1
Malaysia	1
Netherlands	1
Ohio	1
Poland	1
Russia	1
South Korea	1
Sweden	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	6
International English…	1
Michigan Test of English…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 43 results Save | Export

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Direct link

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Measuring L2 Speakers' Interactional Ability Using Interactive Speech Tasks

Peer reviewed

Direct link

van Batenburg, Eline S. L.; Oostdam, Ron J.; van Gelderen, Amos J. S.; de Jong, Nivja H. – Language Testing, 2018

This article explores ways to assess interactional performance, and reports on the use of a test format that standardizes the interlocutor's linguistic and interactional contributions to the exchange. It describes the construction and administration of six scripted speech tasks (instruction, advice, and sales tasks) with pre-vocational learners (n…

Descriptors: Second Language Learning, Speech Tests, Interaction, Test Reliability

The Selection of Cognitive Diagnostic Models for a Reading Comprehension Test

Peer reviewed

Direct link

Li, Hongli; Hunter, C. Vincent; Lei, Pui-Wa – Language Testing, 2016

Cognitive diagnostic models (CDMs) have great promise for providing diagnostic information to aid learning and instruction, and a large number of CDMs have been proposed. However, the assumptions and performances of different CDMs and their applications in regard to reading comprehension tests are not fully understood. In the present study, we…

Descriptors: Reading Comprehension, Reading Tests, Models, Comparative Analysis

Investigating the Construct Measured by Banked Gap-Fill Items: Evidence from Eye-Tracking

Peer reviewed

Direct link

McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018

This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…

Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Functional Adequacy in L2 Writing: Towards a New Rating Scale

Peer reviewed

Direct link

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017

The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…

Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse

Young Learners' Response Processes When Taking Computerized Tasks for Speaking Assessment

Peer reviewed

Direct link

Lee, Shinhye; Winke, Paula – Language Testing, 2018

We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…

Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing

Task and Rater Effects in L2 Speaking and Writing: A Synthesis of Generalizability Studies

Peer reviewed

Direct link

In'nami, Yo; Koizumi, Rie – Language Testing, 2016

We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…

Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language

The Relationship between Three Measures of L2 Vocabulary Knowledge and L2 Listening and Reading

Peer reviewed

Direct link

Cheng, Junyu; Matthews, Joshua – Language Testing, 2018

This study explores the constructs that underpin three different measures of vocabulary knowledge and investigates the degree to which these three measures correlate with, and are able to predict, measures of second language (L2) listening and reading. Word frequency structured vocabulary tests tapping "receptive/orthographic (RecOrth)…

Descriptors: Listening Comprehension, Reading Comprehension, Reading Tests, Correlation

Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis

Peer reviewed

Direct link

Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016

Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…

Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size

The Use of Eye Tracking in Research on Video-Based Second Language (L2) Listening Assessment: A Comparison of Context Videos and Content Videos

Peer reviewed

Direct link

Suvorov, Ruslan – Language Testing, 2015

Investigating how visuals affect test takers' performance on video-based L2 listening tests has been the focus of many recent studies. While most existing research has been based on test scores and self-reported verbal data, few studies have examined test takers' viewing behavior (Ockey, 2007; Wagner, 2007, 2010a). To address this gap, in the…

Descriptors: Eye Movements, Second Language Learning, Listening Comprehension, Video Technology

Assessing Syntactic Sophistication in L2 Writing: A Usage-Based Approach

Peer reviewed

Direct link

Kyle, Kristopher; Crossley, Scott – Language Testing, 2017

Over the past 45 years, the construct of syntactic sophistication has been assessed in L2 writing using what Bulté and Housen (2012) refer to as absolute complexity (Lu, 2011; Ortega, 2003; Wolfe-Quintero, Inagaki, & Kim, 1998). However, it has been argued that making inferences about learners based on absolute complexity indices (e.g., mean…

Descriptors: Syntax, Verbs, Second Language Learning, Word Frequency

Using Corpus Linguistics to Examine the Extrapolation Inference in the Validity Argument for a High-Stakes Speaking Assessment

Peer reviewed

Direct link

LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017

Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…

Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences

Determining Cloze Item Difficulty from Item and Passage Characteristics across Different Learner Backgrounds

Peer reviewed

Direct link

Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017

Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…

Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis

Construct Validity in TOEFL iBT Speaking Tasks: Insights from Natural Language Processing

Peer reviewed

Direct link

Kyle, Kristopher; Crossley, Scott A.; McNamara, Danielle S. – Language Testing, 2016

This study explores the construct validity of speaking tasks included in the TOEFL iBT (e.g., integrated and independent speaking tasks). Specifically, advanced natural language processing (NLP) tools, MANOVA difference statistics, and discriminant function analyses (DFA) are used to assess the degree to which and in what ways responses to these…

Descriptors: Construct Validity, Natural Language Processing, Speech Skills, Speech Acts

Previous Page | Next Page »

Pages: 1 | 2 | 3

Crossley, Scott A.	2
Kyle, Kristopher	2
McNamara, Danielle S.	2
Allalouf, Avi	1
Alvarez, Marta E.	1
Babaii, Esmat	1
Bachman, Lyle F.	1
Bae, Jungok	1
Bax, Stephen	1
Brown, James Dean	1
Brunfaut, Tineke	1
Butler, Yuko Goto	1
Campfield, Dorota E.	1
Cheng, Junyu	1
Crossley, Scott	1
Davies, Alan	1
Davis, Larry	1
Feng, Ying	1
Filipi, Anna	1
Garras, John	1
Ginther, April	1
Giunta, Anthony	1
Granfeldt, Jonas	1
Hopp, Holger	1
More ▼