ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	16
Since 2007 (last 20 years)	52

Source

Language Testing

Publication Type

Journal Articles	52
Reports - Research	39
Reports - Evaluative	11
Tests/Questionnaires	8
Information Analyses	3
Reports - Descriptive	2
Speeches/Meeting Papers	1

Education Level

Higher Education	18
Postsecondary Education	11
Secondary Education	6
Elementary Education	5
Elementary Secondary Education	1
Grade 5	1
Grade 6	1
Intermediate Grades	1

Audience

Location

Japan	6
Australia	3
China	2
Sweden	2
United Kingdom	2
Canada	1
Chile	1
Colombia	1
Denmark	1
Ecuador	1
Georgia	1
Germany	1
Iowa	1
Iran	1
Malaysia	1
Netherlands	1
Norway	1
Ohio	1
Poland	1
Russia	1
South Korea	1
Texas	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	5
Early Childhood Longitudinal…	1
International English…	1
Michigan Test of English…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 52 results Save | Export

A Nonparametric Procedure for Exploring Differences in Rating Quality across Test-Taker Subgroups in Rater-Mediated Writing Assessments

Peer reviewed

Direct link

Wind, Stefanie A. – Language Testing, 2019

Differences in rater judgments that are systematically related to construct-irrelevant characteristics threaten the fairness of rater-mediated writing assessments. Accordingly, it is essential that researchers and practitioners examine the degree to which the psychometric quality of rater judgments is comparable across test-taker subgroups.…

Descriptors: Nonparametric Statistics, Interrater Reliability, Differences, Writing Tests

A Systematic Review of Methods for Evaluating Rating Quality in Language Assessment

Peer reviewed

Direct link

Wind, Stefanie A.; Peterson, Meghan E. – Language Testing, 2018

The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on…

Descriptors: Language Tests, Evaluators, Evaluation Methods, Interrater Reliability

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Direct link

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

National Reading Tests in Denmark, Norway, and Sweden: A Comparison of Construct Definitions, Cognitive Targets, and Response Formats

Peer reviewed

Direct link

Tengberg, Michael – Language Testing, 2017

Reading comprehension tests are often assumed to measure the same, or at least similar, constructs. Yet, reading is not a single but a multidimensional form of processing, which means that variations in terms of reading material and item design may emphasize one aspect of the construct at the cost of another. The educational systems in Denmark,…

Descriptors: Foreign Countries, National Competency Tests, Reading Tests, Comparative Analysis

Measuring L2 Speakers' Interactional Ability Using Interactive Speech Tasks

Peer reviewed

Direct link

van Batenburg, Eline S. L.; Oostdam, Ron J.; van Gelderen, Amos J. S.; de Jong, Nivja H. – Language Testing, 2018

This article explores ways to assess interactional performance, and reports on the use of a test format that standardizes the interlocutor's linguistic and interactional contributions to the exchange. It describes the construction and administration of six scripted speech tasks (instruction, advice, and sales tasks) with pre-vocational learners (n…

Descriptors: Second Language Learning, Speech Tests, Interaction, Test Reliability

Setting Cut Scores on an EFL Placement Test Using the Prototype Group Method: A Receiver Operating Characteristic (ROC) Analysis

Peer reviewed

Direct link

Eckes, Thomas – Language Testing, 2017

This paper presents an approach to standard setting that combines the prototype group method (PGM; Eckes, 2012) with a receiver operating characteristic (ROC) analysis. The combined PGM-ROC approach is applied to setting cut scores on a placement test of English as a foreign language (EFL). To implement the PGM, experts first named learners whom…

Descriptors: English (Second Language), Language Tests, Cutting Scores, Standard Setting (Scoring)

The Selection of Cognitive Diagnostic Models for a Reading Comprehension Test

Peer reviewed

Direct link

Li, Hongli; Hunter, C. Vincent; Lei, Pui-Wa – Language Testing, 2016

Cognitive diagnostic models (CDMs) have great promise for providing diagnostic information to aid learning and instruction, and a large number of CDMs have been proposed. However, the assumptions and performances of different CDMs and their applications in regard to reading comprehension tests are not fully understood. In the present study, we…

Descriptors: Reading Comprehension, Reading Tests, Models, Comparative Analysis

Investigating the Construct Measured by Banked Gap-Fill Items: Evidence from Eye-Tracking

Peer reviewed

Direct link

McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018

This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…

Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Young Learners' Response Processes When Taking Computerized Tasks for Speaking Assessment

Peer reviewed

Direct link

Lee, Shinhye; Winke, Paula – Language Testing, 2018

We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…

Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing

Functional Adequacy in L2 Writing: Towards a New Rating Scale

Peer reviewed

Direct link

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017

The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…

Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse

Adaptation of a Vocabulary Test from British Sign Language to American Sign Language

Peer reviewed

Direct link

Mann, Wolfgang; Roy, Penny; Morgan, Gary – Language Testing, 2016

This study describes the adaptation process of a vocabulary knowledge test for British Sign Language (BSL) into American Sign Language (ASL) and presents results from the first round of pilot testing with 20 deaf native ASL signers. The web-based test assesses the strength of deaf children's vocabulary knowledge by means of different mappings of…

Descriptors: Deafness, Language Skills, Vocabulary Development, American Sign Language

Task and Rater Effects in L2 Speaking and Writing: A Synthesis of Generalizability Studies

Peer reviewed

Direct link

In'nami, Yo; Koizumi, Rie – Language Testing, 2016

We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…

Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language

The Relationship between Three Measures of L2 Vocabulary Knowledge and L2 Listening and Reading

Peer reviewed

Direct link

Cheng, Junyu; Matthews, Joshua – Language Testing, 2018

This study explores the constructs that underpin three different measures of vocabulary knowledge and investigates the degree to which these three measures correlate with, and are able to predict, measures of second language (L2) listening and reading. Word frequency structured vocabulary tests tapping "receptive/orthographic (RecOrth)…

Descriptors: Listening Comprehension, Reading Comprehension, Reading Tests, Correlation

Evaluating Different Standard-Setting Methods in an ESL Placement Testing Context

Peer reviewed

Direct link

Shin, Sun-Young; Lidster, Ryan – Language Testing, 2017

In language programs, it is crucial to place incoming students into appropriate levels to ensure that course curriculum and materials are well targeted to their learning needs. Deciding how and where to set cutscores on placement tests is thus of central importance to programs, but previous studies in educational measurement disagree as to which…

Descriptors: Language Tests, English (Second Language), Standard Setting (Scoring), Student Placement

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Crossley, Scott A.	2
Kyle, Kristopher	2
McNamara, Danielle S.	2
Wind, Stefanie A.	2
Alvarez, Marta E.	1
Babaii, Esmat	1
Bachman, Lyle F.	1
Bae, Jungok	1
Batty, Aaron Olaf	1
Bax, Stephen	1
Brown, James Dean	1
Brunfaut, Tineke	1
Butler, Yuko Goto	1
Campfield, Dorota E.	1
Chalhoub-Deville, Micheline	1
Chapelle, Carol A.	1
Chen, Fang	1
Cheng, Junyu	1
Cotos, Elena	1
Crossley, Scott	1
Davies, Alan	1
Davis, Larry	1
Eckes, Thomas	1
Feng, Ying	1
Filipi, Anna	1
More ▼

Statistical Analysis	52
Second Language Learning	40
Language Tests	37
English (Second Language)	31
Foreign Countries	21
Comparative Analysis	18
Language Proficiency	18
Correlation	16
Scores	15
Oral Language	12
Second Language Instruction	10
Evaluators	9
Scoring	9
Test Items	9
College Students	8
Native Speakers	8
Interrater Reliability	7
Item Response Theory	7
Secondary School Students	7
Test Validity	7
Writing Evaluation	7
Computer Assisted Testing	6
Difficulty Level	6
Elementary School Students	6
Item Analysis	6
More ▼