Publication Date
In 2025 | 2 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 26 |
Since 2016 (last 10 years) | 43 |
Since 2006 (last 20 years) | 61 |
Descriptor
Second Language Learning | 70 |
Test Format | 70 |
Test Items | 70 |
English (Second Language) | 59 |
Language Tests | 55 |
Foreign Countries | 42 |
Second Language Instruction | 29 |
Scores | 25 |
Item Analysis | 24 |
Language Proficiency | 21 |
Test Construction | 20 |
More ▼ |
Source
Author
McLean, Stuart | 3 |
O'Grady, Stefan | 3 |
Kremmel, Benjamin | 2 |
Pae, Tae-Il | 2 |
Stewart, Jeffrey | 2 |
Abramzon, Andrea | 1 |
Agnieszka Slezak-Swiat | 1 |
Aizawa, Kazumi | 1 |
Akbay, Lokman | 1 |
Akbay, Tuncer | 1 |
Akhavan Masoumi, Ghazal | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 31 |
Postsecondary Education | 28 |
Secondary Education | 11 |
Elementary Education | 4 |
Middle Schools | 4 |
Junior High Schools | 3 |
High Schools | 2 |
Early Childhood Education | 1 |
Grade 8 | 1 |
Location
Japan | 7 |
Turkey | 7 |
South Korea | 4 |
Iran | 3 |
United Kingdom | 3 |
Australia | 2 |
Canada | 2 |
China | 2 |
Japan (Tokyo) | 2 |
Austria | 1 |
European Union | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 9 |
Test of English for… | 3 |
International English… | 2 |
Computer Attitude Scale | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Kyung-Mi O. – Language Testing in Asia, 2024
This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…
Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items
Stefan O'Grady – International Journal of Listening, 2025
Language assessment is increasingly computermediated. This development presents opportunities with new task formats and equally a need for renewed scrutiny of established conventions. Recent recommendations to increase integrated skills assessment in lecture comprehension tests is premised on empirical research that demonstrates enhanced construct…
Descriptors: Language Tests, Lecture Method, Listening Comprehension Tests, Multiple Choice Tests
Sharareh Sadat Sarsarabi; Zeinab Sazegar – International Journal of Language Testing, 2023
The statement stated in a multiple-choice question can be developed regarding two types of sentences: Interruptive (periodic) and cumulative (or loose). This study deals with different kinds of stems in designing multiple-choice (MC) items. To fill the existing gap in the literature, two groups of teacher students passing general English courses…
Descriptors: Language Tests, Test Format, Multiple Choice Tests, Student Placement
Liao, Ray J. T. – Language Testing, 2023
Among the variety of selected response formats used in L2 reading assessment, multiple-choice (MC) is the most commonly adopted, primarily due to its efficiency and objectiveness. Given the impact of assessment results on teaching and learning, it is necessary to investigate the degree to which the MC format reliably measures learners' L2 reading…
Descriptors: Reading Tests, Language Tests, Second Language Learning, Second Language Instruction
Tim Stoeckel; Tomoko Ishii – Vocabulary Learning and Instruction, 2024
In an upcoming coverage-comprehension study, we plan to assess learners' meaning-recall knowledge of words as they occur in the study's reading passage. As several meaning-recall test formats exist, the purpose of this small-scale study (N = 10) was to determine which of three formats was most similar to a criterion interview regarding mean score…
Descriptors: Vocabulary Development, Language Tests, Second Language Learning, Classification
Jeffrey Martin – Vocabulary Learning and Instruction, 2022
The functioning of a vocabulary testing instrument rests in part on the test-taking actions made possible for examinees by item format, an aspect of test development that warrants consideration in second-language vocabulary research. For example, although iterations of the written receptive vocabulary levels test (VLT) have integrated improvements…
Descriptors: Test Wiseness, Vocabulary, Vocabulary Development, Second Language Learning
Monika Grotek; Agnieszka Slezak-Swiat – Reading in a Foreign Language, 2024
The study investigates the effect of the perception of text and task difficulty on adults' performance in reading tests in L1 and L2. The relationship between the following variables is studied: (a) readers' perception of text and task difficulty in L1 and L2 measured in a self-reported post-task questionnaire, (b) the number of correct answers to…
Descriptors: Difficulty Level, Second Language Learning, Eye Movements, Task Analysis
O'Grady, Stefan – Language Teaching Research, 2023
The current study explores the impact of varying multiple-choice question preview and presentation formats in a test of second language listening proficiency targeting different levels of text comprehension. In a between-participant design, participants completed a 30-item test of listening comprehension featuring implicit and explicit information…
Descriptors: Language Tests, Multiple Choice Tests, Scores, Second Language Learning
Burton, J. Dylan – Language Assessment Quarterly, 2023
The effects of question or task complexity on second language speaking have traditionally been investigated using complexity, accuracy, and fluency measures. Response processes in speaking tests, however, may manifest in other ways, such as through nonverbal behavior. Eye behavior, in the form of averted gaze or blinking frequency, has been found…
Descriptors: Oral Language, Speech Communication, Language Tests, Eye Movements
Jonathan Trace – Language Teaching Research Quarterly, 2023
The role of context in cloze tests has long been seen as both a benefit as well as a complication in their usefulness as a measure of second language comprehension (Brown, 2013). Passage cohesion, in particular, would seem to have a relevant and important effect on the degree to which cloze items function and the interpretability of performances…
Descriptors: Language Tests, Cloze Procedure, Connected Discourse, Test Items
Akhavan Masoumi, Ghazal; Sadeghi, Karim – Language Testing in Asia, 2020
This study aimed to examine the effect of test format on test performance by comparing Multiple Choice (MC) and Constructed Response (CR) vocabulary tests in an EFL setting. Also, this paper investigated the function of gender in MC and CR vocabulary measures. To this end, five 20-item stem-equivalent vocabulary tests (CR, and 3-, 4-, 5-, and…
Descriptors: Language Tests, Test Items, English (Second Language), Second Language Learning
Eberharter, Kathrin; Kormos, Judit; Guggenbichler, Elisa; Ebner, Viktoria S.; Suzuki, Shungo; Moser-Frötscher, Doris; Konrad, Eva; Kremmel, Benjamin – Language Testing, 2023
In online environments, listening involves being able to pause or replay the recording as needed. Previous research indicates that control over the listening input could improve the measurement accuracy of listening assessment. Self-pacing also supports the second language (L2) comprehension processes of test-takers with specific learning…
Descriptors: Literacy, Native Language, Second Language Learning, Second Language Instruction
Al-Jarf, Reima – Online Submission, 2023
This article aims to give a comprehensive guide to planning and designing vocabulary tests which include Identifying the skills to be covered by the test; outlining the course content covered; preparing a table of specifications that shows the skill, content topics and number of questions allocated to each; and preparing the test instructions. The…
Descriptors: Vocabulary Development, Learning Processes, Test Construction, Course Content
Gyllstad, Henrik; McLean, Stuart; Stewart, Jeffrey – Language Testing, 2021
The last three decades have seen an increase of tests aimed at measuring an individual's vocabulary level or size. The target words used in these tests are typically sampled from word frequency lists, which are in turn based on language corpora. Conventionally, test developers sample items from frequency bands of 1000 words; different tests employ…
Descriptors: Vocabulary Development, Sample Size, Language Tests, Test Items