Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publicly available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Ingela Holmström; Krister Schönström; Magnus Ryttervik – Language Assessment Quarterly, 2024
There is a lack of tests available for assessing sign language proficiency among L2 learners. We have therefore developed a sign repetition test, SignRepL2, with a specific focus on the phonological features of signs. This paper describes the two phases of developing this test. In the first phase, content was developed in the form of 50 items with…
Descriptors: Sign Language, Novices, Task Analysis, Second Language Learning
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Justin Harris – Language Teaching Research, 2025
This article outlines the development of a 16-item instrument for measuring language learners' foreign language self-efficacy (SE) concerning their speaking and listening skills through repeated administrations to groups of Japanese tertiary students. Responses were analysed through the Rasch model, which allows researchers to investigate…
Descriptors: Speech Communication, Questionnaires, Item Analysis, Second Language Learning
Serap Buyukkidik – International Journal of Assessment Tools in Education, 2023
In the current study, differential item functioning (DIF) detection using real data was conducted with the application of "Mantel-Haenszel (MH)", "Simultaneous item bias test (SIBTEST)", "Lord's chi-square", and "Raju's area" methods, both when item purification was carried out and when item purification was…
Descriptors: Language Tests, Test Items, Item Analysis, Gender Differences
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Tim Stoeckel; Liang Ye Tan; Hung Tan Ha; Nam Thi Phuong Ho; Tomoko Ishii; Young Ae Kim; Chunmei Huang; Stuart McLean – Vocabulary Learning and Instruction, 2024
Local item dependency (LID) occurs when test-takers' responses to one test item are affected by their responses to another. It can be problematic if it causes inflated reliability estimates or distorted person and item measures. The cued-recall reading comprehension test in Hu and Nation's (2000) well-known and influential coverage-comprehension…
Descriptors: Reading Comprehension, English (Second Language), Second Language Instruction, Second Language Learning
Hryvko, Antonina V.; Zhuk, Yurii O. – Journal of Curriculum and Teaching, 2022
A feature of the presented study is a comprehensive approach to studying the reliability problem of linguistic testing results due to the impact of several functional and variable factors. Contradictions and ambiguous views of scientists on the researched issues determine the relevance of this study. The article highlights the problem of equivalence…
Descriptors: Student Evaluation, Language Tests, Test Format, Test Items
Janssen, Gerriet – Language Testing, 2022
This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…
Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty
Seyedeh Azadeh Ghiasian; Fatemeh Hemmati; Seyyed Mohammad Alavi; Afsar Rouhi – International Journal of Language Testing, 2025
A critical component of cognitive diagnostic models (CDMs) is a Q-matrix that stipulates associations between items of a test and their required attributes. The present study aims to develop and empirically validate a Q-matrix for the listening comprehension section of the International English Language Testing System (IELTS). To this end, a…
Descriptors: Test Items, Listening Comprehension Tests, English (Second Language), Language Tests
Rümeysa Kaya; Bayram Çetin – International Journal of Assessment Tools in Education, 2025
In this study, the cut-off scores obtained from the Angoff, Angoff Y/N, Nedelsky and Ebel standard methods were compared with the 50 T score and the current cut-off score in various aspects. Data were collected from 448 students who took Module B1+ English Exit Exam IV and 14 experts. It was seen that while the Nedelsky method gave the lowest…
Descriptors: Standard Setting, Cutting Scores, Exit Examinations, Academic Achievement
Tu, Thuy Thi Minh – ProQuest LLC, 2023
The study aimed to elicit information from Vietnamese EFL university instructors about their knowledge and skills regarding the principles, theory, and practices of language assessment by means of revision and validation of the Language Assessment Literacy--Revised Vietnam (LAL-RV), which was previously developed by Kremmel and Harding (2020). A…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, College Faculty
Tim Stoeckel; Tomoko Ishii – Vocabulary Learning and Instruction, 2024
In an upcoming coverage-comprehension study, we plan to assess learners' meaning-recall knowledge of words as they occur in the study's reading passage. As several meaning-recall test formats exist, the purpose of this small-scale study (N = 10) was to determine which of three formats was most similar to a criterion interview regarding mean score…
Descriptors: Vocabulary Development, Language Tests, Second Language Learning, Classification

