Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 7 |
| Since 2017 (last 10 years) | 9 |
| Since 2007 (last 20 years) | 15 |
Descriptor
| Item Analysis | 21 |
| Second Language Learning | 21 |
| Test Reliability | 21 |
| English (Second Language) | 16 |
| Language Tests | 15 |
| Test Validity | 14 |
| Foreign Countries | 12 |
| Test Items | 12 |
| Second Language Instruction | 9 |
| Test Construction | 9 |
| Multiple Choice Tests | 7 |
| More ▼ | |
Source
Author
Publication Type
| Journal Articles | 15 |
| Reports - Research | 15 |
| Speeches/Meeting Papers | 3 |
| Dissertations/Theses -… | 2 |
| Tests/Questionnaires | 2 |
| Information Analyses | 1 |
| Reports - Evaluative | 1 |
Education Level
| Higher Education | 8 |
| Postsecondary Education | 8 |
| Adult Education | 2 |
| Elementary Education | 1 |
| Grade 8 | 1 |
Audience
Location
| Indonesia | 2 |
| Iran | 2 |
| China | 1 |
| Colombia | 1 |
| Europe | 1 |
| Iraq | 1 |
| Japan | 1 |
| Pakistan | 1 |
| Russia | 1 |
| Saudi Arabia | 1 |
| Thailand | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Test of English as a Foreign… | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Mardiana – Eurasian Journal of Applied Linguistics, 2023
Written inquiries, which are more frequent and have less of a focus on complex thinking, are issues at school. Students are not taught how to respond to questions found in High-Level Thinking Skills (HOTS) tests, hence, their thinking abilities are generally weak. The issue for teachers is that neither they nor anyone else has been able to create…
Descriptors: Skill Development, Thinking Skills, Check Lists, Models
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Al-Jarf, Reima – Online Submission, 2023
This study explores the similarities and differences between English and Arabic numeral-based formulaic expressions, and difficulties that student-translators have with them. A corpus of English and Arabic numeral-based formulaic expressions containing zero, two, three, twenty, sixty, hundred, thousand…etc., and another corpus of specialized…
Descriptors: Translation, Arabic, Contrastive Linguistics, Phrase Structure
Cheewasukthaworn, Kanchana – PASAA: Journal of Language Teaching and Learning in Thailand, 2022
In 2016, the Office of the Higher Education Commission issued a directive requiring all higher education institutions in Thailand to have their students take a standardized English proficiency test. According to the directive, the test's results had to align with the Common European Framework of Reference for Languages (CEFR). In response to this…
Descriptors: Test Construction, Standardized Tests, Language Tests, English (Second Language)
Omarov, Nazarbek Bakytbekovich; Mohammed, Aisha; Alghurabi, Ammar Muhi Khleel; Alallo, Hajir Mahmood Ibrahim; Ali, Yusra Mohammed; Hassan, Aalaa Yaseen; Demeuova, Lyazat; Viktorovna, Shvedova Irina; Nazym, Bekenova; Al Khateeb, Nashaat Sultan Afif – International Journal of Language Testing, 2023
The Multiple-choice (MC) item format is commonly used in educational assessments due to its economy and effectiveness across a variety of content domains. However, numerous studies have examined the quality of MC items in high-stakes and higher-education assessments and found many flawed items, especially in terms of distractors. These faulty…
Descriptors: Test Items, Multiple Choice Tests, Item Response Theory, English (Second Language)
Ji-young Shin – ProQuest LLC, 2021
The present dissertation investigated the impact of scales/scoring methods and prompt linguistic features on the measurement quality of L2 English elicited imitation (EI). Scales/scoring methods are an important feature for the validity and reliability of L2 EI test, but less is known (Yan et al., 2016). Prompt linguistic features are also known…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Semantics
Kim, Peter – Language Teaching Research Quarterly, 2021
Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…
Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction
Baghaei, Purya; Dourakhshan, Alireza – International Journal of Language Testing, 2016
The purpose of the present study is to compare the psychometric qualities of canonical single-response multiple-choice items with their double-response counterparts. Thirty, two-response fouroption grammar items for undergraduate students of English were constructed. A second version of the test was constructed by replacing one of the correct…
Descriptors: Language Tests, Multiple Choice Tests, Test Items, Factor Analysis
Palacio, Marcela; Gaviria, Sandra; Brown, James Dean – PROFILE: Issues in Teachers' Professional Development, 2016
Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction
Xu, Lan; Wannaruk, Anchalee – LEARN Journal: Language Education and Acquisition Research Network, 2016
Performing routines in interlanguage is vitally important for EFL learners since it can cause embarrassment between speakers from different cultures. The present study aims to 1) investigate the reliability and validity of an interlanguge pragmatic competence test on routines in a Chinese EFL context with multiple choice discourse completion task…
Descriptors: Language Tests, Test Construction, Pragmatics, Interlanguage
Brown, N. Anthony; Dewey, Dan P.; Cox, Troy L. – Foreign Language Annals, 2014
In this study, the authors evaluated the strengths and limitations of a self-assessment based on ACTFL Can-Do statements ("ACTFL," 2013]) as a tool for measuring linguistic gains over an internship abroad in Russia. They assessed its reliability, determined how its items mapped with the ACTFL scale, and measured the degree to which…
Descriptors: Self Evaluation (Individuals), Pretests Posttests, Interviews, Language Proficiency
Haider, Zubair; Latif, Farah; Akhtar, Samina; Mushtaq, Maria – Educational Research and Reviews, 2012
Validity, reliability and item analysis are critical to the process of evaluating the quality of an educational measurement. The present study evaluates the quality of an assessment constructed to measure elementary school student's achievement in English. In this study, the survey model of descriptive research was used as a research method.…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Language Tests
Pomes, Maria Pilar – ProQuest LLC, 2012
Immigrant populations are growing and permanently changing the demographic profile of the United States. Diverse cultural and linguistic backgrounds are manifested in the families in each community, imposing demands and challenges to agencies that provide services to them. A large population of immigrant families, especially first and second…
Descriptors: Spanish, Translation, Screening Tests, Cultural Awareness
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
