NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
Assessments and Surveys
Wechsler Preschool and…1
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Zhang, Ci; Xu, XiaoShu; Zhang, Yunfeng – Language Testing in Asia, 2023
This study presents the validation process of a listening test based on a communicative language test proposed by Bachman (Fundamental considerations in language testing, 1990). It was administered to third-grade high school students by the sixteen Korean Provincial Offices of Education for Curriculum and Evaluation in September 2012 to assess…
Descriptors: Language Tests, Second Language Learning, Second Language Instruction, Listening Comprehension Tests
Shujuan Wang – ProQuest LLC, 2021
Existing methods used to validate self-report questionnaires in foreign language teaching effectiveness have relied on Classical Test Theory (CTT). However, the use of CTT approaches limits the reliability and validity of self-report instruments. The Rasch Model, which is based on the principles of objective measurement, addresses some of the…
Descriptors: Second Language Programs, Second Language Learning, Second Language Instruction, Language Tests
Bronson Hui – ProQuest LLC, 2021
Vocabulary researchers have started expanding their assessment toolbox by incorporating timed tasks and psycholinguistic instruments (e.g., priming tasks) to gain insights into lexical development (e.g., Elgort, 2011; Godfroid, 2020b; Nakata & Elgort, 2020; Vandenberghe et al., 2021). These timed sensitive and implicit word measures differ…
Descriptors: Measures (Individuals), Construct Validity, Decision Making, Vocabulary Development
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Ahyoung Alicia; Tywoniw, Rurik L.; Chapman, Mark – Language Assessment Quarterly, 2022
Technology-enhanced items (TEIs) are innovative, computer-delivered test items that allow test takers to better interact with the test environment compared to traditional multiple-choice items (MCIs). The interactive nature of TEIs offer improved construct coverage compared with MCIs but little research exists regarding students' performance on…
Descriptors: Language Tests, Test Items, Computer Assisted Testing, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
O'Grady, Stefan – Innovation in Language Learning and Teaching, 2023
Purpose: The current study applies an innovative approach to the assessment of second language listening comprehension skills. This is an important focus in need of innovation because scores generated through language assessment tasks should reflect variation in the target skill and the literature broadly suggests that conventional methods of…
Descriptors: Listening Comprehension, Second Language Learning, Correlation, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022
Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Bourdeaud'Hui, Heleen; Aesaert, Koen; van Braak, Johan – Language Assessment Quarterly, 2021
Effective listening comprehension skills are an important prerequisite for the academic success of primary school students. However, the assessment of listening skills in the instructional language appears to have received only scant attention in the literature. Therefore, the goal of the present study was twofold. Firstly, a comprehensive…
Descriptors: Native Language, Indo European Languages, Second Language Learning, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Kaya, Elif; O'Grady, Stefan; Kalender, Ilker – Language Testing, 2022
Language proficiency testing serves an important function of classifying examinees into different categories of ability. However, misclassification is to some extent inevitable and may have important consequences for stakeholders. Recent research suggests that classification efficacy may be enhanced substantially using computerized adaptive…
Descriptors: Item Response Theory, Test Items, Language Tests, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Klem, Marianne; Gustafsson, Jan-Eric; Hagtvet, Bente – Scandinavian Journal of Educational Research, 2015
The Norwegian government recommends a systematic language assessment of all four-year-olds as part of the general health surveillance program for the purpose of identifying children at risk of language delay. This study aimed to investigate the construct validity of the recommended language screening tool called LANGUAGE4 [SPRÃ…K4] by first…
Descriptors: Norwegian, Language Skills, Construct Validity, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Jungtae; Craig, Daniel A. – Computer Assisted Language Learning, 2012
Videoconferencing offers new opportunities for language testers to assess speaking ability in low-stakes diagnostic tests. To be considered a trusted testing tool in language testing, a test should be examined employing appropriate validation processes [Chapelle, C.A., Jamieson, J., & Hegelheimer, V. (2003). "Validation of a web-based ESL…
Descriptors: Speech Communication, Testing, Language Tests, Construct Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
MacSwan, Jeff; Mahoney, Kate – Journal of Educational Research & Policy Studies, 2008
Construct validity concerns for the IPT I Oral Grades K-6 Spanish Second Edition (IPT-S) as a measure of native oral language proficiency are examined. The examination included describing a subset of items that contributes most to overall score and native-language proficiency designation. Correlations between this subset of items and the overall…
Descriptors: Language Research, Oral Language, Language Tests, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Saekyun H.; Han, Hyunjoo – Applied Language Learning, 2007
This study investigated some issues regarding the validity of the Scholastic Achievement Test (SAT) Subject Test: Korean with Listening. The SAT Korean has been administered just once a year since its inception in 1997. As of March 2006, it had been administered nine times. However, SAT foreign language tests are not as rigorously researched as…
Descriptors: Test Results, Second Language Learning, Language Tests, Academic Achievement
Peer reviewed Peer reviewed
Henning, Grant – Language Testing, 1988
Violations of item unidimensionality on language tests produced distorted estimates of person ability, and violations of person unidimensionality produced distorted estimates of item difficulty. The Bejar Method was sensitive to such distortions. (Author)
Descriptors: Construct Validity, Content Validity, Difficulty Level, Item Analysis