Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 2 |
Descriptor
| Language Tests | 3 |
| Simulation | 3 |
| Second Language Learning | 2 |
| Academic Achievement | 1 |
| Academic Language | 1 |
| Accuracy | 1 |
| Bias | 1 |
| Case Studies | 1 |
| Comparative Analysis | 1 |
| Computational Linguistics | 1 |
| Content Validity | 1 |
| More ▼ | |
Source
| Language Testing | 3 |
Publication Type
| Journal Articles | 3 |
| Reports - Research | 3 |
Education Level
| High Schools | 1 |
| Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
| International English… | 1 |
| Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Yufan Zhao; Vahid Aryadoust – Language Testing, 2025
This study examined the semantic features of the simulated mini-lectures in the listening sections of the International English Language Testing System (IELTS) and the Test of English as a Foreign Language (TOEFL) based on automatized semantic analysis to explore the content validity of the two tests. Two study corpora were utilized, the IELTS…
Descriptors: Semantics, Computational Linguistics, Academic Language, Second Language Learning
Wind, Stefanie A. – Language Testing, 2023
Researchers frequently evaluate rater judgments in performance assessments for evidence of differential rater functioning (DRF), which occurs when rater severity is systematically related to construct-irrelevant student characteristics after controlling for student achievement levels. However, researchers have observed that methods for detecting…
Descriptors: Evaluators, Decision Making, Student Characteristics, Performance Based Assessment
Peer reviewedHenning, Grant – Language Testing, 1996
Analyzes simulated performance ratings on a six-point scale by two independent raters to account for nonsystematic error in performance ratings. Results suggest that rater agreement or covariance is not always a dependable estimate of score reliability and that the practice of seeking additional raters for adjudication of discrepant ratings is not…
Descriptors: Correlation, Error Patterns, Interrater Reliability, Language Tests

Direct link
