Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 7 |
Descriptor
| Interrater Reliability | 12 |
| Second Language Instruction | 12 |
| Test Validity | 12 |
| Language Tests | 10 |
| English (Second Language) | 8 |
| Second Language Learning | 8 |
| Foreign Countries | 6 |
| Test Reliability | 5 |
| Oral Language | 4 |
| Test Construction | 4 |
| Testing | 4 |
| More ▼ | |
Source
Author
| Ahmadi Safa, Mohammad | 1 |
| Bogorevich, Valeriia | 1 |
| Derek N. Canning | 1 |
| Doosti, Mehdi | 1 |
| Elder, Catherine | 1 |
| Ferroli, Lou | 1 |
| Hamp-Lyons, Liz | 1 |
| Joseph P. Vitta | 1 |
| Knoch, Ute | 1 |
| Krajenta, Marilyn | 1 |
| Nunan, Anna | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 8 |
| Reports - Research | 5 |
| Information Analyses | 2 |
| Reports - Descriptive | 2 |
| Reports - Evaluative | 2 |
| Speeches/Meeting Papers | 2 |
| Tests/Questionnaires | 2 |
| Dissertations/Theses -… | 1 |
| Opinion Papers | 1 |
Education Level
| Higher Education | 4 |
| Postsecondary Education | 4 |
| Secondary Education | 1 |
Audience
Location
| Japan | 2 |
| China | 1 |
| Ireland (Dublin) | 1 |
| Turkey (Istanbul) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Test of English as a Foreign… | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Doosti, Mehdi; Ahmadi Safa, Mohammad – International Journal of Language Testing, 2021
This study examined the effect of rater training on promoting inter-rater reliability in oral language assessment. It also investigated whether rater training and the consideration of the examinees' expectations by the examiners have any effect on test-takers' perceptions of being fairly evaluated. To this end, four raters scored 31 Iranian…
Descriptors: Oral Language, Language Tests, Interrater Reliability, Training
Derek N. Canning; Stuart McLean; Joseph P. Vitta – Vocabulary Learning and Instruction, 2022
The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…
Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills
Yalçin-Çolakoglu, Özlem; Selçuk, Merve – Advances in Language and Literary Studies, 2019
Criterion referenced tests of second language speaking performance are administered in different institutions using different procedures. The present study reports raters' practices of second language speaking tests, in particular the correspondence between test-takers' grades when assessed individually and in groups. Data derived from…
Descriptors: Oral Language, Language Tests, Test Validity, Inferences
Bogorevich, Valeriia – ProQuest LLC, 2018
Rater variation in performance assessment can impact test-takers' scores and compromise assessments' fairness and validity (Crooks, Kane, & Cohen, 1996). Rater variation can also undermine a test's validity and fairness; therefore, it is important to investigate raters' scoring patterns in order to inform rater training. Substantial work has…
Descriptors: Pronunciation, Familiarity, English (Second Language), Second Language Learning
Nunan, Anna – Language Learning in Higher Education, 2014
The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…
Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards
Zhao, Zhongbao – RELC Journal: A Journal of Language Teaching and Research, 2013
This study investigates the validity of the Diagnostic College English Speaking Test (DCEST) in the context of EFL teaching and learning in China. The experiment was conducted in three stages over the course of eight weeks at a national key university in China. By means of test administration and questionnaire survey, the researcher gathered…
Descriptors: Oral Language, Construct Validity, Language Tests, Diagnostic Tests
Knoch, Ute; Elder, Catherine – System: An International Journal of Educational Technology and Applied Linguistics, 2010
A number of scholars have questioned the practice of assessing academic writing in the context of a one-off language test, claiming that the time restrictions imposed in the test environment, when compared to the writing conditions typical at university, may prevent learners from displaying the kinds of writing skills required in academic…
Descriptors: Writing Tests, Language Tests, Test Validity, Interrater Reliability
Salies, Tania Gastao – 1998
A discussion of the evaluation of writing, particularly in English as a Second Language, argues for a communicative approach reflecting the current approach to language teaching and learning. The movement toward more communication-oriented and more valid language testing is examined briefly, and direct assessment is chosen as the preferred format…
Descriptors: Communicative Competence (Languages), English (Second Language), Evaluation Criteria, Foreign Countries
Hamp-Lyons, Liz – 1987
A study examined the scoring procedure for the second part of the modular section of the English Language Testing Service (ELTS) academic writing test. The scoring is done by external raters according to procedures and a scale specified for the test, resulting in a performance profile. The report chronicles the development of the procedures and…
Descriptors: English for Academic Purposes, English (Second Language), Graduate Students, Higher Education
Peer reviewedTurner, Jean – Annual Review of Applied Linguistics, 1998
This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…
Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews
Strong, Gregory – Thought Currents in English Literature, 1995
This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning…
Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries
Ferroli, Lou; Krajenta, Marilyn – 1993
The creation and validation of a Spanish version of an English developmental spelling test (DST) is described. An introductory section reviews related literature on the rationale for and construction of DSTs, spelling development in the early grades, and Spanish-English bilingual education. Differences between the English and Spanish test versions…
Descriptors: Comparative Analysis, Elementary School Students, English, Grade 1

Direct link
