ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	7

Descriptor

Interrater Reliability	12
Second Language Instruction	12
Test Validity	12
Language Tests	10
English (Second Language)	8
Second Language Learning	8
Foreign Countries	6
Test Reliability	5
Oral Language	4
Test Construction	4
Testing	4
College Students	3
Evaluators	3
Language Proficiency	3
Scoring	3
Student Attitudes	3
Writing Evaluation	3
Communicative Competence…	2
Comparative Analysis	2
Construct Validity	2
Diagnostic Tests	2
Difficulty Level	2
English for Academic Purposes	2
Evaluation Criteria	2
Interviews	2
More ▼

Source

Advances in Language and…	1
Annual Review of Applied…	1
International Journal of…	1
Language Learning in Higher…	1
ProQuest LLC	1
RELC Journal: A Journal of…	1
System: An International…	1
Thought Currents in English…	1
Vocabulary Learning and…	1

Author

Ahmadi Safa, Mohammad	1
Bogorevich, Valeriia	1
Derek N. Canning	1
Doosti, Mehdi	1
Elder, Catherine	1
Ferroli, Lou	1
Hamp-Lyons, Liz	1
Joseph P. Vitta	1
Knoch, Ute	1
Krajenta, Marilyn	1
Nunan, Anna	1
Salies, Tania Gastao	1
Selçuk, Merve	1
Strong, Gregory	1
Stuart McLean	1
Turner, Jean	1
Yalçin-Çolakoglu, Özlem	1
Zhao, Zhongbao	1
More ▼

Publication Type

Journal Articles	8
Reports - Research	5
Information Analyses	2
Reports - Descriptive	2
Reports - Evaluative	2
Speeches/Meeting Papers	2
Tests/Questionnaires	2
Dissertations/Theses -…	1
Opinion Papers	1

Education Level

Higher Education	4
Postsecondary Education	4
Secondary Education	1

Audience

Location

Japan	2
China	1
Ireland (Dublin)	1
Turkey (Istanbul)	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Fairness in Oral Language Assessment: Training Raters and Considering Examinees' Expectations

Peer reviewed
PDF on ERIC

Download full text

Doosti, Mehdi; Ahmadi Safa, Mohammad – International Journal of Language Testing, 2021

This study examined the effect of rater training on promoting inter-rater reliability in oral language assessment. It also investigated whether rater training and the consideration of the examinees' expectations by the examiners have any effect on test-takers' perceptions of being fairly evaluated. To this end, four raters scored 31 Iranian…

Descriptors: Oral Language, Language Tests, Interrater Reliability, Training

Rater Judgments and Word Difficulty: Conceptualizing the Substantive Validity of the VST

Peer reviewed
PDF on ERIC

Download full text

Derek N. Canning; Stuart McLean; Joseph P. Vitta – Vocabulary Learning and Instruction, 2022

The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…

Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills

Assessing Individual and Group Oral Exams: Scoring Criteria and Rater Interaction

Peer reviewed
PDF on ERIC

Download full text

Yalçin-Çolakoglu, Özlem; Selçuk, Merve – Advances in Language and Literary Studies, 2019

Criterion referenced tests of second language speaking performance are administered in different institutions using different procedures. The present study reports raters' practices of second language speaking tests, in particular the correspondence between test-takers' grades when assessed individually and in groups. Data derived from…

Descriptors: Oral Language, Language Tests, Test Validity, Inferences

Native and Non-Native Raters of L2 Speaking Performance: Accent Familiarity and Cognitive Processes

Direct link

Bogorevich, Valeriia – ProQuest LLC, 2018

Rater variation in performance assessment can impact test-takers' scores and compromise assessments' fairness and validity (Crooks, Kane, & Cohen, 1996). Rater variation can also undermine a test's validity and fairness; therefore, it is important to investigate raters' scoring patterns in order to inform rater training. Substantial work has…

Descriptors: Pronunciation, Familiarity, English (Second Language), Second Language Learning

Standardising Assessment to Meet Student Needs in Foreign Language Modules in a University Context: Is Standardisation Possible?

Peer reviewed

Direct link

Nunan, Anna – Language Learning in Higher Education, 2014

The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…

Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards

Diagnosing the English Speaking Ability of College Students in China -- Validation of the Diagnostic College English Speaking Test

Direct link

Zhao, Zhongbao – RELC Journal: A Journal of Language Teaching and Research, 2013

This study investigates the validity of the Diagnostic College English Speaking Test (DCEST) in the context of EFL teaching and learning in China. The experiment was conducted in three stages over the course of eight weeks at a national key university in China. By means of test administration and questionnaire survey, the researcher gathered…

Descriptors: Oral Language, Construct Validity, Language Tests, Diagnostic Tests

Validity and Fairness Implications of Varying Time Conditions on a Diagnostic Test of Academic English Writing Proficiency

Peer reviewed

Direct link

Knoch, Ute; Elder, Catherine – System: An International Journal of Educational Technology and Applied Linguistics, 2010

A number of scholars have questioned the practice of assessing academic writing in the context of a one-off language test, claiming that the time restrictions imposed in the test environment, when compared to the writing conditions typical at university, may prevent learners from displaying the kinds of writing skills required in academic…

Descriptors: Writing Tests, Language Tests, Test Validity, Interrater Reliability

Towards Communicative Measurement of Writing: Where Are We Now?

Download full text

Salies, Tania Gastao – 1998

A discussion of the evaluation of writing, particularly in English as a Second Language, argues for a communicative approach reflecting the current approach to language teaching and learning. The movement toward more communication-oriented and more valid language testing is examined briefly, and direct assessment is chosen as the preferred format…

Descriptors: Communicative Competence (Languages), English (Second Language), Evaluation Criteria, Foreign Countries

Performance Profiles for Academic Writing.

Download full text

Hamp-Lyons, Liz – 1987

A study examined the scoring procedure for the second part of the modular section of the English Language Testing Service (ELTS) academic writing test. The scoring is done by external raters according to procedures and a scale specified for the test, resulting in a performance profile. The report chronicles the development of the procedures and…

Descriptors: English for Academic Purposes, English (Second Language), Graduate Students, Higher Education

Assessing Speaking.

Peer reviewed

Turner, Jean – Annual Review of Applied Linguistics, 1998

This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…

Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews

A Survey of Issues and Item Writing in Language Testing.

Download full text

Strong, Gregory – Thought Currents in English Literature, 1995

This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning…

Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries

Validating a Spanish Developmental Spelling Test.

Download full text

Ferroli, Lou; Krajenta, Marilyn – 1993

The creation and validation of a Spanish version of an English developmental spelling test (DST) is described. An introductory section reviews related literature on the rationale for and construction of DSTs, spelling development in the early grades, and Spanish-English bilingual education. Differences between the English and Spanish test versions…

Descriptors: Comparative Analysis, Elementary School Students, English, Grade 1