ERIC - Search Results

Showing 1 to 15 of 22 results

Rater Judgments and Word Difficulty: Conceptualizing the Substantive Validity of the VST

Peer reviewed

Derek N. Canning; Stuart McLean; Joseph P. Vitta – Vocabulary Learning and Instruction, 2022

The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…

Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills

Screening out Chinese-English Biliterate Kindergarten Children with Handwriting Difficulties

Peer reviewed

Tse, Linda Fung Ling; Siu, Andrew Man Hong; Li-Tsang, Cecilia Wai Ping – Journal of Occupational Therapy, Schools & Early Intervention, 2018

Aims: This study aimed to (1) develop and validate the Chinese and English Handwriting Screening Test for Kindergarten Children (CHEST) to screen children for handwriting difficulties in their final year of kindergarten education in Hong Kong, and to (2) identify common types of problems encountered by those children before their formal primary…

Descriptors: Literacy, Bilingualism, Kindergarten, Content Validity

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Assessing Individual and Group Oral Exams: Scoring Criteria and Rater Interaction

Peer reviewed

Yalçin-Çolakoglu, Özlem; Selçuk, Merve – Advances in Language and Literary Studies, 2019

Criterion referenced tests of second language speaking performance are administered in different institutions using different procedures. The present study reports raters' practices of second language speaking tests, in particular the correspondence between test-takers' grades when assessed individually and in groups. Data derived from…

Descriptors: Oral Language, Language Tests, Test Validity, Inferences

Age, Task Characteristics, and Acoustic Indicators of Engagement: Investigations into the Validity of a Technology-Enhanced Speaking Test for Young Language Learners

Edward Paul Getman – Online Submission, 2020

Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement

Adaptation and Assessment of a Public Speaking Rating Scale

Peer reviewed

Iberri-Shea, Gina – Cogent Education, 2017

Prominent spoken language assessments such as the Oral Proficiency Interview and the Test of Spoken English have been primarily concerned with speaking ability as it relates to conversation. This paper looks at an additional aspect of spoken language ability, namely public speaking. This study used an adapted form of a public speaking rating scale…

Descriptors: Public Speaking, Rating Scales, Adoption (Ideas), English Instruction

Native and Non-Native Raters of L2 Speaking Performance: Accent Familiarity and Cognitive Processes

Bogorevich, Valeriia – ProQuest LLC, 2018

Rater variation in performance assessment can impact test-takers' scores and compromise assessments' fairness and validity (Crooks, Kane, & Cohen, 1996). Rater variation can also undermine a test's validity and fairness; therefore, it is important to investigate raters' scoring patterns in order to inform rater training. Substantial work has…

Descriptors: Pronunciation, Familiarity, English (Second Language), Second Language Learning

Assessing Learners' Writing Skills in a SLA Study: Validating the Rating Process across Tasks, Scales and Languages

Peer reviewed

Huhta, Ari; Alanen, Riikka; Tarnanen, Mirja; Martin, Maisa; Hirvelä, Tuija – Language Testing, 2014

There is still relatively little research on how well the CEFR and similar holistic scales work when they are used to rate L2 texts. Using both multifaceted Rasch analyses and qualitative data from rater comments and interviews, the ratings obtained by using a CEFR-based writing scale and the Finnish National Core Curriculum scale for L2 writing…

Descriptors: Foreign Countries, Writing Skills, Second Language Learning, Finno Ugric Languages

Standardising Assessment to Meet Student Needs in Foreign Language Modules in a University Context: Is Standardisation Possible?

Peer reviewed

Nunan, Anna – Language Learning in Higher Education, 2014

The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…

Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards

Diagnosing the English Speaking Ability of College Students in China -- Validation of the Diagnostic College English Speaking Test

Zhao, Zhongbao – RELC Journal: A Journal of Language Teaching and Research, 2013

This study investigates the validity of the Diagnostic College English Speaking Test (DCEST) in the context of EFL teaching and learning in China. The experiment was conducted in three stages over the course of eight weeks at a national key university in China. By means of test administration and questionnaire survey, the researcher gathered…

Descriptors: Oral Language, Construct Validity, Language Tests, Diagnostic Tests

Validity and Fairness Implications of Varying Time Conditions on a Diagnostic Test of Academic English Writing Proficiency

Peer reviewed

Knoch, Ute; Elder, Catherine – System: An International Journal of Educational Technology and Applied Linguistics, 2010

A number of scholars have questioned the practice of assessing academic writing in the context of a one-off language test, claiming that the time restrictions imposed in the test environment, when compared to the writing conditions typical at university, may prevent learners from displaying the kinds of writing skills required in academic…

Descriptors: Writing Tests, Language Tests, Test Validity, Interrater Reliability

Prompt and Rater Effects in Second Language Writing Performance Assessment

Lim, Gad S. – ProQuest LLC, 2009

Performance assessments have become the norm for evaluating language learners' writing abilities in international examinations of English proficiency. Two aspects of these assessments are usually systematically varied: test takers respond to different prompts, and their responses are read by different raters. This raises the possibility of undue…

Descriptors: Performance Based Assessment, Language Tests, Performance Tests, Test Validity

Exploring the Role of Teacher Quality in Predicting Reading Outcomes for First-Grade English Learners: An Observational Study

Peer reviewed

Gersten, Russell; Baker, Scott K.; Haager, Diane; Graves, Anne W. – Remedial & Special Education, 2005

The first portion of this article describes the development and validation of a classroom observation measure. The goal of the measure was to assess the quality of reading instruction provided to first-grade English learners. We report the internal consistency reliability, interrater reliability, the development of empirically derived subscales,…

Descriptors: Second Language Learning, English (Second Language), Reading Instruction, Teacher Effectiveness

Testing the Language Proficiency of Bilingual Teachers: Arizona's Spanish Proficiency Test.

Peer reviewed

Grant, Leslie – Language Testing, 1997

Describes current procedures used for testing bilingual teachers in the United States and focuses on one means of assessment used in Arizona. Examinee questionnaire responses, teacher questionnaire responses and test section analysis all contributed evidence for validity. (33 references) (Author/CK)

Descriptors: Bilingualism, Criterion Referenced Tests, Interrater Reliability, Language Teachers

Reading Proficiency Assessment and the ILR/ACTFL Typology: A Reevaluation.

Peer reviewed

Edwards, Alison L. – Modern Language Journal, 1996

Examined the validity of the pragmatic approach to test difficulty put forward by Child (1987). This study investigated whether the Child discourse-type hierarchy predicts text difficulty for second-language readers. Results suggested that this hierarchy may provide a sound basis for developing foreign-language tests when it is applied by trained…

Descriptors: Adult Students, Analysis of Variance, French, Interrater Reliability
