Showing all 11 results
Peer reviewed
PDF on ERIC
Sumner, Josh – Research-publishing.net, 2021
Comparative Judgement (CJ) has emerged as a technique that typically makes use of holistic judgement to assess difficult-to-specify constructs such as production (speaking and writing) in Modern Foreign Languages (MFL). In traditional approaches, markers assess candidates' work one-by-one in an absolute manner, assigning scores to different…
Descriptors: Holistic Approach, Student Evaluation, Comparative Analysis, Decision Making
Peer reviewed
PDF on ERIC
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M. – ETS Research Report Series, 2015
Automated scoring models were trained and evaluated for the essay task in the "Praxis I"® writing test. Prompt-specific and generic "e-rater"® scoring models were built, and evaluation statistics, such as quadratic weighted kappa, Pearson correlation, and standardized differences in mean scores, were examined to evaluate the…
Descriptors: Writing Tests, Licensing Examinations (Professions), Teacher Competency Testing, Scoring
Peer reviewed
Direct link
Attali, Yigal – Language Testing, 2016
A short training program for evaluating responses to an essay writing task consisted of scoring 20 training essays with immediate feedback about the correct score. The same scoring session also served as a certification test for trainees. Participants with little or no previous rating experience completed this session and 14 trainees who passed an…
Descriptors: Writing Evaluation, Writing Tests, Standardized Tests, Evaluators
Peer reviewed
PDF on ERIC
Heidari, Jamshid; Khodabandeh, Farzaneh; Soleimani, Hassan – JALT CALL Journal, 2018
The emergence of computer technology in English language teaching has paved the way for teachers' application of Mobile-Assisted Language Learning (MALL) and its advantages in teaching. This study aimed to compare the effectiveness of face-to-face instruction with Telegram mobile instruction. Based on a TOEFL test, 60 English foreign language…
Descriptors: Comparative Analysis, Conventional Instruction, Teaching Methods, Computer Assisted Instruction
Peer reviewed
PDF on ERIC
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Peer reviewed
Direct link
Barkaoui, Khaled – Assessment in Education: Principles, Policy & Practice, 2011
This study examined the effects of marking method and rater experience on ESL (English as a Second Language) essay test scores and rater performance. Each of 31 novice and 29 experienced raters rated a sample of ESL essays both holistically and analytically. Essay scores were analysed using a multi-faceted Rasch model to compare test-takers'…
Descriptors: Writing Evaluation, Writing Tests, Essay Tests, Interrater Reliability
Peer reviewed
Direct link
Baker, Beverly A. – Assessing Writing, 2010
In high-stakes writing assessments, rater training in the use of a rating scale does not eliminate variability in grade attribution. This realisation has been accompanied by research that explores possible sources of rater variability, such as rater background or rating scale type. However, there has been little consideration thus far of…
Descriptors: Foreign Countries, Writing Evaluation, Writing Tests, Testing
Peer reviewed
Direct link
Coniam, David – ReCALL, 2009
This paper describes a study of the computer essay-scoring program BETSY. While the use of computers in rating written scripts has been criticised in some quarters for lacking transparency or lack of fit with how human raters rate written scripts, a number of essay rating programs are available commercially, many of which claim to offer comparable…
Descriptors: Writing Tests, Scoring, Foreign Countries, Interrater Reliability
Peer reviewed
Direct link
Knoch, Ute; Elder, Catherine – System: An International Journal of Educational Technology and Applied Linguistics, 2010
A number of scholars have questioned the practice of assessing academic writing in the context of a one-off language test, claiming that the time restrictions imposed in the test environment, when compared to the writing conditions typical at university, may prevent learners from displaying the kinds of writing skills required in academic…
Descriptors: Writing Tests, Language Tests, Test Validity, Interrater Reliability
Isonio, Steven – 1991
At Golden West College (California), student writing samples are holistically scored by pairs of judges on a six-point scale. Judges are allowed to use plus and minus signs, thus converting the integer scale to a decimal scale of evaluation. In 1991, 499 writing samples written as part of the placement testing process for students in the Coast…
Descriptors: Community Colleges, Comparative Analysis, Correlation, Evaluation Methods
Peer reviewed
Direct link
Lee, H. K. – Assessing Writing, 2004
This study aimed to comprehensively investigate the impact of a word-processor on an ESL writing assessment, covering comparison of inter-rater reliability, the quality of written products, the writing process across different testing occasions using different writing media, and students' perception of a computer-delivered test. Writing samples of…
Descriptors: Writing Evaluation, Student Attitudes, Writing Tests, Testing