ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	5

Descriptor

Interrater Reliability	13
Test Reliability	13
Writing Tests	13
Test Validity	7
Scoring	6
English (Second Language)	5
College Students	4
Foreign Countries	4
Language Tests	4
Scores	4
Second Language Learning	4
Writing Evaluation	4
Computer Assisted Testing	3
Generalizability Theory	3
Performance Based Assessment	3
Scoring Rubrics	3
Test Construction	3
Correlation	2
Elementary Secondary Education	2
Grammar	2
Standardized Tests	2
Writing Ability	2
Writing Skills	2
Academic Discourse	1
Accuracy	1
More ▼

Source

Assessing Writing	1
ETS Research Report Series	1
Educational Testing Service	1
European Journal of…	1
Journal of Deaf Studies and…	1
Michigan Reading Journal	1
Online Submission	1
Phi Delta Kappan	1
ProQuest LLC	1
Turkish Online Journal of…	1

Publication Type

Journal Articles	8
Reports - Research	6
Reports - Evaluative	4
Speeches/Meeting Papers	3
Reports - Descriptive	2
Tests/Questionnaires	2
Dissertations/Theses -…	1

Education Level

Higher Education	3
Postsecondary Education	2

Audience

Location

Turkey	3
Australia	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
International English…	1
National Assessment of…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Reliability of the Analytic Rubric and Checklist for the Assessment of Story Writing Skills: G and Decision Study in Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Uzun, N. Bilge; Alici, Devrim; Aktas, Mehtap – European Journal of Educational Research, 2019

The purpose of study is to examine the reliability of analytical rubrics and checklists developed for the assessment of story writing skills by means of generalizability theory. The study group consisted of 52 students attending the 5th grade at primary school and 20 raters in Mersin University. The G study was carried out with the fully crossed…

Descriptors: Foreign Countries, Scoring Rubrics, Check Lists, Writing Tests

Development and Validation of the Written Communication Assessment of the "HEIghten"® Outcomes Assessment Suite. Research Report. ETS RR-17-53

Peer reviewed
PDF on ERIC

Download full text

Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017

Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…

Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment

Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

Download full text

Haberman, Shelby J. – Educational Testing Service, 2011

Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

Descriptors: Writing Tests, Scoring, Essays, Language Tests

Bringing Reading-to-Write and Writing-Only Assessment Tasks Together: A Generalizability Analysis

Peer reviewed

Direct link

Gebril, Atta – Assessing Writing, 2010

Integrated tasks are currently employed in a number of L2 exams since they are perceived as an addition to the writing-only task type. Given this trend, the current study investigates composite score generalizability of both reading-to-write and writing-only tasks. For this purpose, a multivariate generalizability analysis is used to investigate…

Descriptors: Scoring, Scores, Second Language Instruction, Writing Evaluation

Prompt and Rater Effects in Second Language Writing Performance Assessment

Direct link

Lim, Gad S. – ProQuest LLC, 2009

Performance assessments have become the norm for evaluating language learners' writing abilities in international examinations of English proficiency. Two aspects of these assessments are usually systematically varied: test takers respond to different prompts, and their responses are read by different raters. This raises the possibility of undue…

Descriptors: Performance Based Assessment, Language Tests, Performance Tests, Test Validity

Assessing the Writing of Deaf College Students: Reevaluating a Direct Assessment of Writing

Peer reviewed

Direct link

Schley, Sara; Albertini, John – Journal of Deaf Studies and Deaf Education, 2005

The NTID Writing Test was developed to assess the writing ability of postsecondary deaf students entering the National Technical Institute for the Deaf and to determine their appropriate placement into developmental writing courses. While previous research (Albertini et al., 1986; Albertini et al., 1996; Bochner, Albertini, Samar, & Metz, 1992)…

Descriptors: Deafness, Writing Ability, Writing Tests, College Students

Is the MEAP Writing Test Reliable? A Case Study.

Peer reviewed

Anderson, Stephen A. – Michigan Reading Journal, 2002

Considers the development of an inter-rater reliability correlation comparing the judgments, or scores, or each judge to see if their observations are similar. Presents a case study of the Northville Public Schools' data for the 2000 MEAP (Michigan Educational Assessment Program) Writing Test. Concludes that in this case study the state fails both…

Descriptors: Case Studies, Elementary Education, Evaluation Research, Interrater Reliability

Writing to the Rubric: Lingering Effects of Traditional Standardized Testing on Direct Writing Assessment.

Mabry, Linda – Phi Delta Kappan, 1999

Education remains heavily shackled by punitive, test-driven reform. Despite reasonable alternatives, testing increasingly drives educational accountability and reform. Standardization of direct writing assessments promotes scoring reliability and facilitates educational comparisons and rankings. However, standardized writing is not good writing,…

Descriptors: Elementary Secondary Education, Interrater Reliability, Performance Based Assessment, Scoring Rubrics

The Effect of Computers on the Test and Inter-Rater Reliability of Writing Tests of ESL Learners

Download full text

Aydin, Selami – Online Submission, 2006

This research aimed to investigate the effect of computers on the test and inter-rater reliability of writing test scores of ESL learners. Writing samples of 20 pen-paper and 20 computer group students were scored in analytic scoring method by two scorers, and then the scores were analyzed in Alpha (Cronbach) model. The results showed that the…

Descriptors: Writing Tests, Interrater Reliability, Test Reliability, English (Second Language)

A Discussion of Analytic Scoring for Writing Performance Assessments.

Download full text

Crehan, Kevin D. – 1997

Writing fits well within the realm of outcomes suitable for observation by performance assessments. Studies of the reliability of performance assessments have suggested that interrater reliability can be consistently high. Scoring consistency, however, is only one aspect of quality in decisions based on assessment results. Another is…

Descriptors: Evaluation Methods, Feedback, Generalizability Theory, Interrater Reliability

The Effect of Computers on the Test and Inter-Rater Reliability of Writing Tests of ESL Learners

Peer reviewed
PDF on ERIC

Download full text

Aydin, Selami – Turkish Online Journal of Educational Technology - TOJET, 2006

Descriptors: Foreign Countries, College Students, Computer Assisted Testing, English (Second Language)

Characteristics of the Test Components of the IELTS Battery: Australian Trial Data.

Download full text

Griffin, Patrick – 1990

Results of the International English Language Testing System (IELTS) battery trials in Australia are reported. The IELTS tests of productive language skills use direct assessment strategies and subjective scoring according to detailed guidelines. The receptive skills tests use indirect assessment strategies and clerical scoring procedures.…

Descriptors: English (Second Language), Foreign Countries, Grammar, Interrater Reliability

Reliability of Professionally Scored Data: NAEP-Related Issues.

Kaplan, Bruce A.; Johnson, Eugene G. – 1992

Across the field of educational assessment the case has been made for alternatives to the multiple-choice item type. Most of the alternative types of items require a subjective evaluation by a rater. The reliability of this subjective rating is a key component of these types of alternative items. In this paper, measures of reliability are…

Descriptors: Educational Assessment, Elementary Secondary Education, Estimation (Mathematics), Evaluators

Aydin, Selami	2
Aktas, Mehtap	1
Albertini, John	1
Alici, Devrim	1
Anderson, Stephen A.	1
Crehan, Kevin D.	1
Gebril, Atta	1
Griffin, Patrick	1
Haberman, Shelby J.	1
Johnson, Eugene G.	1
Kaplan, Bruce A.	1
Lim, Gad S.	1
Liu, Ou Lydia	1
Mabry, Linda	1
Rios, Joseph A.	1
Schley, Sara	1
Sparks, Jesse R.	1
Uzun, N. Bilge	1
Zhang, Mo	1
More ▼