Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Interrater Reliability | 8 |
| Statistical Analysis | 8 |
| Writing Tests | 8 |
| Scores | 3 |
| Scoring | 3 |
| College Students | 2 |
| Correlation | 2 |
| English (Second Language) | 2 |
| Essay Tests | 2 |
| Evaluation Criteria | 2 |
| Item Response Theory | 2 |
Author
| Wind, Stefanie A. | 2 |
| Allen, Nancy | 1 |
| Bennett, Randy Elliot | 1 |
| Braswell, James | 1 |
| Guastello, E. Francine | 1 |
| Horkay, Nancy | 1 |
| Kaplan, Bruce | 1 |
| Kayapinar, Ulas | 1 |
| Lenz, Claire | 1 |
| Liu, Ou Lydia | 1 |
| Nieto, Eloísa | 1 |
Publication Type
| Reports - Research | 8 |
| Journal Articles | 7 |
| Numerical/Quantitative Data | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 2 |
| Elementary Education | 1 |
| Grade 10 | 1 |
| Grade 4 | 1 |
| High Schools | 1 |
| Intermediate Grades | 1 |
| Postsecondary Education | 1 |
| Secondary Education | 1 |
Assessments and Surveys
| ACT Assessment | 1 |
| SAT (College Admission Test) | 1 |
Wind, Stefanie A. – Language Testing, 2019
Differences in rater judgments that are systematically related to construct-irrelevant characteristics threaten the fairness of rater-mediated writing assessments. Accordingly, it is essential that researchers and practitioners examine the degree to which the psychometric quality of rater judgments is comparable across test-taker subgroups.…
Descriptors: Nonparametric Statistics, Interrater Reliability, Differences, Writing Tests
Wind, Stefanie A.; Patil, Yogendra J. – Educational and Psychological Measurement, 2018
Recent research has explored the use of models adapted from Mokken scale analysis as a nonparametric approach to evaluating rating quality in educational performance assessments. A potential limiting factor to the widespread use of these techniques is the requirement for complete data, as practical constraints in operational assessment systems…
Descriptors: Scaling, Data, Interrater Reliability, Writing Tests
Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017
Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…
Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment
Prieto, Gerardo; Nieto, Eloísa – Psicologica: International Journal of Methodology and Experimental Psychology, 2014
This paper describes how a Many Faceted Rasch Measurement (MFRM) approach can be applied to performance assessment focusing on rater analysis. The article provides an introduction to MFRM, a description of MFRM analysis procedures, and an example to illustrate how to examine the effects of various sources of variability on test takers' performance…
Descriptors: Item Response Theory, Interrater Reliability, Rating Scales, Error of Measurement
Kayapinar, Ulas – Eurasian Journal of Educational Research, 2014
Problem Statement: There have been many attempts to research the effective assessment of writing ability, and many proposals for how this might be done. In this sense, rater reliability plays a crucial role for making vital decisions about testees in different turning points of both educational and professional life. Intra-rater and inter-rater…
Descriptors: Interrater Reliability, Essay Tests, Writing Tests, Grading
Wang, Ping – English Language Teaching, 2009
This paper makes a study of the rater reliability in scoring composition in the test of English as a foreign language (EFL) and focuses on the inter-rater reliability as well as several interactions between raters and the other facets involved (that is examinees, rating criteria and rating methods). Results showed that raters were fairly…
Descriptors: Interrater Reliability, Scoring, Writing (Composition), English (Second Language)
Sandene, Brent; Horkay, Nancy; Bennett, Randy Elliot; Allen, Nancy; Braswell, James; Kaplan, Bruce; Oranje, Andreas – National Center for Education Statistics, 2005
This publication presents the reports from two studies, Math Online (MOL) and Writing Online (WOL), part of the National Assessment of Educational Progress (NAEP) Technology-Based Assessment (TBA) project. Funded by the National Center for Education Statistics (NCES), the Technology-Based Assessment project is intended to explore the use of new…
Descriptors: Grade 8, Statistical Analysis, Scoring, Familiarity
Guastello, E. Francine; Lenz, Claire – Language and Literacy Spectrum, 2004
This study examined the effects of parental training on students' writing scores. Six classes of fourth grade students from three schools were randomly assigned to three experimental and three control groups. Parents of the students in the experimental group attended training sessions and received instruction in the stages of the writing process…
Descriptors: Writing Improvement, Parent Participation, Experimental Groups, Writing Processes

