Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 8 |
Descriptor
| Evaluators | 10 |
| Interrater Reliability | 10 |
| Simulation | 10 |
| Scores | 5 |
| Scoring | 4 |
| Comparative Analysis | 3 |
| Correlation | 3 |
| Decision Making | 3 |
| Essays | 3 |
| Foreign Countries | 3 |
| Performance Based Assessment | 3 |
Source
| College Board | 1 |
| ETS Research Report Series | 1 |
| Educational Measurement:… | 1 |
| English Language Teaching | 1 |
| Journal of Continuing… | 1 |
| Journal of Research on… | 1 |
| ProQuest LLC | 1 |
| RELC Journal: A Journal of… | 1 |
Author
| Ang-Aw, Hui Teng | 1 |
| Breyer, F. Jay | 1 |
| Brull, Harry | 1 |
| Conley, Patrick | 1 |
| DeCarlo, Lawrence T. | 1 |
| DiazGranados, Deborah | 1 |
| Feldman, Moshe | 1 |
| Fulmer, Gavin W. | 1 |
| Goh, Christine Chuen Meng | 1 |
| Jegerski, Jane | 1 |
| Kaiser, Paul D. | 1 |
Publication Type
| Journal Articles | 6 |
| Reports - Research | 5 |
| Reports - Evaluative | 2 |
| Dissertations/Theses -… | 1 |
| Non-Print Media | 1 |
| Reference Materials - General | 1 |
| Reports - Descriptive | 1 |
| Speeches/Meeting Papers | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Postsecondary Education | 3 |
| Higher Education | 2 |
| Adult Education | 1 |
| Elementary Education | 1 |
| Grade 4 | 1 |
| Grade 8 | 1 |
| Intermediate Grades | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Location
| Singapore | 1 |
| United Kingdom | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| SAT (College Admission Test) | 1 |
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Yun, Jiyeo – ProQuest LLC, 2017
Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…
Descriptors: Interrater Reliability, Essays, Scoring, Evaluators
Li, Hui – English Language Teaching, 2016
The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making
Polikoff, Morgan S.; Fulmer, Gavin W. – Journal of Research on Educational Effectiveness, 2013
The alignment among standards, assessments, and teachers' instruction is an essential element of standards-based educational reforms. The Surveys of Enacted Curriculum (SEC) is the only common tool that can be used to measure the alignment among all three of these sources (Martone & Sireci, 2009). Prior SEC alignment work has been limited by…
Descriptors: Alignment (Education), Academic Standards, Educational Assessment, Instruction
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Ang-Aw, Hui Teng; Goh, Christine Chuen Meng – RELC Journal: A Journal of Language Teaching and Research, 2011
The oral examination is an important component of the high-stakes "O" Level examination in Singapore taken by 16-17 year olds whose first language may or may not be English. In spite of this, there has been sparse research into the examination. This paper reports findings of an exploratory study which attempted to determine whether there…
Descriptors: Protocol Analysis, Rating Scales, Examiners, Foreign Countries
Feldman, Moshe; Lazzara, Elizabeth H.; Vanderbilt, Allison A.; DiazGranados, Deborah – Journal of Continuing Education in the Health Professions, 2012
Competency-based assessment and an emphasis on obtaining higher-level outcomes that reflect physicians' ability to demonstrate their skills has created a need for more advanced assessment practices. Simulation-based assessments provide medical education planners with tools to better evaluate the 6 Accreditation Council for Graduate Medical…
Descriptors: Performance Based Assessment, Physicians, Accuracy, High Stakes Tests
DeCarlo, Lawrence T.; Kim, YoungKoung – College Board, 2008
[Slides] presented at the American Educational Research Association (AERA) Conference in New York in March 2008. This presentation explores what cues are used as a deciding factor in essay scoring by the essay grader.
Descriptors: Essays, Grading, Evaluation Criteria, Scoring Rubrics
Kaiser, Paul D.; Brull, Harry – 1994
The design, administration, scoring, and results of the 1993 New York State Correctional Captain Examination are described. The examination was administered to 405 candidates. As in previous Sergeant and Lieutenant examinations, candidates also completed latent image written simulation problems and open/closed book multiple choice test components.…
Descriptors: Competitive Selection, Correctional Rehabilitation, Decision Making, Educational Innovation
Conley, Patrick; Jegerski, Jane – 1991
Construction of a work sample test, the Investigator Planning Exercise (IPE), for the job of detective in the Chicago (Illinois) Police Department is described. Simulated crime scenarios, a mock crime scene, and five checklists of necessary skills (i.e., ability to summarize and communicate facts, identify inconsistencies, and determine the next…
Descriptors: Check Lists, Deduction, Evaluators, Interrater Reliability

