ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	9

Descriptor

Correlation	11
Interrater Reliability	11
Simulation	11
Comparative Analysis	5
Scores	4
Scoring	4
Statistical Analysis	4
College Students	3
Evaluators	3
Language Tests	3
Clinical Experience	2
Computer Software	2
Counselor Client Relationship	2
Counselor Training	2
Error Patterns	2
Essays	2
Evaluation Methods	2
Foreign Countries	2
Generalization	2
Item Analysis	2
Rating Scales	2
Second Language Learning	2
Student Evaluation	2
Test Items	2
Allied Health Personnel	1
More ▼

Source

ETS Research Report Series	2
Advances in Health Sciences…	1
Athletic Training Education…	1
Contemporary Educational…	1
English Language Teaching	1
Journal of Nutrition…	1
Journal of Teaching in Social…	1
Language Testing	1
Practical Assessment,…	1
ProQuest LLC	1

Publication Type

Journal Articles	10
Reports - Research	8
Reports - Evaluative	2
Dissertations/Theses -…	1
Tests/Questionnaires	1

Education Level

Higher Education	5
Postsecondary Education	4

Audience

Location

United Kingdom

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Using Clinical Simulation to Assess MSW Students' Engagement Skills

Peer reviewed

Direct link

Sacristan, Dolly; Martinez, Colleen D. – Journal of Teaching in Social Work, 2023

Social work educators are compelled to use reliable and valid methods to assess student learning outcomes. This study adapted a clinical simulation by integrating traditional role-play of case scenarios and elements of the Objective Structured Clinical Examination, which is often used to assess students' practice skills. Master of Social Work…

Descriptors: Graduate Students, Counselor Training, Masters Programs, Clinical Experience

Exploring Differences in Measurement and Reporting of Classroom Observation Inter-Rater Reliability

Peer reviewed
PDF on ERIC

Download full text

Wilhelm, Anne Garrison; Gillespie Rouse, Amy; Jones, Francesca – Practical Assessment, Research & Evaluation, 2018

Although inter-rater reliability is an important aspect of using observational instruments, it has received little theoretical attention. In this article, we offer some guidance for practitioners and consumers of classroom observations so that they can make decisions about inter-rater reliability, both for study design and in the reporting of data…

Descriptors: Interrater Reliability, Measurement, Observation, Educational Research

Applying Kane's Validity Framework to a Simulation Based Assessment of Clinical Competence

Peer reviewed

Direct link

Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018

Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…

Descriptors: Competence, Simulation, Allied Health Personnel, Certification

The Impact of Rater Variability on Relationships among Different Effect-Size Indices for Inter-Rater Agreement between Human and Automated Essay Scoring

Direct link

Yun, Jiyeo – ProQuest LLC, 2017

Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…

Descriptors: Interrater Reliability, Essays, Scoring, Evaluators

Estimating Item Difficulty with Comparative Judgments. Research Report. ETS RR-14-39

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014

Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for themost part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…

Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations

How Do Raters Judge Spoken Vocabulary?

Peer reviewed
PDF on ERIC

Download full text

Li, Hui – English Language Teaching, 2016

The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…

Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making

Standardized Patients Provide a Reliable Assessment of Athletic Training Students' Clinical Skills

Peer reviewed

Direct link

Armstrong, Kirk J.; Jarriel, Amanda J. – Athletic Training Education Journal, 2016

Context: Providing students reliable objective feedback regarding their clinical performance is of great value for ongoing clinical skill assessment. Since a standardized patient (SP) is trained to consistently portray the case, students can be assessed and receive immediate feedback within the same clinical encounter; however, no research, to our…

Descriptors: Patients, Athletics, Simulation, Outcome Measures

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

Evaluation of the FOCUS (Feedback on Counseling Using Simulation) Instrument for Assessment of Client-Centered Nutrition Counseling Behaviors

Peer reviewed

Direct link

Henry, Beverly W.; Smith, Thomas J. – Journal of Nutrition Education and Behavior, 2010

Objective: To develop an instrument to assess client-centered counseling behaviors (skills) of student-counselors in a standardized patient (SP) exercise. Methods: Descriptive study of the accuracy and utility of a newly developed counseling evaluation instrument. Study participants included 11 female student-counselors at a Midwestern…

Descriptors: Feedback (Response), Generalizability Theory, Nutrition, Diseases

Accounting for Nonsystematic Error in Performance Ratings.

Peer reviewed

Henning, Grant – Language Testing, 1996

Analyzes simulated performance ratings on a six-point scale by two independent raters to account for nonsystematic error in performance ratings. Results suggest that rater agreement or covariance is not always a dependable estimate of score reliability and that the practice of seeking additional raters for adjudication of discrepant ratings is not…

Descriptors: Correlation, Error Patterns, Interrater Reliability, Language Tests

An Experimental Investigation of the Beliefs-of-Relatedness Source of Halo.

Peer reviewed

Suter, W. Newton; Roberts, William L. – Contemporary Educational Psychology, 1987

This study examined halo in raters' beliefs of item (attribute) relatedness. College students' prior beliefs of the co-occurrence of teaching attributes were correlated with actual correlation of teaching attributes of fictional college professors. Results showed some support for beliefs-of-relatedness source of halo. (LMO)

Descriptors: College Students, Correlation, Error of Measurement, Higher Education

Armstrong, Kirk J.	1
Attali, Yigal	1
Breyer, F. Jay	1
Brydges, Ryan	1
Gillespie Rouse, Amy	1
Henning, Grant	1
Henry, Beverly W.	1
Huiskamp, Maud	1
Jackson, Carol	1
Jarriel, Amanda J.	1
Jones, Francesca	1
Li, Hui	1
Lorenz, Florian	1
Martinez, Colleen D.	1
Myre, Paul	1
Prpic, Jason	1
Roberts, William L.	1
Sacristan, Dolly	1
Saldivia, Luis	1
Schuppan, Fred	1
Smith, Thomas J.	1
Suter, W. Newton	1
Tavares, Walter	1
Turner, Linda	1
Wanamaker, Wilbur	1
More ▼