ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	10
Since 2007 (last 20 years)	29

Descriptor

Interrater Reliability	45
Simulation	45
Comparative Analysis	12
Correlation	11
Evaluators	10
Evaluation Methods	8
Rating Scales	7
Scores	7
Scoring	7
Statistical Analysis	7
Student Evaluation	7
Accuracy	6
Medical Education	6
Patients	6
Foreign Countries	5
Higher Education	5
Peer Evaluation	5
Test Reliability	5
Validity	5
Check Lists	4
Clinical Experience	4
Competence	4
Measurement	4
Models	4
Monte Carlo Methods	4
More ▼

Publication Type

Journal Articles	36
Reports - Research	31
Reports - Evaluative	8
Speeches/Meeting Papers	4
Dissertations/Theses -…	2
Reports - Descriptive	2
Tests/Questionnaires	2
Non-Print Media	1
Reference Materials - General	1

Education Level

Higher Education	11
Postsecondary Education	8
Secondary Education	2
Adult Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 10	1
Grade 4	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Administrators	1
Practitioners	1
Researchers	1
Teachers	1

Location

Iran	2
Singapore	1
United Kingdom	1
Washington	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

SAT (College Admission Test)

What Works Clearinghouse Rating

Showing 1 to 15 of 45 results Save | Export

The Precision and Bias of Cut Score Estimates from the Beuk Standard Setting Method

Peer reviewed

Direct link

Joseph H. Grochowalski; Lei Wan; Lauren Molin; Amy H. Hendrickson – Journal of Educational Measurement, 2025

The Beuk standard setting method derives cut scores through expert judgment that balances content and normative perspectives. This study developed a method to estimate confidence intervals for Beuk settings and assessed their accuracy via simulations. Simulations varied SME panel size, expert agreement, cut score locations, score distributions,…

Descriptors: Cutting Scores, Standard Setting, Accuracy, Statistical Bias

Interactional Competencies in Medical Student Admission -- What Makes a "Good Medical Doctor"?

Peer reviewed

Direct link

Leonie Fleck; Dorothee Amelung; Anna Fuchs; Benjamin Mayer; Malvin Escher; Lena Listunova; Jobst-Hendrik Schultz; Andreas Möltner; Clara Schütte; Tim Wittenberg; Isabella Schneider; Sabine C. Herpertz – Advances in Health Sciences Education, 2025

Doctors' interactional competencies play a crucial role in patient satisfaction, well-being, and compliance. Accordingly, it is in medical schools' interest to select candidates with strong interactional abilities. While Multiple Mini Interviews (MMIs) provide a useful context to assess such abilities, the evaluation of candidate performance…

Descriptors: Medical Students, Medical Schools, College Admission, Admission Criteria

Assessing Inter-Rater Reliability with Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables

Peer reviewed

Direct link

Martinková, Patrícia; Bartoš, František; Brabec, Marek – Journal of Educational and Behavioral Statistics, 2023

Inter-rater reliability (IRR), which is a prerequisite of high-quality ratings and assessments, may be affected by contextual variables, such as the rater's or ratee's gender, major, or experience. Identification of such heterogeneity sources in IRR is important for the implementation of policies with the potential to decrease measurement error…

Descriptors: Interrater Reliability, Bayesian Statistics, Statistical Inference, Hierarchical Linear Modeling

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

Validation of Objective Structured Clinical Examination (OSCE) Based on the Occupational Therapy Practice Framework (OTPF): A Pilot Study

Peer reviewed
PDF on ERIC

Download full text

Marzieh Pashmdarfard; Afsoon Hassani Mehraban; Narges Shafaroodi; Kamran Soltani Arabshahi; Soroor Parvizy; Akram Azad; Samaneh Karamali Esmaeili – Journal of Occupational Therapy Education, 2022

Fieldwork education is an integral part of the educational process in occupational therapy and assessing student competency at the end of fieldwork is important. The aim of this study was to design and conduct an Objective Structured Clinical Examination (OSCE) based on the Occupational Therapy Practice Framework (OTPF) for occupational therapy…

Descriptors: Occupational Therapy, Allied Health Occupations Education, Test Construction, Test Validity

Metrics for Discrete Student Models: Chance Levels, Comparisons, and Use Cases

Peer reviewed
PDF on ERIC

Download full text

Bosch, Nigel; Paquette, Luc – Journal of Learning Analytics, 2018

Metrics including Cohen's kappa, precision, recall, and F[subscript 1] are common measures of performance for models of discrete student states, such as a student's affect or behaviour. This study examined discrete model metrics for previously published student model examples to identify situations where metrics provided differing perspectives on…

Descriptors: Models, Comparative Analysis, Prediction, Probability

Using Clinical Simulation to Assess MSW Students' Engagement Skills

Peer reviewed

Direct link

Sacristan, Dolly; Martinez, Colleen D. – Journal of Teaching in Social Work, 2023

Social work educators are compelled to use reliable and valid methods to assess student learning outcomes. This study adapted a clinical simulation by integrating traditional role-play of case scenarios and elements of the Objective Structured Clinical Examination, which is often used to assess students' practice skills. Master of Social Work…

Descriptors: Graduate Students, Counselor Training, Masters Programs, Clinical Experience

Exploring Differences in Measurement and Reporting of Classroom Observation Inter-Rater Reliability

Peer reviewed
PDF on ERIC

Download full text

Wilhelm, Anne Garrison; Gillespie Rouse, Amy; Jones, Francesca – Practical Assessment, Research & Evaluation, 2018

Although inter-rater reliability is an important aspect of using observational instruments, it has received little theoretical attention. In this article, we offer some guidance for practitioners and consumers of classroom observations so that they can make decisions about inter-rater reliability, both for study design and in the reporting of data…

Descriptors: Interrater Reliability, Measurement, Observation, Educational Research

Applying Kane's Validity Framework to a Simulation Based Assessment of Clinical Competence

Peer reviewed

Direct link

Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018

Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…

Descriptors: Competence, Simulation, Allied Health Personnel, Certification

The Impact of Rater Variability on Relationships among Different Effect-Size Indices for Inter-Rater Agreement between Human and Automated Essay Scoring

Direct link

Yun, Jiyeo – ProQuest LLC, 2017

Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…

Descriptors: Interrater Reliability, Essays, Scoring, Evaluators

Item Response Theory for Peer Assessment

Peer reviewed

Direct link

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

Estimating Item Difficulty with Comparative Judgments. Research Report. ETS RR-14-39

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014

Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for themost part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…

Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations

How Do Raters Judge Spoken Vocabulary?

Peer reviewed
PDF on ERIC

Download full text

Li, Hui – English Language Teaching, 2016

The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…

Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making

Stimulated Recall Interviews for Describing Pragmatic Epistemology

Peer reviewed

Direct link

Shubert, Christopher W.; Meredith, Dawn C. – Physical Review Special Topics - Physics Education Research, 2015

Students' epistemologies affect how and what they learn: do they believe physics is a list of equations, or a coherent and sensible description of the physical world? In order to study these epistemologies as part of curricular assessment, we adopt the resources framework, which posits that students have many productive epistemological resources…

Descriptors: Epistemology, Recall (Psychology), Physics, Educational Environment

The Performance of Standardized Patients in Portraying Clinical Scenarios in Speech-Language Therapy

Peer reviewed

Direct link

Hill, Anne E.; Davidson, Bronwyn J.; Theodoros, Deborah G. – International Journal of Language & Communication Disorders, 2013

Background: Standardized patients (SPs) are frequently included in the clinical preparation of students in the health sciences. An acknowledged benefit of using SPs is the opportunity to provide a standardized method by which students can demonstrate and develop their competency. Relatively little is known, however, about the capacity of SPs to…

Descriptors: Speech Language Pathology, Therapy, Patients, Simulation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Advances in Health Sciences…	3
ETS Research Report Series	2
ProQuest LLC	2
Academic Medicine	1
Academic Psychiatry	1
Assessment & Evaluation in…	1
Athletic Training Education…	1
Cognitive Science	1
College Board	1
Contemporary Educational…	1
Discourse Processes: A…	1
Educational Measurement:…	1
English Language Teaching	1
Human Resource Development…	1
IEEE Transactions on Learning…	1
International Journal of…	1
International Journal of…	1
Journal of Academic Medicine	1
Journal of Continuing…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Learning Analytics	1
Journal of Nutrition…	1
Journal of Occupational…	1
Journal of Professional…	1
More ▼

Adamson, Katie Anne	1
Afsoon Hassani Mehraban	1
Akram Azad	1
Alaeddini, F.	1
Allison, Meredith	1
Amy H. Hendrickson	1
Andreas Möltner	1
Ang-Aw, Hui Teng	1
Anna Fuchs	1
Arbabi, Mohammad	1
Armstrong, Kirk J.	1
Ato, Manuel	1
Attali, Yigal	1
Bartfay, Emma	1
Bartoš, František	1
Benavente, Ana	1
Benjamin Mayer	1
Blanchard, P. Nick	1
Bosch, Nigel	1
Boulet, John R.	1
Brabec, Marek	1
Breyer, F. Jay	1
Brimacombe, C. A. Elizabeth	1
Brull, Harry	1
More ▼