NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 645 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Stefan K. Schauber; Anne O. Olsen; Erik L. Werner; Morten Magelssen – Advances in Health Sciences Education, 2024
Introduction: Research in various areas indicates that expert judgment can be highly inconsistent. However, expert judgment is indispensable in many contexts. In medical education, experts often function as examiners in rater-based assessments. Here, disagreement between examiners can have far-reaching consequences. The literature suggests that…
Descriptors: Medical Students, Performance Based Assessment, Expertise, Interrater Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Samuel D'Emanuele; Francesca Nardello; Fabrizio Garau; Diego Campaci; Federico Schena; Cantor Tarperi – Measurement in Physical Education and Exercise Science, 2025
The agreement between a wearable inertial sensor (GYKO, G) and the force platform (P) was assessed by evaluating "test-retest" and "inter-rater reliability." Thirty-eight subjects were enrolled; the selected indices of balance were investigated over foot positions and (un)stable conditions. Intraclass correlation coefficient…
Descriptors: Human Posture, Measurement Equipment, Interrater Reliability, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Jennifer Manning; Jeffrey Baldwin; Natasha Powell – Innovations in Education and Teaching International, 2025
As ChatGPT continues to reshape student engagement and instructional design, it is crucial to examine its practical implications. This study aims to evaluate the effectiveness of ChatGPT3.5 and ChatGPT4 as potential automated essay scoring (AES) systems. Fifty authentic, student-written annotated bibliographies were evaluated by three human raters…
Descriptors: Foreign Countries, Essays, Writing Evaluation, Artificial Intelligence
Peer reviewed Peer reviewed
Direct linkDirect link
Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024
This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…
Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Suzanna Dooley; Tammy Hopper; Rachael Doyle; Orla Gilheaney; Margaret Walshe – International Journal of Language & Communication Disorders, 2025
Background: Individuals with dementia have communication limitations resulting from cognitive impairments that define the syndrome. Whereas there are numerous cognitive assessments for individuals with dementia, there are far fewer communication assessments. The Profiling Communication Ability in Dementia (P-CAD) was developed to address this gap.…
Descriptors: Communication Skills, Communication Problems, Dementia, Intellectual Disability
Peer reviewed Peer reviewed
Direct linkDirect link
Duong Thi Ngoc Ngan; Maria Hercz – Asia-Pacific Education Researcher, 2024
As there is a paucity of instrument investigating a hybrid teaching conception, the current study is seen as part of attempt to fill this gap. The subjects in the study were 310 University participants--instructors in Socialist Republic of Viet Nam (Vietnam). The survey was implemented with the use of Cognitive Constructivism-oriented Teaching…
Descriptors: Blended Learning, Faculty, Teaching Methods, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Elizabeth Murray; Shelley Velleman; Jonathan L. Preston; Robert Heard; Akhila Shibu; Patricia McCabe – Journal of Speech, Language, and Hearing Research, 2024
Purpose: The current standard for clinical diagnosis of childhood apraxia of speech (CAS) is expert clinician judgment. The psychometric properties of this standard are not well understood; however, they are important for improving clinical diagnosis. The purpose of this study is to determine the extent to which experts agree on the clinical…
Descriptors: Neurological Impairments, Speech Impairments, Preschool Children, Adolescents
Peer reviewed Peer reviewed
Direct linkDirect link
Hulteen, Ryan M.; True, Larissa; Kroc, Edward – Measurement in Physical Education and Exercise Science, 2023
The typical process for assessing inter-rater reliability is facilitated by training raters within a research team. Lacking is an understanding if inter-rater reliability scores "between" research teams demonstrate adequate reliability. This study examined inter-rater reliability between 16 researchers who assessed fundamental motor…
Descriptors: Psychomotor Skills, Scores, Reliability, Interrater Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Li Wang; Xin Qi; Ziyan Meng; Meiyu Xiang; Zhuoqing Li; Sitong Zhang; Longyun Hu; Hoyee W. Hirai; Carol K. S. To; Patrick C. M. Wong – Journal of Speech, Language, and Hearing Research, 2025
Purpose: Assessing social communication and measuring its changes among young autistic children presents significant challenges, particularly when tracking intervention effects within short timeframes. Existing measures, mostly validated in Western contexts, may not be suitable for culturally diverse populations. Addressing this gap, the Social…
Descriptors: Autism Spectrum Disorders, Preschool Children, Interpersonal Communication, Communication Skills
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Erica Munnik; Prenita Reddi; Mario R. Smith – South African Journal of Childhood Education, 2025
Background: The need for the adaptation of instruments to other native languages to promote culture-fairness framed this study. Aim: This article reports on the adaptation of the locally developed Emotional Social Screening Tool for School Readiness (E3SR-R) into isiXhosa. Setting: This adaptation study was conducted in South Africa. Methods: The…
Descriptors: Screening Tests, Emotional Development, Social Development, School Readiness
Peer reviewed Peer reviewed
Direct linkDirect link
Aislinn Ganci; Miran Qazizada; Brianna Fehr; Ana Vucenovic; Edmond Lou; Eric Parent – Measurement in Physical Education and Exercise Science, 2024
Spinal alignment can be assessed without radiation using three-dimensional ultrasound imaging (3DUS). Reliable measurements could inform the ideal arm position for scoliosis radiographs. This study determined the inter-evaluator reliability of axial vertebral rotation (AVR) measurements and sagittal curve angles in healthy females from 3DUS spinal…
Descriptors: Foreign Countries, Young Adults, Adults, Adolescents
Peer reviewed Peer reviewed
Direct linkDirect link
Liz Jackson; Michael W. Apple; Fei Yan; Jason Cong Lin; Chenxi Jiang; Tongzhou Li; Edward Vickers – Educational Philosophy and Theory, 2024
In this collective essay the authors consider the nature and consequences of reading and researching across difference in an international and intergenerational team, whose core members are focused on understanding how curriculum operates and the nature of textbook representation of diversity in Mainland China, Hong Kong, Taiwan, and Macau.…
Descriptors: Foreign Countries, Textbooks, Reading Research, Educational Research
Peer reviewed Peer reviewed
Direct linkDirect link
Pouls, Claudia; Jeandarme, Inge – Journal of Mental Health Research in Intellectual Disabilities, 2023
Background: The ARMIDILO-S is advocated as a promising tool for assessing dynamic risk factors in sex offenders with intellectual disabilities (SOIDs). However, research remains scarce. The present study aimed to further validate this instrument in SOIDs. Method: The study prospectively followed 38 SOIDs for up to one year to test the accuracy of…
Descriptors: Test Reliability, Test Validity, Sexual Abuse, Criminals
Peer reviewed Peer reviewed
Direct linkDirect link
Ilona Rinne – Assessment & Evaluation in Higher Education, 2024
It is widely acknowledged in research that common criteria and aligned standards do not result in consistent assessment of such a complex performance as the final undergraduate thesis. Assessment is determined by examiners' understanding of rubrics and their views on thesis quality. There is still a gap in the research literature about how…
Descriptors: Foreign Countries, Undergraduate Students, Teacher Education Programs, Evaluation Criteria
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yang Yang – Shanlax International Journal of Education, 2024
This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…
Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  43