Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Pope, David A.; Mick, James P. – String Research Journal, 2021
The purpose of this study was to examine the assigned ratings, interrater reliability, and possible influences of school level and instrumentation on adjudicators' evaluations of orchestra performances at a national-level adjudicated music festival. Data consisted of the overall ratings assigned to orchestra performances (N = 55) at the 2017,…
Descriptors: Interrater Reliability, Musical Instruments, Musicians, Music Activities
Zahn, Daniela; Canton, Ursula; Boyd, Victoria; Hamilton, Laura; Mamo, Josianne; McKay, Jane; Proudfoot, Linda; Telfer, Dickson; Williams, Kim; Wilson, Colin – Studies in Higher Education, 2021
Evaluating the impact of Academic Literacies teaching (Lea and Street [1998. "Student Writing in Higher Education: An Academic Literacies Approach." "Studies in Higher Education" 23 (2): 157-72. doi:10.1080/03075079812331380364]) is difficult, as it involves gauging whether writers: (1) gain better understanding of what…
Descriptors: Writing Evaluation, Evaluation Methods, Undergraduate Students, Foreign Countries
Duvall, Steven F.; Fox, Ashley M.; Meeks, Courtney G. – American Journal of Distance Education, 2022
Following the pandemic-related school shutdowns in spring 2020, direct observations continued to be a necessary component of special education evaluations even when students were not present at school. As students began learning at home instead of in classrooms, the continued need for observational data likely compelled most educators to use video…
Descriptors: Interrater Reliability, Distance Education, Observation, COVID-19
Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025
Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…
Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7
Brittany Grey; Marren C. Brooks; Emily A. Lund; Krystal L. Werfel – Language, Speech, and Hearing Services in Schools, 2025
Purpose: This study examined the internal consistency reliability, interrater reliability, and concurrent validity of the norm-referenced Test of Early Written Language--Third Edition (TEWL-3) to determine if it is an appropriate measure to use when determining if elementary children who are deaf and hard of hearing (DHH) meet grade-level writing…
Descriptors: Hard of Hearing, Sensory Aids, Writing Improvement, Writing Instruction
Nicolas Petit; Flavia Mengarelli; Marie-Maude Geoffray Cassar; Giorgio Arcara; Valentina Bambini – Journal of Speech, Language, and Hearing Research, 2025
Purpose: This study aims (a) to assess the psychometric properties of a French adaptation of the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS-Fr), a comprehensive test of pragmatic abilities for French-speaking adolescents and adults, and (b) to use it to study lifespan variations in pragmatic abilities, to determine when…
Descriptors: Pragmatics, Cognitive Ability, Language Skills, Cognitive Measurement
Øydis Hide; Dagrun Slettebø Daltveit; Åse Sivertsen; Anne Katherine Hvistendahl; Randi Lovise Kjerstad; Marit Berntsen Kvinnsland; Nina Helen Pedersen; Christina Sørensen – International Journal of Language & Communication Disorders, 2025
Background: Cleft lip and palate (CLP) treatment in Norway is centralized and multidisciplinary, with long-term follow-up from birth to adulthood. The Norwegian Registry of Cleft Lip and Palate was established to ensure high-quality care and enable systematic data collection. Speech data are a key component, assessed by speech--language therapists…
Descriptors: Foreign Countries, Validity, Reliability, Data Collection
Van Elsen, Joris; Faddar, Jerich; Appels, Lies; De Maeyer, Sven; Vanhoof, Jan; Van Petegem, Peter – School Effectiveness and School Improvement, 2023
In order to support research on school effectiveness, there is a need for valid and reliable instruments to assess policymaking capacities of schools. Increasingly, policymaking is seen as a shared responsibility of the entire pedagogical team of a school. In this article, data were analysed from a sample of 1,696 (care) teachers coordinators and…
Descriptors: Educational Policy, Policy Formation, Questionnaires, School Effectiveness
Karen N. Sommers – ProQuest LLC, 2023
The Resident Assistant (RA) position is the foundational role in the student affairs staffing structure. The RA job description has expanded exponentially since the earliest iterations (Boone et al., 2016). The selection of RAs presents unique challenges because it is costly, requires a lot of staff, typically draws a large candidate pool, and…
Descriptors: Resident Advisers, College Students, Student Personnel Services, Personnel Selection
Samantha B. Godoy – ProQuest LLC, 2024
The process of conducting child and adolescent psychoeducational assessments has changed over the past 2 decades (Shapiro & Heick, 2004). In the past, the school psychologists commonly concentrated on behavioral, achievement, and projective assessments and usually did not include systematic multi-rater observation rating scales of behavior.…
Descriptors: Interrater Reliability, Parent Teacher Cooperation, Psychoeducational Methods, Psychological Evaluation
On-Soon Lee – Journal of Pan-Pacific Association of Applied Linguistics, 2024
Despite the increasing interest in using AI tools as assistant agents in instructional settings, the effectiveness of ChatGPT, the generative pretrained AI, for evaluating the accuracy of second language (L2) writing has been largely unexplored in formative assessment. Therefore, the current study aims to examine how ChatGPT, as an evaluator,…
Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning
Joshi, Ashwini; Baheti, Isha; Angadi, Vrushali – Journal of Speech, Language, and Hearing Research, 2020
Aim: The purpose of this study was to develop and assess the reliability of a Hindi version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Reliability was assessed by comparing Hindi CAPE-V ratings with English CAPE-V ratings and by the Grade, Roughness, Breathiness, Asthenia and Strain (GRBAS) scale. Method: Hindi sentences…
Descriptors: Test Construction, Indo European Languages, Test Reliability, Voice Disorders
Hicks, Nathan M. – ProQuest LLC, 2020
Grades serve as one of the primary indicators of student learning, directing subsequent actions for students, instructors, and administrators, alike. Therefore, grade validity--that is, the extent to which grades communicate a meaningful and credible representation of what they purport to measure--is of utmost importance. However, a grade cannot…
Descriptors: Grading, Scoring Rubrics, Interrater Reliability, Test Validity
Markelz, Andrew M.; Riden, Benjamin S.; Zoder-Martell, Kimberly A.; Miller, Joseph E.; Bolinger, Sarah J. – Journal of Positive Behavior Interventions, 2021
Supported by decades of research on praise and its effect on student behaviors, we developed the Behavior-Specific Praise--Observation Tool (BSP-OT) to measure characteristics of effective praise. We evaluated interrater reliability of the BSP-OT to measure praise specificity, contingency, and variety using intraclass correlation (ICC) and Cohen's…
Descriptors: Test Reliability, Classroom Observation Techniques, Positive Reinforcement, Interrater Reliability
Patel, Priya; Lee, Seungmin; Myers, Nicholas D.; Lee, Mei-Hua – Journal of Motor Learning and Development, 2021
Missing data incidents are common in experimental studies of motor learning and development. Inadequate handling of missing data may lead to serious problems, such as addition of bias, reduction in power, and so on. Thus, this study aimed to conduct a systematic review of the past (2007) and present (2017) practices used for reporting and…
Descriptors: Motor Development, Research Reports, Periodicals, Research Methodology

