Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Gyamfi, George; Hanna, Barbara E.; Khosravi, Hassan – Assessment & Evaluation in Higher Education, 2022
Rubrics have been suggested as a means to foster students' evaluative judgement, the capacity to appraise their own work and that of others; however, empirical evidence of rubrics' effectiveness is still emerging. This paper contributes findings from a randomised controlled experiment on the effect of rubrics on evaluative judgement. Participants…
Descriptors: Scoring Rubrics, Evaluative Thinking, Peer Evaluation, Undergraduate Students
Purwadi; Saputra, Wahyu N. E.; Handaka, Irvan B.; Barida, Muya; Wahyudi, Amien; Widyastuti, Dian A.; Agungbudiprabowo; Rodhiya, Zaenab A. – Pegem Journal of Education and Instruction, 2022
This study aims to identify the acceptability and effectiveness of peace guidance based on the perspective of Markesot. This model seeks to reduce student aggressiveness. This study uses the research and development stages by adapting the Borg & Gall model. The participants of this study were 275 students who were taken randomly. The study…
Descriptors: Peace, Guidance, Models, Interrater Reliability
Bodfish, James W.; Lecavalier, Luc; Harrop, Clare; Dallman, Aaron; Kalburgi, Sahana Nagabhushan; Hollway, Jill; Faldowski, Richard; Boyd, Brian A. – Journal of Autism and Developmental Disorders, 2022
For individuals with autism spectrum disorder (ASD), behavioral inflexibility can affect multiple domains of functioning and family life. The objective of this study was to develop and validate a clinical interview version of the Behavioral Inflexibility Scale. Trained interviewers conducted interviews with parents of 144 children with ASD and 70…
Descriptors: Children, Autism, Pervasive Developmental Disorders, Child Behavior
Roessger, Kevin M. – Adult Learning, 2020
Practitioners often struggle to assess reflective learning in the workplace because of difficulties conceptualizing reflection and its effects in the workplace. This article addresses this problem by offering a pragmatic approach to assessment that asks practitioners to specify why they are using reflection, what they are hoping to gain from it,…
Descriptors: Workplace Learning, Evaluation Methods, Reflection, Adult Education
Kaila L. Stipancic; Mojgan Golzy; Yunxin Zhao; Louise Pinkerton; Andrea Rohl; Mili Kuruvilla-Dugdale – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Auditory training has been shown to reduce rater variability in perceptual voice assessment. Because rater variability is also a central issue in the auditory-perceptual assessment of dysarthria, this study sought to determine if training produces a meaningful change in rater reliability, criterion validity, and scaling magnitude of four…
Descriptors: Auditory Training, Auditory Perception, Program Effectiveness, Speech Impairments
Kelly Little; Yongyue Qi; Vanessa D. Jewell – Journal of Occupational Therapy Education, 2023
The Occupation-Centered Intervention Assessment (OCIA) was developed as a reflective tool for students to improve their comprehension of occupation-centered practice. Finding new and innovative ways to incorporate occupation-centered assignments can serve as a strategy to develop student integration of occupation-centered practice and allow…
Descriptors: Occupational Therapy, Allied Health Occupations Education, Interrater Reliability, Intervention
Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025
As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…
Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy
Peter Iserbyt; Jackie Lund – Journal of Teaching in Physical Education, 2025
Purpose: The purpose of the study was to investigate the instructional alignment of unit and lesson plans in physical education. Methods: Unit and lesson plans of 31 student teachers from one Physical Education Teacher Education program were analyzed. Trained coders assessed the quality and alignment of unit goals and lesson outcomes, assessments,…
Descriptors: Student Teachers, Physical Education, Physical Education Teachers, Units of Study
Office of Inspector General, US Department of Education, 2025
The U.S. Department of Education (Department) allocates funds to States through statutory formulas based primarily on census poverty estimates and the cost of education in each State. To receive funding, a State plan that includes a description of its accountability system must be submitted to the Department for review and approval. For the…
Descriptors: Elementary Secondary Education, Educational Legislation, Federal Legislation, Accountability
King-Dow Su – Journal of Baltic Science Education, 2024
Building 21st-century life science skills requires educating participants according to STEM abilities. Therefore, this research aimed to examine the effectiveness and feasibility of the STEM ability assessment framework in the practical learning environment. The study uses STEM coffee preparation experiential activity with a Royal Belgian siphon…
Descriptors: STEM Education, Content Validity, Instructional Effectiveness, Interrater Reliability
King, Pete; Atkins, LaDonna; Burr, Brandon – Journal of Early Childhood Research, 2021
The Play Cycle Observation Method (PCOM) is an observational tool developed to focus on the process of play and has shown good reliability when watching videos of children playing. This study piloted use of the PCOM in 'real time' in a pre-school setting where 3-year-old children play. The results from two independent observers not familiar with…
Descriptors: Pilot Projects, Play, Observation, Video Technology
Alighieri, Cassandra; Bettens, Kim; Bruneel, Laura; D'haeseleer, Evelien; Van Gaever, Ellen; Van Lierde, Kristiane – Journal of Speech, Language, and Hearing Research, 2021
Purpose: This study compared the inter- and intrarater reliability of the percentage of consonants correct (PCC) metrics and the probe scoring system between an experienced and a less experienced rater and between two experienced raters. In addition, these outcome measures' ability to reflect changes following speech intervention was measured.…
Descriptors: Congenital Impairments, Speech, Intervention, Interrater Reliability
Thomas, Anne E.; Ambrose, Sophie E.; Marvin, Christine A.; Oleson, Jacob; Moeller, Mary Pat – Journal of Speech, Language, and Hearing Research, 2021
Purpose: Parent report was compared to judgments made by a trained researcher to determine the utility of the Vocal Development Landmarks Interview (VDLI) for monitoring development of vocal behaviors in very young children. Method: Parents of 40 typically developing children, ages 6-21 months, provided full-day naturalistic audio recordings of…
Descriptors: Parents, Researchers, Interrater Reliability, Interviews
Pereira, Valerie J.; Tuomainen, Jyrki; Lee, Kathy Y. S.; Tong, Michael C. F.; Sell, Debbie A. – International Journal of Language & Communication Disorders, 2021
Background: The status of the velopharyngeal mechanism can be inferred from perceptual ratings of specified speech parameters. Several studies have proposed the measure of an overall velopharyngeal composite score based on these perceptual ratings and have reported good validity. The Cleft Audit Protocol for Speech--Augmented (CAPS-A) is a…
Descriptors: Congenital Impairments, Speech Tests, Outcome Measures, Test Validity
Bhatnagar, Ruchi; Tanguay, Carla L.; Sullivan, Caroline; Many, Joyce E. – Georgia Educational Researcher, 2021
Most teacher education assessments are criticized for lacking validity and reliability. This study describes the process of developing the Observation of Field Performance rubric to assess initial teacher candidates' classroom performance and establishing the content validity as well as reliability of the rubric. A panel of content area experts…
Descriptors: Scoring Rubrics, Observation, Content Validity, Interrater Reliability

Peer reviewed
Direct link
