Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Geronikou, Eleftheria; Vance, Maggie; Wells, Bill; Thomson, Jenny – Child Language Teaching and Therapy, 2019
Intervention with children with speech and language difficulties has been proven beneficial compared with no treatment, yet knowing what type of intervention to provide remains a challenge. Studies of English-speaking children indicate that intervention targeting the production of morphological targets may have a positive effect on phonological…
Descriptors: Greek, Males, Speech Impairments, Intervention
Nagy, Ede; Wehmeyer, Meike; Gaese, Franziska; Nicolai, Elisabeth; Schweitzer-Rothers, Jochen – Journal of Mental Health Research in Intellectual Disabilities, 2019
Introduction: This article describes the development of an Aggression and Restriction Observation Checklist (AROC) for use in residential and in-patient services for adults with intellectual disabilities (ID). The AROC was developed in collaboration between researchers and frontline staff. It assesses self-, person-, and object-directed aggressive…
Descriptors: Adults, Intellectual Disability, Check Lists, Aggression
Westergård, Elsa; Ertesvåg, Sigrun K.; Rafaelsen, Frank – Scandinavian Journal of Educational Research, 2019
Observations seem particularly susceptible to rater error due to the level of subjectivity involved in assessment. Thus, the present paper aims to investigate: (1) inter-rater agreement (IRA) using the Classroom Assessment Scoring System -- Secondary version (CLASS-S) and (2) the CLASS-S factor structure in a Norwegian context. Inter-rater…
Descriptors: Foreign Countries, Classroom Environment, Classroom Observation Techniques, Secondary School Students
Huckabee, Maggie-Lee; McIntosh, Theresa; Fuller, Laura; Curry, Morgan; Thomas, Paige; Walshe, Margaret; McCague, Ellen; Battel, Irene; Nogueira, Dalia; Frank, Ulrike; van den Engel-Hoek, Lenie; Sella-Weiss, Oshrat – International Journal of Language & Communication Disorders, 2018
Background: Clinical swallowing assessment is largely limited to qualitative assessment of behavioural observations. There are limited quantitative data that can be compared with a healthy population for identification of impairment. The Test of Masticating and Swallowing Solids (TOMASS) was developed as a quantitative assessment of solid bolus…
Descriptors: Medical Evaluation, Clinical Diagnosis, Motor Reactions, Reliability
Cengiz, Cemre; Aylar, Ebru; Yildiz, Esengül – International Electronic Journal of Elementary Education, 2018
This paper investigated the intuitive development of the concept of integers among primary school students. In order to reveal whether primary school students had an intuitive sense of integers, an assessment consisting of five questions was prepared and administered to a total of 100 4th-grade students. A variety of integer concepts were utilized in the…
Descriptors: Intuition, Elementary School Students, Grade 4, Mathematics Instruction
Walkington, Candace; Marder, Michael – ZDM: The International Journal on Mathematics Education, 2018
The UTeach Observation Protocol (UTOP) was designed to inform STEM teacher education. The instrument has been used in prior studies examining inter-rater reliability and relationships to teacher value-added scores. However, prior work has not shown examples of how rating with the UTOP works in practice nor has it discussed the instrument's…
Descriptors: Classroom Observation Techniques, Mathematics Instruction, Instructional Effectiveness, STEM Education
Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018
The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…
Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing
Clayson, Dennis E. – Assessment & Evaluation in Higher Education, 2018
The student evaluation of teaching process is generally thought to produce reliable results. The consistency is found within class and instructor averages, while a considerable amount of inconsistency exists with individual student responses. This paper reviews these issues along with a detailed examination of common measures of reliability that…
Descriptors: Student Evaluation of Teacher Performance, Reliability, Validity, Evaluation Criteria
Becraft, Jessica L.; Borrero, John C.; Davis, Barbara J.; Mendres-Smith, Amber E. – Education and Treatment of Children, 2016
The current study was designed to evaluate a rotating momentary time sampling (MTS) data collection system. A rotating MTS system has been used to measure activity preferences of preschoolers but not to collect data on responses that vary in duration and frequency (e.g., talking). We collected data on talking for 10 preschoolers using a 5-s MTS…
Descriptors: Sampling, Time, Interrater Reliability, Data Collection
Harris, Yvette R. – Sage Research Methods Cases, 2016
The goal of this case study was to introduce students to ways to conduct research on parent child cognitive learning interactions. To this end, the case study begins with an overview of the theoretical and empirical work supporting the development of my research program on parent child cognitive learning interaction research and continues with a…
Descriptors: Student Research, Parent Child Relationship, Interaction, Sampling
Richard, Veronique; Aubertin, Patrice; Yang, Yan Yun; Kriellaars, Dean – Creativity Research Journal, 2020
Few assessment tools have been designed to assess motor creativity, and the existing tools have limitations. To bridge this gap, the current study aimed at designing a new movement creativity assessment tool that considers the unique features underlying the expression of creativity through movement. A modified Delphi technique was used to collect…
Descriptors: Play, Creativity, Delphi Technique, Factor Analysis
Earle, Sarah – Primary Science, 2017
Moderation is put forward as the key strategy for improving the reliability of teacher assessment. However, for many teachers the word "moderation" conjures up ideas of uncomfortable situations in which marking is being checked by others and there are prolonged arguments about tiny features of individual work. In this article, the…
Descriptors: Grading, Interrater Reliability, Faculty Development, Professional Continuing Education
Irby, Sarah M.; Floyd, Randy G. – Psychology in the Schools, 2017
This study examined the exchangeability of total scores (i.e., intelligence quotients [IQs]) from three brief intelligence tests. Tests were administered to 36 children with intellectual giftedness, scored live by one set of primary examiners and later scored by a secondary examiner. For each student, six IQs were calculated, and all 216 values…
Descriptors: Intelligence Tests, Gifted, Error of Measurement, Scores
Wilks, Scott E.; Geiger, Jennifer R.; Bates, Samantha M.; Wright, Amy L. – Research on Social Work Practice, 2017
Objective: The objective was to examine reference errors in research articles published in Research on Social Work Practice. High rates of reference errors in other top social work journals have been noted in previous studies. Methods: Via a sampling frame of 22,177 total references among 464 research articles published in the previous decade, a…
Descriptors: Social Work, Social Services, Accuracy, Educational Research

