Ciorba, Charles R.; Smith, Neal Y. – Journal of Research in Music Education, 2009
Recent policy initiatives instituted by major accrediting bodies require the implementation of specific assessment tools to provide evidence of student achievement in a number of areas, including applied music study. The purpose of this research was to investigate the effectiveness of a multidimensional assessment rubric, which was administered to…
Descriptors: Music Education, Performance Based Assessment, Music, Scoring Rubrics
Rolfhus, Eric; Decker, Lauren E.; Brite, Jessica L.; Gregory, Lois – Regional Educational Laboratory Southwest (NJ1), 2010
This study of four national English language arts college readiness standards sets compares content alignment and level of alignment of the standards statements in three comparison sets to a benchmark set, the American Diploma Project (ADP), and analyzes the cognitive complexity of all four sets. Specifically, this report addresses two primary…
Descriptors: School Readiness, Language Arts, Interrater Reliability, Measures (Individuals)
Prain, Meredith; McVilly, Keith; Ramcharan, Paul; Currie, Sally; Reece, John – Journal of Intellectual & Developmental Disability, 2010
Background: Adults with congenital deafblindness (CDB) have received little attention from researchers. In this study we examined the nature of interactions between adults with CDB and the staff who mediate their support, and investigated the reliability of an observation coding system, originally designed for observing adults with severe…
Descriptors: Observation, Interrater Reliability, Interpersonal Relationship, Group Homes
Stone, Lisanne L.; Otten, Roy; Engels, Rutger C. M. E.; Vermulst, Ad A.; Janssens, Jan M. A. M. – Clinical Child and Family Psychology Review, 2010
Since its development, the Strengths and Difficulties Questionnaire (SDQ) has been widely used in both research and practice. The SDQ screens for positive and negative psychological attributes. This review aims to provide an overview of the psychometric properties of the SDQ for 4- to 12-year-olds. Results from 48 studies (N = 131,223) on…
Descriptors: Predictive Validity, Factor Structure, Psychopathology, Behavior Rating Scales
Huang, Jinyan; Foote, Chandra J. – Language Assessment Quarterly, 2010
This study examines score variations and differences in the reliability of ratings between English-as-a-second-language (ESL) and native English (NE) authored papers in a graduate course. Generalizability (G-) theory was used as a framework for analysis because it is powerful in detecting rater variability and the relative contributions of…
Descriptors: Graduate Students, Holistic Evaluation, North Americans, English (Second Language)
Bagner, Daniel M.; Boggs, Stephen R.; Eyberg, Sheila M. – Education and Treatment of Children, 2010
This study examined the psychometric properties of the Revised Edition of the School Observation Coding System (REDSOCS). Participants were 68 children ages 3 to 6 who completed parent-child interaction therapy for Oppositional Defiant Disorder as part of a larger efficacy trial. Interobserver reliability on REDSOCS categories was moderate to…
Descriptors: Behavior Problems, Student Behavior, Teacher Evaluation, Test Reliability
Erling, Elizabeth J.; Richardson, John T. E. – Assessing Writing, 2010
Measuring the Academic Skills of University Students is a procedure developed in the 1990s at the University of Sydney's Language Centre to identify students in need of academic writing development by assessing examples of their written work against five criteria. This paper reviews the literature relating to the development of the procedure with…
Descriptors: Foreign Countries, Writing Evaluation, Assignments, Psychometrics
Helou, Leah B.; Solomon, Nancy Pearl; Henry, Leonard R.; Coppit, George L.; Howard, Robin S.; Stojadinovic, Alexander – American Journal of Speech-Language Pathology, 2010
Purpose: To determine whether experienced and inexperienced listeners rate postthyroidectomy voice samples similarly using the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Method: Prospective observational study of voice quality ratings of randomized and blinded voice samples was performed. Twenty-one postthyroidectomy patients'…
Descriptors: Listening Comprehension, Voice Disorders, Interrater Reliability, Speech Language Pathology
Bothe, Anne K. – Journal of Speech, Language, and Hearing Research, 2008
Purpose: The purposes of this study were (a) to determine whether highly experienced clinicians and researchers agreed with each other in judging the presence or absence of stuttering in the speech of children who stutter and (b) to determine how those binary stuttered/nonstuttered judgments related to categorizations of the same speech based on…
Descriptors: Stuttering, Identification, Young Children, Speech
Cicchetti, Domenic V.; Lord, Catherine; Koenig, Kathy; Klin, Ami; Volkmar, Fred R. – Journal of Autism and Developmental Disorders, 2008
The authors assessed the reliability of the Autism Diagnostic Interview-Revised (ADI-R). Seven Clinical Examiners evaluated a three-and-one-half-year-old female toddler suspected of being on the Autism Spectrum. Examiners showed agreement levels of 94-96% across all items, with weighted kappa (K_w) between 0.80 and 0.88. They were in 100%…
Descriptors: Autism, Interrater Reliability, Measures (Individuals), Clinical Diagnosis
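For readers unfamiliar with the weighted kappa statistic mentioned in the abstract above, the following is a minimal sketch of Cohen's weighted kappa for two raters on an ordinal scale. It is an illustration only and not the authors' procedure: the study involved seven examiners, and the abstract does not say which weighting scheme was used. The function name `weighted_kappa`, its parameters, and the ratings in the usage example are hypothetical.

```python
def weighted_kappa(rater_a, rater_b, categories, weights="linear"):
    """Cohen's weighted kappa for two raters scoring the same items.

    categories: the ordered list of possible ratings (lowest to highest).
    weights: "linear" or "quadratic" disagreement weights (an assumption;
             the abstract does not state which scheme was applied).
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("Both raters must score the same non-empty item set.")
    k = len(categories)
    if k < 2:
        raise ValueError("At least two categories are required.")
    if weights not in ("linear", "quadratic"):
        raise ValueError("weights must be 'linear' or 'quadratic'.")

    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)

    # Observed joint proportions: observed[i][j] is the share of items
    # that rater A placed in category i and rater B in category j.
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    def disagreement(i, j):
        # 0 on the diagonal, 1 for the most distant pair of categories.
        d = abs(i - j) / (k - 1)
        return d if weights == "linear" else d * d

    obs = sum(disagreement(i, j) * observed[i][j]
              for i in range(k) for j in range(k))
    exp = sum(disagreement(i, j) * row[i] * col[j]
              for i in range(k) for j in range(k))
    if exp == 0:
        raise ValueError("Kappa is undefined when both raters use one category.")

    # Kappa is 1 minus the ratio of observed to chance-expected disagreement.
    return 1.0 - obs / exp


if __name__ == "__main__":
    # Hypothetical ratings on a 0-3 ordinal scale, for illustration only.
    examiner_1 = [0, 1, 2, 3, 2, 1, 0, 3]
    examiner_2 = [0, 1, 2, 2, 2, 1, 1, 3]
    print(weighted_kappa(examiner_1, examiner_2, categories=[0, 1, 2, 3]))
```

Unlike raw percent agreement, this statistic corrects for agreement expected by chance and penalizes near-misses less than distant disagreements, which is why the two kinds of figures are often reported side by side.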
Mann, Zennetta; McLaughlin, T. F.; Williams, Randy Lee; Derby, K. Mark; Everson, Mary – Journal of Special Education Apprenticeship, 2012
The purpose of the present study was to evaluate the effects of a Direct Instruction (DI) flashcard procedure, combined with strategies and rewards, on the multiplication fact accuracy of two elementary school-age students. A single subject replication design across three and four sets of multiplication facts was used to evaluate outcomes. The results…
Descriptors: Direct Instruction, Instructional Materials, Mathematics Instruction, Rewards
Buelin-Biesecker, Jennifer Katherine – ProQuest LLC, 2012
This study compared the creative outcomes in student work resulting from two pedagogical approaches to creative problem solving activities. A secondary goal was to validate the Consensual Assessment Technique (CAT) as a means of assessing creativity. Linear models for problem solving and design processes serve as the current paradigm in classroom…
Descriptors: Technology Education, Creativity, Problem Solving, Teaching Methods
Gorsky, Paul; Caspi, Avner; Blau, Ina; Vine, Yodfat; Billet, Amit – International Review of Research in Open and Distance Learning, 2012
The goal of this study is to further corroborate a hypothesized population parameter for the frequencies of social presence versus the sum of teaching presence and cognitive presence as defined by the community of inquiry model in higher education asynchronous course forums. This parameter has been found across five variables: academic institution…
Descriptors: Foreign Countries, Open Universities, Inquiry, Communities of Practice
Johnson, Erik A. – Contributions to Music Education, 2011
The purpose of this study was to determine the effect of peer-based instruction on rhythm reading achievement of instrumental and choral music students attending a large urban-fringe high school in a major metropolitan area. Participants (N = 131) included band (n = 71) and choir (n = 60) students whose backgrounds reflected extensive economic (78%…
Descriptors: Music, Music Education, Music Reading, High School Students
Siu, Andrew M. H.; Lai, Cynthia Y. Y.; Chiu, Amy S. M.; Yip, Calvin C. K. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Objectives: Most of the fine-motor assessment tools used in Hong Kong have been designed in Western countries, so there is a need to develop a standardized assessment which is relevant to the culture and daily living tasks of the local (that is, Chinese) population. This study aimed to (1) develop a fine-motor assessment tool (the Hong Kong…
Descriptors: Developmental Disabilities, Content Validity, Interrater Reliability, Young Children

