Goodwin, Laura D.; Goodwin, William L. – Journal of Early Intervention, 1991 (peer reviewed)
Four approaches to estimating interrater reliability in early childhood special education research are illustrated and compared: correlation, comparison of means, percentage of agreement, and generalizability theory techniques. Generalizability theory techniques are proposed as a method for estimating the amount of variance attributable to…
Descriptors: Analysis of Variance, Disabilities, Early Childhood Education, Educational Research
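Three of the four approaches named in the abstract above (correlation, comparison of means, and percentage of agreement) can be illustrated with a minimal sketch. The function names and the two rating lists are hypothetical and are not taken from the article; generalizability theory is left out because it requires a variance-components analysis beyond a short example.

```python
from statistics import mean


def percent_agreement(a, b):
    """Share of cases on which two raters gave exactly the same rating."""
    if len(a) != len(b) or not a:
        raise ValueError("rating lists must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def pearson_r(a, b):
    """Pearson correlation between two raters' scores."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5


def mean_difference(a, b):
    """Difference between the raters' mean scores (rater A minus rater B)."""
    return mean(a) - mean(b)


# Hypothetical ratings: rater B is always exactly one point higher than rater A.
rater_a = [1, 2, 3, 4, 5]
rater_b = [2, 3, 4, 5, 6]

agreement = percent_agreement(rater_a, rater_b)
correlation = pearson_r(rater_a, rater_b)
difference = mean_difference(rater_a, rater_b)
```

With these made-up ratings the raters never agree exactly and their means differ by one point, yet the correlation is perfect. This is one reason the indices are usually compared rather than treated as interchangeable.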
Huot, Brian – College Composition and Communication, 1990 (peer reviewed)
Describes holistic scoring as one of the biggest breakthroughs in writing assessment. Suggests that the technique's high interrater reliability coefficients partly explain holistic scoring's popularity. Argues that validity has been largely neglected. Concludes that more must be learned about the uses and effects of holistic scoring. (SG)
Descriptors: Educational Testing, Higher Education, Holistic Approach, Holistic Evaluation
Mabry, Linda – Phi Delta Kappan, 1999
Education remains heavily shackled by punitive, test-driven reform. Despite reasonable alternatives, testing increasingly drives educational accountability and reform. Standardization of direct writing assessments promotes scoring reliability and facilitates educational comparisons and rankings. However, standardized writing is not good writing,…
Descriptors: Elementary Secondary Education, Interrater Reliability, Performance Based Assessment, Scoring Rubrics
Sadler, Philip M.; Hammerman, James K. – College and University, 1999 (peer reviewed)
A quantitative study modeled the inherently subjective admissions process for 592 graduate school candidates and 72 raters. Logistic regression models were well-fitting and parsimonious, allowing analysis of each stage of the process. Extended committee discussion/deliberation phases were of limited productivity when inter-rater agreement was…
Descriptors: Admission Criteria, Bias, College Admission, Committees
Graham, Susan A.; Poulin-Dubois, Diane – Journal of Child Language, 1999 (peer reviewed)
Two experiments examined infants' reliance on object shape versus color for word generalization to animate and inanimate objects. Infants were taught labels for either novel vehicles or novel animals using a preferential-looking procedure or an interactive procedure. Results of both experiments indicated that infants limited their word…
Descriptors: Animals, Auditory Stimuli, Child Language, Color
Nordin, Viviann; Gillberg, Christopher; Nyden, Agneta – Journal of Autism and Developmental Disorders, 1998 (peer reviewed)
This study assessed the interrater reliability of a Swedish version of the Childhood Autism Rating Scale (CARS), an instrument for screening and diagnosis of autism. The CARS was used for rating autistic behavior by two investigators in 25 children. Results indicated fair to excellent agreement. Aspects of validity and reliability are discussed.…
Descriptors: Autism, Behavior Rating Scales, Clinical Diagnosis, Disability Identification
Hurman, John – Language Learning Journal, 1996 (peer reviewed)
Studied the marking characteristics of experienced markers of GCSE role-play to ascertain the extent of variation between the marks they award and to determine whether more intermarker consistency could be obtained with a small increase in time for thought before a particular mark is awarded. Results underline the importance of reducing the…
Descriptors: Evaluators, Foreign Countries, Interrater Reliability, Oral Language
Henning, Grant – Language Testing, 1996 (peer reviewed)
Analyzes simulated performance ratings on a six-point scale by two independent raters to account for nonsystematic error in performance ratings. Results suggest that rater agreement or covariance is not always a dependable estimate of score reliability and that the practice of seeking additional raters for adjudication of discrepant ratings is not…
Descriptors: Correlation, Error Patterns, Interrater Reliability, Language Tests
Cordes, Anne K.; Ingham, Roger J. – Journal of Speech and Hearing Research, 1996 (peer reviewed)
Ten speech-language pathology students judged five-second audiovisually recorded speech intervals as stuttered or nonstuttered in group and single-subject experiments. Results showed that judgment accuracy tended to increase after training, both for speakers used during the training process and unfamiliar speakers. Slight increases in interjudge…
Descriptors: Disability Identification, Evaluative Thinking, Higher Education, Instructional Effectiveness
Feinberg, Mark; Neiderhiser, Jenae; Howe, George; Hetherington, E. Mavis – Child Development, 2001 (peer reviewed)
Examined low interrater agreement by decomposing common and unique variance among parent, adolescent, and observer reports of parental warmth and negativity into genetic and environmental factors. Findings from model-fitting analyses generally supported predictions for warmth and negativity at Family and Individual levels. At the Social level, genetic…
Descriptors: Adolescents, Environmental Influences, Heredity, Interrater Reliability
Matson, Johnny L.; Laud, Rinita B.; Gonzalez, Melissa L.; Malone, Carrie J.; Swender, Stephen L. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2005
The use of anti-epileptic medications (AEDs) is much higher in individuals with intellectual disabilities than in the general population. As many of these individuals rely on such medications, clinicians should consider psychometrically sound instruments for assessing adverse side effects of these medications as one aspect of routine clinical…
Descriptors: Evaluation Methods, Seizures, Epilepsy, Developmental Disabilities
Pine, Elyse; Luby, Joan; Abbacchi, Anna; Constantino, John N. – Autism: The International Journal of Research & Practice, 2006
Given a growing emphasis on early intervention for children with autism, valid quantitative tools for measuring treatment response are needed. The Social Responsiveness Scale (SRS) is a brief (15-20 minute) quantitative measure of autistic traits in 4- to 18-year-olds, for which a version for 3-year-olds was recently developed. We obtained serial…
Descriptors: Pervasive Developmental Disorders, Preschool Children, Early Intervention, Interrater Reliability
Brown, William H.; Pfeiffer, Karin A.; McIver, Kerry L.; Dowda, Marsha; Almeida, M. Joao C. A.; Pate, Russell R. – Research Quarterly for Exercise and Sport, 2006
In this paper we present initial information concerning a new direct observation system--the Observational System for Recording Physical Activity in Children-Preschool Version. The system will allow researchers to record young children's physical activity levels while also coding the topography of their physical activity, as well as detailed…
Descriptors: Preschool Children, Student Evaluation, Physical Activities, Physical Activity Level
Norbury, Courtenay Frazier; Nash, Marysia; Baird, Gillian; Bishop, Dorothy V. M. – International Journal of Language and Communication Disorders, 2004
Background: The Children's Communication Checklist (CCC 1998) was revised in 2003 (CCC-2) to provide a general screen for communication disorder and to identify pragmatic/social interaction deficits. Two validation studies were conducted with different populations of children with language and communication impairments. Methods & Procedures: In…
Descriptors: Interpersonal Competence, Pragmatics, Language Impairments, Check Lists
Mohr, C.; Tonge, B. J.; Einfeld, S. L. – Journal of Intellectual Disability Research, 2005
People with intellectual disability (ID) and untreated psychiatric disorder lead unnecessarily difficult and unhappy lives. The prevalence of mental illness in children and adults with ID is greater than that found in the general population. A carer-completed checklist of psychopathology that could be used with both children and adults would help…
Descriptors: Psychometrics, Psychopathology, Check Lists, Test Validity

