Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Nesmith, Suzanne; Cooper, Sandi – Journal of Research in Childhood Education, 2010
The integration of children's trade books in the mathematics classroom has experienced a dramatic surge in its popularity; yet, though the positive benefits of this strategy have been well documented, these benefits may only be realized if the literature is of high quality. Utilizing a mathematics trade book evaluation instrument, this inquiry…
Descriptors: Childrens Literature, Evaluation, Mathematics Education, Mathematics Instruction
McLeod, Bryce D.; Weisz, John R. – Journal of Clinical Child and Adolescent Psychology, 2010
Most everyday child and adolescent psychotherapy does not follow manuals that document the procedures. Consequently, usual clinical care has remained poorly understood and rarely studied. The Therapy Process Observational Coding System for Child Psychotherapy-Strategies scale (TPOCS-S) is an observational measure of youth psychotherapy procedures…
Descriptors: Interrater Reliability, Measures (Individuals), Psychotherapy, Depression (Psychology)
Tweed, Mike; Ingham, Christopher – Advances in Health Sciences Education, 2010
Judgments made by the assessors observing consultations are widely used in the assessment of medical students. The aim of this research was to study judgment accuracy and confidence and the relationship between these. Assessors watched recordings of consultations, scoring the students on: a checklist of items; attributes of consultation; a…
Descriptors: Medical Students, Student Evaluation, Consultation Programs, Observation
Touchie, Claire; Humphrey-Murto, Susan; Ainslie, Martha; Myers, Kathryn; Wood, Timothy J. – Advances in Health Sciences Education, 2010
Oral examinations have become more standardized over recent years. Traditionally a small number of raters were used for this type of examination. Past studies suggested that more raters should improve reliability. We compared the results of a multi-station structured oral examination using two different rater models, those based in a station,…
Descriptors: Interrater Reliability, Internal Medicine, Evaluation Methods, Tests
Ruble, Lisa A.; McGrew, John; Dalrymple, Nancy; Jung, Lee Ann – Journal of Autism and Developmental Disorders, 2010
The purpose of this study was to develop an Individual Education Program (IEP) evaluation tool based on Individuals with Disabilities Education Act (IDEA) requirements and National Research Council recommendations for children with autism; determine the tool's reliability; test the tool on a pilot sample of IEPs of young children; and examine…
Descriptors: Autism, Interrater Reliability, Disabilities, Young Children
Hermans, Heidi; Evenhuis, Heleen M. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2010
The aim of this study was to obtain information on feasibility, reliability and validity of available instruments screening for depression applied in people with intellectual disabilities (ID). Therefore, literature was systematically reviewed. For self-report, the Glasgow Depression scale for people with a Learning Disability appears most…
Descriptors: Mental Retardation, Learning Disabilities, Interrater Reliability, Psychometrics
Villa, Susanna; Micheli, Enrico; Villa, Laura; Pastore, Valentina; Crippa, Alessandro; Molteni, Massimo – Journal of Autism and Developmental Disorders, 2010
The PEP-R (psychoeducational profile revised) is an instrument that has been used in many countries to assess abilities and formulate treatment programs for children with autism and related developmental disorders. To provide further information on the PEP-R's psychometric properties, a large sample (N = 137) of children presenting…
Descriptors: Autism, Interrater Reliability, Psychometrics, Screening Tests
Clarke, Brandy L.; Sheridan, Susan M.; Kim, Elizabeth M.; Kupzyk, Kevin A.; Knoche, Lisa L.; Ransom, Kelly A.; Sjuts, Tara M. – Nebraska Center for Research on Children, Youth, Families and Schools, 2012
Children in poverty are at greater risk of academic failure due to impoverished living conditions and a lack of parental nurturance. Mothers' engagement in children's learning can be undermined by maternal depression, placing children at risk for cognitive and motor delays. With intervention, parents experiencing poverty and depression can…
Descriptors: Academic Failure, School Readiness, Intervention, Parent Child Relationship
Unal, Zafer; Bodur, Yasar; Unal, Aslihan – Contemporary Issues in Technology and Teacher Education (CITE Journal), 2012
The researchers in this study undertook development of a webquest evaluation rubric and investigated its reliability. The rubric was created using the strengths of the currently available webquest rubrics with improvements based on the comments provided in the literature and feedback received from educators. After the rubric was created, 23…
Descriptors: Test Construction, Test Reliability, Instructional Material Evaluation, Scoring Rubrics
Storch, Eric A.; Wood, Jeffrey J.; Ehrenreich-May, Jill; Jones, Anna M.; Park, Jennifer M.; Lewin, Adam B.; Murphy, Tanya K. – Journal of Autism and Developmental Disorders, 2012
The psychometric properties of the Pediatric Anxiety Rating Scale (PARS), a clinician-administered measure for assessing severity of anxiety symptoms, were examined in 72 children and adolescents diagnosed with an autism spectrum disorder (ASD). The internal consistency of the PARS was 0.59, suggesting that the items were related but not…
Descriptors: Test Reliability, Test Validity, Rating Scales, Anxiety
Sutherland, Kevin S.; McLeod, Bryce D.; Conroy, Maureen A.; Abrams, Lisa M.; Smith, Meghan M. – Journal of Emotional and Behavioral Disorders, 2014
The measurement of treatment integrity is critical to evaluate the efficacy and effectiveness of evidence-based programs (EBPs) designed to improve the developmental outcomes of young children at risk of emotional/behavioral disorders. Unfortunately, the science of treatment integrity measurement lags behind the development and evaluation of EBP…
Descriptors: Psychometrics, Competence, Emotional Disturbances, Behavior Disorders
Murley, Lisa D.; Stobaugh, Rebecca; Jukes, Pamela; Tassell, Janet – Educational Renaissance, 2014
The purpose of this article is to provide an overview of the process used to examine the inter-rater reliability of the Teacher Work Sample (TWS) Scoring Rubric involved with the senior culminating experience for teacher candidates used at a large comprehensive university. The study compared holistic and analytic scores reported by Student Teacher…
Descriptors: Teacher Education, Interrater Reliability, Scoring Rubrics, Preservice Teachers
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Kiliç, Çigdem; Yanpar Yelken, Tugba – Eurasian Journal of Educational Research, 2013
Problem Statement: Recent studies in education have focused on how to handle metaphors as research and evaluation tools. Metaphors have many advantages for researchers, educators and learners with the most important being that they can help educators understand pre-service teachers' thinking and belief systems of mathematics. A study of previous…
Descriptors: Preservice Teachers, Elementary School Teachers, Figurative Language, Language Usage
Park, Jungjun; Lombardino, Linda J.; Ritter, Michaela – American Annals of the Deaf, 2013
The investigators measured 7 literacy skills in a group of 21 school-age children with mild to moderate sensorineural hearing loss (MSNH group), and compared the scores to those of 2 age-matched groups: children with dyslexia (DYS group) and, as a control, typically developing hearing children (CA group). The MSNH group performed consistently…
Descriptors: Phonological Awareness, Reading Skills, Spelling, Children

