Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedRowley, Glenn – Journal of Educational Measurement, 1978
The reliabilities of various observational measures were determined, and the influence of both the number and the length of the observation periods on reliability was examined, both separately and jointly. A single simplifying assumption leads to a variant of the Spearman-Brown formula, which may have wider application. (Author/CTM)
Descriptors: Career Development, Classroom Observation Techniques, Observation, Reliability
Peer reviewedStewart, Krista J. – Psychology in the Schools, 1987
Evaluated the technical aspects of three Wechsler Intelligence Scale for Children-Revised (WISC-R) administrations of five psychology graduate students using the WISC-R Administration Observational Checklist (WAOC) to evaluate interrater agreement. Students performed significantly better on the second than on the first observation, with…
Descriptors: Educational Diagnosis, Error Patterns, Examiners, Graduate Students
Shaw, Emily J.; Milewski, Glenn B. – College Entrance Examination Board, 2004
In order for individualized review in college admissions to be fair, issues of consistency and reliability must be considered. There are a number of ways to assess interrater reliability, including calculating the composite reliability of readers, computing the proportion of times that readers make consistent ratings, and evaluating reader…
Descriptors: College Applicants, College Admission, Interrater Reliability, Reliability
Khan, S. B.; Roberts, D. M. – Measurement and Evaluation in Guidance, 1971
This is an examination of the structural stability of affective characteristics relevant to education over a period of approximately seven months. Results indicated that the six interpretable factors were differentially stable over the time intervals. (Author)
Descriptors: Affective Behavior, Affective Objectives, Attitude Measures, Junior High School Students
Peer reviewedMuris, Peter; Steerneman, Pim; Ratering, Elise – Journal of Autism and Developmental Disorders, 1997
A study of 10 children (ages 3-6) with pervasive developmental disorders investigated the interrater reliability of the Psychoeducational Profile (PEP). Results show good interrater reliability for the developmental items, indicating that the PEP can be used to evaluate progress in development of children with pervasive developmental disorders.…
Descriptors: Child Development, Children, Evaluation Methods, Foreign Countries
Peer reviewedConroy, Maureen A.; And Others – Education and Training in Mental Retardation and Developmental Disabilities, 1996
This study assessed the intra-rater and inter-rater reliability of the Motivation Assessment Scale as used with 20 adults with mental retardation, expanding the results of previous research by evaluating across additional time and administrations. Results from 19 raters indicated variable moderate-to-low intra-rater and inter-rater reliability.…
Descriptors: Adults, Behavior Problems, Interrater Reliability, Measures (Individuals)
Peer reviewedHux, Karen; And Others – Journal of Communication Disorders, 1997
A study evaluated and compared four methods of assessing reliability on one discourse analysis procedure--a modified version of Damico's Clinical Discourse Analysis. The methods were Pearson product-moment correlations; interobserver agreement; Cohen's kappa; and generalizability coefficients. The strengths and weaknesses of the methods are…
Descriptors: Communication Disorders, Discourse Analysis, Evaluation Methods, Evaluation Problems
Milner, Joel S.; Robertson, Kevin R. – Child Abuse and Neglect: The International Journal, 1989
A study examined the responses of 89 physical child abusers and 108 comparison subjects to the Child Abuse Potential Inventory to determine whether the inventory's response inconsistency scale could be used for screening for physical child abuse. The scale was rejected as a reliable or valid measure. (Author/MSE)
Descriptors: Child Abuse, Predictive Measurement, Psychological Patterns, Reliability
Adams, Joyce A.; Wells, Robert – Child Abuse and Neglect: The International Journal, 1993
Preselected colposcopic photographs of the anogenital area of 16 patients were shown to 170 medical examiners, who rated their level of suggestion or indication of penetrating injury. Agreement between the participants and experts was higher on the abnormal cases than on the normal cases, and higher on genital findings than on anal findings.…
Descriptors: Child Abuse, Interrater Reliability, Medical Evaluation, Pediatrics
Peer reviewedSigafoos, Jeff; And Others – Research in Developmental Disabilities, 1994
Eighteen adolescents and adults with severe/profound intellectual disability were rated by two staff members using the Motivation Assessment Scale to identify variables maintaining their aggressive behaviors. Analysis of interrater reliability indicated that for some individuals the scale may not represent a feasible alternative to more formal…
Descriptors: Adolescents, Adults, Aggression, Behavior Problems
Peer reviewedPark, Hyun-Sook; And Others – Journal of Experimental Education, 1990
The reliability of visual inspection in single-case research was investigated by determining agreement among 5 judges visually inspecting 44 graphs depicting behavior from baseline to intervention. Agreement between visual inspection and statistical procedures was determined. Implications for single-case research are discussed. (SLD)
Descriptors: Behavior Patterns, Evaluation Methods, Evaluators, Graphs
Peer reviewedMuris, Peter; Steerneman, Pim; Meesters, Cor; Merckelbach, Harald; Horselenberg, Robert; van den Hogen, Tanja; van Dongen, Lieke – Journal of Autism and Developmental Disorders, 1999
Four studies investigated reliability and validity of the Theory of Mind (TOM) test, an instrument for assessing theory-of-mind ability in typical children and children with pervasive developmental disorders. The TOM test was found to be a reliable and valid instrument for measuring various aspects of theory of mind. (Author/CR)
Descriptors: Children, Interpersonal Competence, Interrater Reliability, Pervasive Developmental Disorders
Peer reviewedEpstein, Michael H.; Cullinan, Douglas; Harniss, Mark K.; Ryser, Gail – Behavioral Disorders, 1999
Three studies are reported addressing the reliability of the Scale for Assessing Emotional Disturbance (SAED), a standardized, norm-reference measure linked to the federal definition of emotional disturbance (ED). Results indicate the SAED possesses acceptable test-retest reliability and reasonable interrater reliability and can assist in the…
Descriptors: Disability Identification, Elementary Secondary Education, Eligibility, Emotional Disturbances
Peer reviewedGillberg, Christopher; Gillberg, Carina; Rastam, Maria; Wentz, Elisabeth – Autism: The International Journal of Research and Practice, 2001
The development of the Asperger Syndrome (and high-functioning autism) Diagnostic Interview (ASDI) is described. Preliminary data from a clinical study of 20 individuals (ages 6-55) suggest that interrater reliability and test-retest stability may be excellent, with kappas exceeding 0.90 in both instances. The validity appears to be relatively…
Descriptors: Adults, Asperger Syndrome, Autism, Children
Spreat, Scott; Connelly, Lisa – American Journal on Mental Retardation, 1996
Reliability analysis of the Motivation Assessment Scale was conducted on subscales completed by staff members working with 47 institutionalized adults with severe to profound mental retardation and self-injurious behavior problems. Internal consistency was found to be superior to interrater reliability. The instrument's internal consistency…
Descriptors: Adults, Behavior Problems, Institutionalized Persons, Interrater Reliability


