Publication Date
| In 2026 | 0 |
| Since 2025 | 13 |
| Since 2022 (last 5 years) | 48 |
| Since 2017 (last 10 years) | 151 |
| Since 2007 (last 20 years) | 301 |
Descriptor
| Interrater Reliability | 503 |
| Test Reliability | 503 |
| Test Validity | 260 |
| Test Construction | 106 |
| Foreign Countries | 103 |
| Psychometrics | 91 |
| Evaluation Methods | 90 |
| Scores | 67 |
| Correlation | 62 |
| Scoring | 61 |
| Rating Scales | 58 |
Author
| Epstein, Michael H. | 7 |
| Johnson, Evelyn S. | 4 |
| Matson, Johnny L. | 4 |
| Tasse, Marc J. | 4 |
| Aman, Michael G. | 3 |
| Canivez, Gary L. | 3 |
| Capie, William | 3 |
| Conroy, Maureen A. | 3 |
| Crawford, Angela R. | 3 |
| Lecavalier, Luc | 3 |
| McLeod, Bryce D. | 3 |
Audience
| Researchers | 41 |
| Practitioners | 8 |
| Administrators | 3 |
| Teachers | 3 |
| Counselors | 1 |
Location
| Turkey | 11 |
| Canada | 10 |
| Australia | 9 |
| United Kingdom | 9 |
| Pennsylvania | 7 |
| Florida | 6 |
| Netherlands | 6 |
| Sweden | 5 |
| United Kingdom (England) | 5 |
| China | 4 |
| Illinois | 4 |
Laws, Policies, & Programs
| Individuals with Disabilities… | 2 |
| No Child Left Behind Act 2001 | 1 |
| Pell Grant Program | 1 |
Burbidge, C.; Oliver, C.; Moss, J.; Arron, K.; Berg, K.; Furniss, F.; Hill, L.; Trusler, K.; Woodcock, K. – Journal of Intellectual Disability Research, 2010
Background: There is a need for assessments of psychological difference and disorder in people who have more severe intellectual disability (ID). Hyperactivity and impulsivity are two behavioural domains of importance as they are correlated with self-injury and aggression and this alludes to a shared cognitive correlate of compromised behavioural…
Descriptors: Mental Retardation, Hyperactivity, Interrater Reliability, Factor Structure
Bradshaw, Catherine P.; Debnam, Katrina; Koth, Christine W.; Leaf, Philip – Journal of Positive Behavior Interventions, 2009
Schoolwide positive behavioral interventions and supports (SWPBIS) are becoming increasingly popular with schools across the country to help create safer learning environments for students. An important aspect of SWPBIS is the ongoing monitoring and evaluation of implementation fidelity. Although a few measures have been created to assess the…
Descriptors: Interrater Reliability, Positive Reinforcement, Behavior Modification, Program Validation
Lobbestael, Jill; Arntz, Arnoud; Harkema-Schouten, Petra; Bernstein, David – Child Abuse & Neglect: The International Journal, 2009
Objective: We conducted a comprehensive assessment of the reliability and validity of the Interview for Traumatic Events in Childhood (ITEC, Lobbestael, Arntz, Kremers, & Sieswerda, 2006), a retrospective, semi-structured interview for childhood maltreatment. The ITEC aims to yield dimensional scores for severity of experiences of different…
Descriptors: Evaluation Methods, Test Reliability, Test Validity, Sexual Abuse
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests
Korat, Ofra – Early Child Development and Care, 2009
The relationship between mothers' and educators' evaluation of 75 children's emergent literacy levels and actual levels was investigated. Two groups of mothers participated: mothers with a low education and mothers with a high education. The children's emergent literacy was measured. The mothers evaluated their own children and 40 teachers…
Descriptors: Mothers, Emergent Literacy, Interrater Reliability, Mother Attitudes
Langton, Calvin M.; Barbaree, Howard E.; Harkins, Leigh; Peacock, Edward J.; Arenovich, Tamara – Journal of Interpersonal Violence, 2008
Among a number of widely used risk assessment instruments with adult sexual offenders, the Minnesota Sex Offender Screening Tool-Revised (MnSOST-R) has been subject to relatively few evaluation studies. Only two independent research groups have published replication studies in the peer-reviewed literature with data not provided by the MnSOST-R's…
Descriptors: Sexual Abuse, At Risk Persons, Criminals, Recidivism
Blijd-Hoogewys, E. M. A.; van Geert, P. L. C.; Serra, M.; Minderaa, R. B. – Journal of Autism and Developmental Disorders, 2008
Although research on Theory-of-Mind (ToM) is often based on single task measurements, more comprehensive instruments result in a better understanding of ToM development. The ToM Storybooks is a new instrument measuring basic ToM-functioning and associated aspects. There are 34 tasks, tapping various emotions, beliefs, desires and mental-physical…
Descriptors: Construct Validity, Interrater Reliability, Psychometrics, Research Methodology
Wang, Wen-Chung; Wilson, Mark – Applied Psychological Measurement, 2005
The random-effects facet model that deals with local item dependence in many-facet contexts is presented. It can be viewed as a special case of the multidimensional random coefficients multinomial logit model (MRCMLM) so that the estimation procedures for the MRCMLM can be directly applied. Simulations were conducted to examine parameter recovery…
Descriptors: Test Reliability, Item Response Theory, Interrater Reliability, Rating Scales
Solanto, Mary V.; Alvir, Jose – Journal of Attention Disorders, 2009
Objective: The objective of this study was to examine the intrarater reliability of "DSM-IV" ADHD symptoms. Method: Two-hundred-two children referred for attention problems and 49 comparison children (all 7-12 years) were rated by parents and teachers on the identical "DSM-IV" items presented in two different formats, the…
Descriptors: Symptoms (Individual Disorders), Test Reliability, Attention Deficit Hyperactivity Disorder, Classification
New York State Education Department, 2014
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and stakeholder involvement in those processes. The purpose of this report is to document the technical aspects of the 2013-14 NYSAA.…
Descriptors: Alternative Assessment, Educational Assessment, State Departments of Education, Student Evaluation
Epstein, Michael H.; Synhorst, Lori – Journal of Child and Family Studies, 2008
The Preschool Behavioral and Emotional Rating Scale (PreBERS) is a standardized, norm-referenced instrument that assesses the emotional and behavioral strengths of preschool children. Two studies that investigated the test-retest and inter-rater reliability of the PreBERS are reported. In the first study, teachers rated preschool children (N = 96)…
Descriptors: Interrater Reliability, Preschool Children, Behavior Rating Scales, Measures (Individuals)
Matson, Johnny L.; Gonzalez, Melissa L.; Wilkins, Jonathan; Rivet, Tessa T. – Research in Autism Spectrum Disorders, 2008
The reliability of a new scale to assess Autistic Disorder, Pervasive Developmental Disorder, Not Otherwise Specified (PDD-NOS), and Asperger's Disorder in children was examined. Parents or other caregivers rated symptoms of 207 children between 2 and 16 years of age. The scale, which had 40 items in the final version, correlated highly with…
Descriptors: Autism, Interrater Reliability, Criteria, Psychopathology
Gray, K. M.; Tonge, B. J.; Sweeney, D. J.; Einfeld, S. L. – Journal of Autism and Developmental Disorders, 2008
The ability to identify children who require specialist assessment for the possibility of autism at as early an age as possible has become a growing area of research. A number of measures have been developed as potential screening tools for autism. The reliability and validity of one of these measures for screening for autism in young children…
Descriptors: Check Lists, Autism, Interrater Reliability, Young Children
Das, Jacqueline; de Ruiter, Corine; Doreleijers, Theo; Hillege, Sanne – Assessment, 2009
The present study examines the reliability and construct validity of the Dutch version of the Psychopathy Check List: Youth Version (PCL:YV) in a sample of male adolescents admitted to a secure juvenile justice treatment institution (N = 98). Hare's four-factor model is used to examine reliability and validity of the separate dimensions of…
Descriptors: Check Lists, Construct Validity, Test Validity, Personality
Chavez, Oscar; Papick, Ira; Ross, Dan J.; Grouws, Douglas A. – Online Submission, 2010
The purpose of this paper was to describe the process of development of assessment instruments for the Comparing Options in Secondary Mathematics: Investigating Curriculum (COSMIC) project. The COSMIC project was a three-year longitudinal comparative study focusing on evaluating high school students' mathematics learning from two distinct…
Descriptors: Mathematics Education, Mathematics Achievement, Interrater Reliability, Scoring Rubrics

