Publication Date
| In 2026 | 0 |
| Since 2025 | 13 |
| Since 2022 (last 5 years) | 48 |
| Since 2017 (last 10 years) | 151 |
| Since 2007 (last 20 years) | 301 |
Descriptor
| Interrater Reliability | 503 |
| Test Reliability | 503 |
| Test Validity | 260 |
| Test Construction | 106 |
| Foreign Countries | 103 |
| Psychometrics | 91 |
| Evaluation Methods | 90 |
| Scores | 67 |
| Correlation | 62 |
| Scoring | 61 |
| Rating Scales | 58 |
| More ▼ | |
Source
Author
| Epstein, Michael H. | 7 |
| Johnson, Evelyn S. | 4 |
| Matson, Johnny L. | 4 |
| Tasse, Marc J. | 4 |
| Aman, Michael G. | 3 |
| Canivez, Gary L. | 3 |
| Capie, William | 3 |
| Conroy, Maureen A. | 3 |
| Crawford, Angela R. | 3 |
| Lecavalier, Luc | 3 |
| McLeod, Bryce D. | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 41 |
| Practitioners | 8 |
| Administrators | 3 |
| Teachers | 3 |
| Counselors | 1 |
Location
| Turkey | 11 |
| Canada | 10 |
| Australia | 9 |
| United Kingdom | 9 |
| Pennsylvania | 7 |
| Florida | 6 |
| Netherlands | 6 |
| Sweden | 5 |
| United Kingdom (England) | 5 |
| China | 4 |
| Illinois | 4 |
| More ▼ | |
Laws, Policies, & Programs
| Individuals with Disabilities… | 2 |
| No Child Left Behind Act 2001 | 1 |
| Pell Grant Program | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Yocum, Allison; McCoy, Sarah Westcott; Bjornson, Kristie F.; Mullens, Pamela; Burton, Gay Naganuma – Physical & Occupational Therapy in Pediatrics, 2010
A standardized protocol for a pediatric heel-rise test was developed and reliability and validity are reported. Fifty-seven children developing typically (CDT) and 34 children with plantar flexion weakness performed three tests: unilateral heel rise, vertical jump, and force measurement using handheld dynamometry. Intraclass correlation…
Descriptors: Test Validity, Test Reliability, Interrater Reliability, Psychomotor Skills
Latimer, Marvin E., Jr.; Bergee, Martin J.; Cohen, Mary L. – Journal of Research in Music Education, 2010
The purpose of this study was to investigate the reliability and perceived pedagogical utility of a multidimensional weighted performance assessment rubric used in Kansas state high school large-group festivals. Data were adjudicator rubrics (N = 2,016) and adjudicator and director questionnaires (N = 515). Rubric internal consistency was…
Descriptors: Music Activities, State Programs, Performance Based Assessment, Weighted Scores
Rufino, Katrina A.; Boccaccini, Marcus T.; Guy, Laura S. – Assessment, 2011
Although reliability is essential to validity, most research on violence risk assessment tools has paid little attention to strategies for improving rater agreement. The authors evaluated the degree to which perceived subjectivity in scoring guidelines for items from two measures--the Psychopathy Checklist-Revised (PCL-R) and the Historical,…
Descriptors: Risk Management, Predictive Validity, Interrater Reliability, Scoring
Nunan, Anna – Language Learning in Higher Education, 2014
The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…
Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards
Johnson, Evelyn S.; Semmelroth, Carrie L. – Journal of Special Education Apprenticeship, 2012
This paper reports the results of interrater agreement analyses on a pilot special education teacher evaluation instrument, the Recognizing Effective Special Education Teachers (RESET) Observation Tool (OT). Using evidence-based instructional practices as the basis for the evaluation, the RESET OT is designed for the spectrum of different…
Descriptors: Interrater Reliability, Pilot Projects, Special Education, Special Education Teachers
Hasson, Natalie; Dodd, Barbara; Botting, Nicola – International Journal of Language & Communication Disorders, 2012
Background: Sentence construction and syntactic organization are known to be poor in children with specific language impairments (SLI), but little is known about the way in which children with SLI approach language tasks, and static standardized tests contribute little to the differentiation of skills within the population of children with…
Descriptors: Alternative Assessment, Sentence Structure, Syntax, Language Processing
McVilly, K.; Webber, L.; Paris, M.; Sharp, G. – Journal of Intellectual Disability Research, 2013
Background: Having an objective means of evaluating the quality of behaviour support plans (BSPs) could assist service providers and statutory authorities to monitor and improve the quality of support provided to people with intellectual disability (ID) who exhibit challenging behaviour. The Behaviour Support Plan Quality Evaluation Guide II…
Descriptors: Foreign Countries, Behavior Problems, Behavior Modification, Adults
King, Kathleen; Reschly, Amy L.; Appleton, James J. – Journal of Psychoeducational Assessment, 2012
The purpose of the current study was to evaluate a screening instrument. The sample contained 496 elementary children from the rural southeast. Properties of the Teacher, Parent, and Student Forms of the Behavioral and Emotional Screening System were examined. Results indicated that all forms had high levels of internal consistency. There were low…
Descriptors: Elementary School Students, Rural Schools, Elementary School Teachers, Parents
Zhao, Zhongbao – RELC Journal: A Journal of Language Teaching and Research, 2013
This study investigates the validity of the Diagnostic College English Speaking Test (DCEST) in the context of EFL teaching and learning in China. The experiment was conducted in three stages over the course of eight weeks at a national key university in China. By means of test administration and questionnaire survey, the researcher gathered…
Descriptors: Oral Language, Construct Validity, Language Tests, Diagnostic Tests
Hermans, Heidi; van der Pas, Femke H.; Evenhuis, Heleen M. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Background: In the last decades several instruments measuring anxiety in adults with intellectual disabilities have been developed. Aim: To give an overview of the characteristics and psychometric properties of self-report and informant-report instruments measuring anxiety in this group. Method: Systematic review of the literature. Results:…
Descriptors: Mental Retardation, Learning Disabilities, Interrater Reliability, Measures (Individuals)
Bian, Xiaoyan; Yao, Guoying; Squires, Jane; Hoselton, Rob; Chen, Ching-I; Murphy, Kimberly; Wei, Mei; Fang, Binghua – Journal of Early Childhood Research, 2012
As part of efforts throughout China to improve the outcomes of individuals with disabilities, the Shanghai government has launched a campaign to screen at least 95 percent of newborns. To assist in meeting this goal, the Ages & Stages Questionnaires (ASQ), Third Edition, was translated into Chinese and the feasibility of a screening system…
Descriptors: Translation, Screening Tests, Caregivers, Validity
Hermans, Heidi; Jelluma, Naftha; van der Pas, Femke H.; Evenhuis, Heleen M. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2012
Background: The informant-based Anxiety, Depression And Mood Scale was translated into Dutch and its feasibility, reliability and validity in older adults (aged greater than or equal to 50 years) with intellectual disabilities (ID) was studied. Method: Test-retest (n = 93) and interrater reliability (n = 83), and convergent (n = 202 and n = 787),…
Descriptors: Mental Retardation, Interrater Reliability, Measures (Individuals), Depression (Psychology)
Hidecker, Mary Jo Cooley; Paneth, Nigel; Rosenbaum, Peter L.; Kent, Raymond D.; Lillie, Janet; Eulenberg, John B.; Chester, Ken, Jr.; Johnson, Brenda; Michalsen, Lauren; Evatt, Morgan; Taylor, Kara – Developmental Medicine & Child Neurology, 2011
Aim: The purpose of this study was to create and validate the Communication Function Classification System (CFCS) for children with cerebral palsy (CP), for use by a wide variety of individuals who are interested in CP. This paper reports the content validity, interrater reliability, and test-retest reliability of the CFCS for children with CP.…
Descriptors: Cerebral Palsy, Validity, Test Reliability, Interrater Reliability
Ruble, Lisa A.; McGrew, John; Dalrymple, Nancy; Jung, Lee Ann – Journal of Autism and Developmental Disorders, 2010
The purpose of this study was to develop an Individual Education Program (IEP) evaluation tool based on Individuals with Disabilities Education Act (IDEA) requirements and National Research Council recommendations for children with autism; determine the tool's reliability; test the tool on a pilot sample of IEPs of young children; and examine…
Descriptors: Autism, Interrater Reliability, Disabilities, Young Children
Villa, Susanna; Micheli, Enrico; Villa, Laura; Pastore, Valentina; Crippa, Alessandro; Molteni, Massimo – Journal of Autism and Developmental Disorders, 2010
The PEP-R (psychoeducational profile revised) is an instrument that has been used in many countries to assess abilities and formulate treatment programs for children with autism and related developmental disorders. To the end to provide further information on the PEP-R's psychometric properties, a large sample (N = 137) of children presenting…
Descriptors: Autism, Interrater Reliability, Psychometrics, Screening Tests

Peer reviewed
Direct link
