Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Woolley, Michael E.; Bowen, Gary L.; Bowen, Natasha K. – Educational and Psychological Measurement, 2006
Cognitive pretesting (CP) is an interview methodology for pretesting the validity of items during the development of self-report instruments. This article reports on the development and evaluation of a systematic method to rate self-report item validity performance utilizing CP interview text data. Five raters were trained in the application of…
Descriptors: Measurement Techniques, Validity, Pretesting, Interviews
Peer reviewedO'Neill, Thomas R. – Popular Measurement, 1999
"Adjusting for Rater Severity over Time" by T. R. O'Neill discusses the application of Rasch measurement theory to accounting for rater severity effects. (SLD)
Descriptors: Interrater Reliability
Sell, D.; John, A.; Harding-Bell, A.; Sweeney, T.; Hegarty, F.; Freeman, J. – International Journal of Language & Communication Disorders, 2009
Background: The previous literature has largely focused on speech analysis systems and ignored process issues, such as the nature of adequate speech samples, data acquisition, recording and playback. Although there has been recognition of the need for training on tools used in speech analysis associated with cleft palate, little attention has been…
Descriptors: Listening, Continuing Education, Congenital Impairments, Interrater Reliability
Schneider, Leann; Schimmack, Ulrich – Social Indicators Research, 2009
A meta-analysis of published studies that reported correlations between self-ratings and informant ratings of well-being (life-satisfaction, happiness, positive affect, negative affect) was performed. The average self-informant correlation based on 44 independent samples and 81 correlations for a total of 8,897 participants was r = 0.42 [99%…
Descriptors: Affective Behavior, Psychological Patterns, Well Being, Meta Analysis
Bradshaw, Catherine P.; Debnam, Katrina; Koth, Christine W.; Leaf, Philip – Journal of Positive Behavior Interventions, 2009
Schoolwide positive behavioral interventions and supports (SWPBIS) are becoming increasingly popular with schools across the country to help create safer learning environments for students. An important aspect of SWPBIS is the ongoing monitoring and evaluation of implementation fidelity. Although a few measures have been created to assess the…
Descriptors: Interrater Reliability, Positive Reinforcement, Behavior Modification, Program Validation
Lobbestael, Jill; Arntz, Arnoud; Harkema-Schouten, Petra; Bernstein, David – Child Abuse & Neglect: The International Journal, 2009
Objective: We conducted a comprehensive assessment of the reliability and validity of the Interview for Traumatic Events in Childhood (ITEC, Lobbestael, Arntz, Kremers, & Sieswerda, 2006), a retrospective, semi-structured interview for childhood maltreatment. The ITEC aims to yield dimensional scores for severity of experiences of different…
Descriptors: Evaluation Methods, Test Reliability, Test Validity, Sexual Abuse
Gotwals, John K.; Dunn, John G. H. – Measurement in Physical Education and Exercise Science, 2009
This article presents a chronology of three empirical studies that outline the measurement process by which two new subscales ("Doubts about Actions" and "Organization") were developed and integrated into a revised version of Dunn, Causgrove Dunn, and Syrotuik's (2002) "Sport Multidimensional Perfectionism Scale"…
Descriptors: Construct Validity, Measures (Individuals), Multidimensional Scaling, Multitrait Multimethod Techniques
Hockett, Jessica A. – Journal for the Education of the Gifted, 2009
Legislative measures designed to ensure that all students meet minimal expectations have concerned leaders in gifted education. In this current educational climate of standards and accountability, however, there is arguably greater agreement than ever before between experts and professional organizations in general education and their counterparts…
Descriptors: Curriculum Design, General Education, Gifted, Educational Indicators
Sawchuk, Stephen – Education Digest: Essential Readings Condensed for Quick Review, 2010
Most experts in the testing community have presumed that the $350 million promised by the U.S. Department of Education to support common assessments would promote those that made greater use of open-ended items capable of measuring higher-order critical-thinking skills. But as measurement experts consider the multitude of possibilities for an…
Descriptors: Educational Quality, Test Items, Comparative Analysis, Multiple Choice Tests
The Infant Motor Profile: A Standardized and Qualitative Method to Assess Motor Behaviour in Infancy
Heineman, Kirsten R.; Bos, Arend F.; Hadders-Algra, Mijna – Developmental Medicine & Child Neurology, 2008
A reliable and valid instrument to assess neuromotor condition in infancy is a prerequisite for early detection of developmental motor disorders. We developed a video-based assessment of motor behaviour, the Infant Motor Profile (IMP), to evaluate motor abilities, movement variability, ability to select motor strategies, movement symmetry, and…
Descriptors: Validity, Interrater Reliability, Infants, Evaluation Methods
Lafave, Mark; Katz, Larry; Butterwick, Dale – Advances in Health Sciences Education, 2008
Content validation of an instrument that measures student performance in OSCE-type practical examinations is a critical step in a tool's overall validity and reliability [Hopkins (1998), "Educational and Psychological Measurement and Evaluation" (8th ed.). Toronto: Allyn & Bacon]. The purpose of the paper is to outline the process…
Descriptors: Check Lists, Physical Activities, Observation, Physicians
Mechling, Linda C.; Gast, David L.; Fields, Elizabeth A. – Journal of Special Education, 2008
This study evaluated the effectiveness of a portable DVD player plus the system of least prompts (SLP) for DVD player use as a self-prompting device to teach cooking tasks to three young adults with moderate intellectual disabilities. A multiple probe design across three cooking tasks and replicated across three students was used to evaluate the…
Descriptors: Mental Retardation, Prompting, Young Adults, Cooking Instruction
Nieminen, Timo A.; Choi, Serene Hyun-Jin – International Journal of Research & Method in Education, 2008
Quantitative behaviour analysis requires the classification of behaviour to produce the basic data. This can be challenging when the theoretical taxonomy does not match observational limitations, or if a theoretical taxonomy is unavailable. Binary keys allow qualitative observation to be used to modify a theoretical taxonomy to produce a practical…
Descriptors: Developmental Disabilities, Behavioral Science Research, Classification, Identification
Nicastro, Gerilee; Moreton, Kyle M. – Assessment Update, 2008
Western Governors University (WGU) is an online competency-based university in which students demonstrate content competence through a series of assessments. Assessments most often are performance-based or objective assessments that are developed in accordance with specific content objectives. Objective assessments generally assess lower-level…
Descriptors: Evaluators, Performance Based Assessment, Interrater Reliability, Educational Objectives
Orosco, Michael J.; Swanson, H. Lee; O'Connor, Rollanda; Lussier, Cathy – Grantee Submission, 2011
English language learners (ELLs) struggle with solving word problems for a number of reasons beyond math procedures or calculation challenges. As a result, ELLs may not only need math support but also reading and linguistic support. The purpose of this study was to assess the effectiveness of a math comprehension strategy called Dynamic Strategic…
Descriptors: English Language Learners, Problem Solving, Mathematics Instruction, Intervention

Direct link
