Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
De Houwer, Jan – Learning and Motivation, 2006
Implicit measures such as the Implicit Association Test (OAT) have recently become popular as tools in research on evaluative conditioning. The reason is that these measures are thought to be impervious to changes in valence that are due to conscious propositional knowledge about the relation between the conditioned stimulus (CS) and the…
Descriptors: Association Measures, Conditioning, Stimuli, Interrater Reliability
Hintze, John M.; Matthews, William J. – School Psychology Review, 2004
This study examined the generalizability of systematic direct observation across setting and time. Participants included 14 students from an intact inclusionary fifth grade classroom. On-task/off-task behavior was directly observed using momentary time-sampling recording, twice a day, for 10 school days. Using Generalizability (G) theory, results…
Descriptors: Grade 5, Psychometrics, Classroom Observation Techniques, Interrater Reliability
Sloutsky, Vladimir M.; Fisher, Anna V. – Journal of Experimental Psychology: General, 2006
This article is a response to E. Heit and B. K. Hayes's comment on the target article "Induction and Categorization in Young Children: A Similarity-Based Model" (V. M. Sloutsky & A. V. Fisher, 2004a). The response discusses points of agreement and disagreement with Heit and Hayes; phenomena predicted by similarity, induction, naming, and…
Descriptors: Logical Thinking, Classification, Young Children, Recognition (Psychology)
Luhmann, Christian C.; Ahn, Woo-kyoung – Psychological Review, 2005
This paper comments on the response offered by Cheng and Novick to Luhmann and Ahn's initial comments on Cheng's and Cheng and Novick's previous articles. Cheng and Novick argue that people's willingness to generalize across contexts contradicts our hypothesis. They argue that previous studies demonstrate that participants generalize their…
Descriptors: Criticism, Reader Response, Generalization, Hypothesis Testing
The Application of Interrater Reliability as a Solidification Instrument in a Phenomenological Study
Marques, Joan F.; McCall, Chester – Qualitative Report, 2005
Interrater reliability has thus far not been a common application in phenomenological studies. However, once the suggestion was brought up by a team of supervising professors during the preliminary orals of a phenomenological study, the utilization of this verification tool turned out to be vital to the credibility level of this type of inquiry,…
Descriptors: Interrater Reliability, Phenomenology, Qualitative Research, Computation
Williams, Thomas O., Jr.; Fall, Anna-Maria; Eaves, Ronald C.; Woods-Groves, Suzanne – Journal of Psychoeducational Assessment, 2006
The reliability of scores for the "Draw-A-Person Intellectual Ability Test for Children, Adolescents, and Adults" is examined with a sample of 110 college students from two universities in the southeast. The alpha coefficient for the total sample and the interscorer and intrascorer reliability for a subset of 31 students are analyzed.…
Descriptors: Reliability, Scores, College Students, Intelligence Tests
Meester-Delver, Anke; Beelen, Anita; Hennekam, Raoul; Nollet, Frans; Hadders-Algra, Mijna – Developmental Medicine & Child Neurology, 2007
The aim of this study was to determine the interrater reliability and stability over time of the Capacity Profile (CAP). The CAP is a standardized method for classifying additional care needs indicated by current impairments in five domains of body functions: physical health, neuromusculoskeletal and movement-related, sensory, mental, and voice…
Descriptors: Physical Health, Cerebral Palsy, Interrater Reliability, Interviews
Rao, Rahul – Academic Psychiatry, 2007
Objective: The multifaceted nature of training and the diverse backgrounds of potential Senior House Officers (Postgraduate Residents) require a novel approach to the selection of trainees wishing to pursue a career in psychiatry. The author reports the properties of a semi-structured interview (the SCRIPT) for assessing doctors short-listed for a…
Descriptors: Graduate Medical Education, Test Validity, Interrater Reliability, Psychiatry
Konold, Timothy R.; Pianta, Robert C. – Journal of Psychoeducational Assessment, 2007
Standardized rating scales remain the primary mechanism through which child behaviors are recorded. Despite the many advantages of such systems, the documented lack of agreement among different informants' ratings of the same child remains a pervasive problem for clinicians. This study examines the degree to which observed behavior ratings were…
Descriptors: Behavior Problems, Child Behavior, Behavior Rating Scales, Interrater Reliability
Oliver, P. C.; Crawford, M. J.; Rao, B.; Reece, B.; Tyrer, P. – Journal of Applied Research in Intellectual Disabilities, 2007
Background: Reliable measures of aggressive challenging behaviour are required if interventions aimed at reducing this behaviour among people with intellectual disability (ID) are to be formally evaluated. The present authors examined the reliability of the Modified Overt Aggression Scale (MOAS), an instrument not yet formally tested in those with…
Descriptors: Aggression, Mental Retardation, Measures (Individuals), Correlation
Heal, Nicole A.; Hanley, Gregory P. – Journal of Applied Behavior Analysis, 2007
Preschool teachers rely on several strategies for motivating children to participate in learning activities. In the current study, we evaluated the effectiveness of and preference for three teaching contexts in which embedded, sequential, or no programmed reinforcement was arranged. The embedded context included highly preferred teaching…
Descriptors: Teaching Methods, Preschool Teachers, Preschool Children, Context Effect
Nathan, Mitchell J.; Eilam, Billie; Kim, Suyeon – Journal of the Learning Sciences, 2007
Learning in a socially mediated context like a classroom places emphasis on the ability of learners to communicate their ideas to others, and for members of a class to achieve shared meaning or intersubjectivity (IS). We take a participatory view of IS, where both consensual agreement and disagreement are regarded as aspects of a common set of…
Descriptors: Social Cognition, Brain, Empathy, Mathematics
Mariano, Louis T.; Junker, Brian W. – Journal of Educational and Behavioral Statistics, 2007
When constructed response test items are scored by more than one rater, the repeated ratings allow for the consideration of individual rater bias and variability in estimating student proficiency. Several hierarchical models based on item response theory have been introduced to model such effects. In this article, the authors demonstrate how these…
Descriptors: Test Items, Item Response Theory, Rating Scales, Scoring
Bond, C.; Cole, M.; Crook, H.; Fletcher, J.; Lucanz, J.; Noble, J. – Educational Psychology in Practice, 2007
This article is an initial evaluation of a motor skills assessment for primary aged children. The Manchester Motor Skills Assessment (MMSA) is designed to be quick and easy for teaching assistants to complete, with the dual purposes of informing group programme planning and demonstrating an individual child's progress following a period of…
Descriptors: Focus Groups, Interrater Reliability, Intervention, Teaching Assistants
Titus, Janet C.; Godley, Susan H.; White, Michelle K. – Journal of Child & Adolescent Substance Abuse, 2007
Qualitative data from 923 adolescents treated in outpatient and residential settings were used to create taxonomies of their reasons for starting, continuing, and quitting use of drugs and alcohol. Three raters independently categorized reasons by dominant theme. The final sets of taxonomies were defined within several iterations and the raters'…
Descriptors: Narcotics, Interrater Reliability, Adolescents, Classification

Peer reviewed
Direct link
