Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Corrin, William; Lindsay, James J.; Somers, Marie-Andree; Myers, Nathan E.; Meyers, Coby V.; Condon, Christopher A.; Smith, Janell K. – National Center for Education Evaluation and Regional Assistance, 2012
This report presents the findings of a rigorous experimental impact evaluation and implementation study of one such intervention, the Content Literacy Continuum (CLC), developed by researchers at the University of Kansas Center for Research on Learning. This evaluation of CLC was conducted by three partnering organizations: REL Midwest, MDRC, and…
Descriptors: High School Students, Reading Difficulties, Reading Comprehension, Scores
Graves, Scott L., Jr.; Blake, Jamilia; Kim, Eun Sook – Journal of Early Intervention, 2012
Previous research has demonstrated that informant disagreement is common with the use of rating scales to assess problem behavior in school-age populations. However, much less is known about this phenomenon in preschool populations. This is important because the accurate assessment of problem behavior in preschool is complex due to the rapid…
Descriptors: At Risk Students, Educational Attainment, Rating Scales, Behavior Problems
De Grez, Luc; Valcke, Martin; Roozen, Irene – Active Learning in Higher Education, 2012
Assessment of oral presentation skills is an underexplored area. The study described here focuses on the agreement between professional assessment and self- and peer assessment of oral presentation skills and explores student perceptions about peer assessment. The study has the merit of paying attention to the inter-rater reliability of the…
Descriptors: Feedback (Response), Interrater Reliability, Scoring Rubrics, Speech Skills
Newton, J. Stephen; Horner, Robert H.; Todd, Anne W.; Algozzine, Robert F.; Algozzine, Kate M. – Education and Treatment of Children, 2012
Many schools have problem-solving teams that support teachers by helping identify and resolve students' academic and social problems. Although research is limited, it has been found that teams sometimes fail to implement problem-solving processes with fidelity, which may hinder the resolution of problems. We developed the Team-Initiated Problem…
Descriptors: Social Problems, Problem Solving, Workshops, Models
On the Reliability and Validity of Human and LSA-Based Evaluations of Complex Student-Authored Texts
Seifried, Eva; Lenhard, Wolfgang; Baier, Herbert; Spinath, Birgit – Journal of Educational Computing Research, 2012
This study investigates the potential of a software tool based on Latent Semantic Analysis (LSA; Landauer, McNamara, Dennis, & Kintsch, 2007) to automatically evaluate complex German texts. A sample of N = 94 German university students provided written answers to questions that involved a high amount of analytical reasoning and evaluation.…
Descriptors: Foreign Countries, Computer Software, Computer Software Evaluation, Computer Uses in Education
Di Blas, Lisa; Grassi, Michele; Luccio, Riccardo; Momente, Silvia – Assessment, 2012
The authors developed the Interpersonal Behavior Questionnaire for Children with the aim of assessing the constructs of the interpersonal circumplex model, that is, Dominance and Love and their possible combinations, via third- to fifth-grade children's self- and peer reports. In the three studies presented herein, the authors examined several…
Descriptors: Interpersonal Competence, Child Behavior, Questionnaires, Psychometrics
Benton, Stephen L.; Gross, Amy B.; Pallett, William H.; Song, Jihyun; Webster, Russell; Guo, Meixi – IDEA Center, Inc., 2011
The IDEA Feedback for Administrators system provides feedback to academic administrators about their performance of relevant administrative responsibilities and their leadership style and interpersonal characteristics. The system is based on a model of reflective practice, which is consistent with The IDEA Center's longstanding approach to…
Descriptors: Feedback (Response), Administrators, Administrator Attitudes, Administrator Evaluation
Awan, Shaheen N.; Stine, Carolyn L – Clinical Linguistics & Phonetics, 2011
The purpose of this study was to determine possible differences in voice onset time (VOT) between speakers of standard American English (AE) and Indian English (IE) in a continuous speech context. The participants were 20 AE speakers, who were native to the Northeastern Pennsylvania region, and 20 IE speakers from the Indian subcontinent who had…
Descriptors: Foreign Countries, North American English, Indians, Dialects
Xavier, Jean; Vannetzel, Leonard; Viaux, Sylvie; Leroy, Arthur; Plaza, Monique; Tordjman, Sylvie; Mille, Christian; Bursztejn, Claude; Cohen, David; Guile, Jean-Marc – Research in Autism Spectrum Disorders, 2011
The Pervasive Developmental Disorder-Not Otherwise Specified (PDD-NOS) category is a psychopathological entity few have described and is poorly, and mainly negatively, defined by autism exclusion. In order to limit PDD-NOS heterogeneity, alternative clinical constructs have been developed. This study explored the reliability and the diagnostic…
Descriptors: Socialization, Autism, Test Validity, Adjustment (to Environment)
Wanstreet, Constance E.; Stein, David S. – American Journal of Distance Education, 2011
This study investigated the small-group, learner-led discussion process in synchronous discussions. Transcripts from online chats and face-to-face discussions were analyzed within the context of the Community of Inquiry framework to examine the relationship of teaching presence, social presence, and cognitive presence to one another and for…
Descriptors: Computer Mediated Communication, Inquiry, Models, Asynchronous Communication
Cullen, Anne; Coryn, Chris L. S. – Journal of MultiDisciplinary Evaluation, 2011
Background: Since the late 1970s participatory approaches have been widely promoted to evaluate international development programs. However, there is no universal agreement of what is meant by participatory evaluation. For some evaluators, participatory evaluations involve the extensive participation of all stakeholder groups (from donor to…
Descriptors: Economic Development, Global Approach, International Programs, Program Evaluation
Thompson, Carla J. – International Journal of Special Education, 2011
An observational research study based on sensory integration theory was conducted to examine the observed impact of student selected multi-sensory experiences within a multi-sensory intervention center relative to the sustained focus levels of students with special needs. A stratified random sample of 50 students with severe developmental…
Descriptors: Sensory Integration, Intervention, Observation, Measures (Individuals)
O'Connor, Rollanda E.; Beach, Kristen D.; Sanchez, Victoria M.; Bocian, Kathleen M.; Flynn, Lindsay J. – Exceptional Children, 2015
We tested the effects of teaching reading skills through U.S. history content for 38 eighth-grade poor readers whose reading ability ranged from second-to fourth-grade levels. Half of the students received special education services, and half of the students were English language learners. Students were taught to decode multisyllabic words, learn…
Descriptors: Grade 8, Middle School Students, Reading Difficulties, Reading Improvement
Thompson, James R.; Tasse, Marc J.; McLaughlin, Colleen A. – American Journal on Mental Retardation, 2008
The interrater reliability of the Supports Intensity Scale (SIS) was investigated under the condition that interviewers had to have been trained and/or experienced in its administration and scoring. Both corrected and noncorrected Pearson's product-moment coefficients were generated to assess interinterviewer, interrespondent, and mixed interrater…
Descriptors: Interrater Reliability, Measures (Individuals), Correlation, Evaluators
Warrens, Matthijs J. – Psychometrika, 2008
This paper studies correction for chance in coefficients that are linear functions of the observed proportion of agreement. The paper unifies and extends various results on correction for chance in the literature. A specific class of coefficients is used to illustrate the results derived in this paper. Coefficients in this class, e.g. the simple…
Descriptors: Interrater Reliability, Statistical Analysis, Generalization, Mathematical Concepts

Peer reviewed
Direct link
