Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Adelstein, David; Barbour, Michael K. – International Journal of E-Learning & Distance Education, 2016
Designers have a limited selection of K-12 online course creation standards to choose from that are not blocked behind proprietary or pay walls. For numerous institutions and states, the use of the iNACOL "National Standards for Quality Online Courses" is becoming a widely used resource. This article presents the final phase in a…
Descriptors: Instructional Design, Online Courses, National Standards, Elementary Secondary Education
Mailend, Marja-Liisa; Plante, Elena; Anderson, Michele A.; Applegate, E. Brooks; Nelson, Nickola W. – International Journal of Language & Communication Disorders, 2016
Background: As new standardized tests become commercially available, it is critical that clinicians have access to the information about a test's psychometric properties, including aspects of reliability. Aims: The purpose of the three studies reported in this article was to investigate the reliability of a new test, the Test of Integrated…
Descriptors: Standardized Tests, Psychometrics, Reliability, Language Skills
Miller, M. Elizabeth; Kwon, Sockju – Journal of Child Nutrition & Management, 2015
Purpose/Objectives: The purpose of this study was to explore milk and yogurt selection among students participating in a School Breakfast Program. Methods: Researchers observed breakfast selection of milk, juice and yogurt in six elementary and four secondary schools. Data were analyzed using descriptive statistics and logistic regression to…
Descriptors: Breakfast Programs, Food, Decision Making, Secondary Schools
Liou, Pey-Yan – Research in Science & Technological Education, 2015
Background: The nature of technology has been rarely discussed despite the fact that technology plays an essential role in modern society. It is important to discuss students' concepts of the nature of technology, and further to advance their technological literacy and adaptation to modern society. There is a need to assess high school students'…
Descriptors: Foreign Countries, Technology, Student Attitudes, Knowledge Level
Lieberman-Betz, Rebecca G. – Topics in Early Childhood Special Education, 2015
This article examined the reporting of four elements of fidelity of implementation (FOI) in parent-mediated early communication treatment studies. Thirty-five studies were reviewed to extract information regarding reporting of dosage, adherence, quality, and participant responsiveness for both practitioners and parents involved in parent-delivered…
Descriptors: Early Intervention, Oral Language, Communication Skills, Interpersonal Communication
Ho, Andrew D.; Kane, Thomas J. – Bill & Melinda Gates Foundation, 2013
For many teachers, the classroom observation has been the only opportunity to receive direct feedback from another school professional. As such, it is an indispensable part of every teacher evaluation system. Yet it also requires a major time commitment from teachers, principals, and peer observers. To justify the investment of time and resources,…
Descriptors: Observation, Teacher Evaluation, Accuracy, Reliability
de Lima, Alberto Alves; Conde, Diego; Costabel, Juan; Corso, Juan; Van der Vleuten, Cees – Advances in Health Sciences Education, 2013
Reliability estimations of workplace-based assessments with the mini-CEX are typically based on real-life data. Estimations are based on the assumption of local independence: the object of the measurement should not be influenced by the measurement itself and samples should be completely independent. This is difficult to achieve. Furthermore, the…
Descriptors: Test Reliability, Graduate Students, Medical Students, Vocational Evaluation
Ledford, Jennifer R.; Wolery, Mark; Meeker, Kathleen Artman; Wehby, Joseph H. – Journal of Behavioral Education, 2012
Collection of interobserver agreement data and reporting the results with summary statistics are standard practices in single-case research. An alternative to summary statistics is plotting the second observer's data on the same graph as the primary observer. In this study, we evaluated whether plotting the second observer's data differentially…
Descriptors: Graphs, Interrater Reliability, Graduate Students, Expertise
Ferdig, Richard E.; Pytash, Kristine E.; Kosko, Karl W.; Memis, Riza; Ryan, Kelli; Dunlosky, John – Journal of Interactive Learning Research, 2017
This study set out to examine two important aspects of the use of eWriters by early elementary students. First, it explored the impact of eWriters on literacy motivation and self efficacy of students in Pre-Kindergarten, Kindergarten, and First grade. Second, it explored if and how the technology implementation would affect parent and teacher…
Descriptors: Self Efficacy, Parent Teacher Cooperation, Elementary School Students, Preschool Education
Smith, Jane Ellen; Gianini, Loren M.; Garner, Bryan R.; Malek, Karen L.; Godley, Susan H. – Journal of Child & Adolescent Substance Abuse, 2014
This study evaluated a process for training raters to reliably rate clinicians delivering the Adolescent Community Reinforcement Approach (A-CRA) in a national dissemination project. The unique A-CRA coding system uses specific behavioral anchors throughout its 73 procedure components. Five randomly selected raters each rated "passing"…
Descriptors: Training, Evaluators, Coding, Reinforcement
Overton, Sarah; Wren, Yvonne – Child Language Teaching and Therapy, 2014
The ultimate aim of intervention for children with language impairment is an improvement in their functional language skills. Baseline and outcome measurement of this is often problematic however and practitioners commonly resort to using formal assessments that may not adequately reflect the child's competence. Language sampling,…
Descriptors: Language Impairments, Language Skills, Computer Software, Allied Health Personnel
Kokkinaki, Theano; Pratikaki, Anastasia – Early Child Development and Care, 2014
Primary objective: Research has provided evidence of the intersubjective function of imitation in grandparent-infant interaction based on the basic aspects of imitation. This lacks the systematic investigation of behaviour dynamics framing spontaneous imitation. The aim of this study was to compare the dyadic expressive behaviours (vocal, kinetic…
Descriptors: Grandparents, Video Technology, Infants, Imitation
King, Kathleen R.; Reschly, Amy L. – Journal of Psychoeducational Assessment, 2014
The purpose of this study was to evaluate and compare two behavior screening instruments--the Behavioral and Emotional Screening System and the Behavior Screening Checklist. The sample consisted of 492 elementary school children from the southeastern United States. The psychometric properties of the screening instruments were evaluated in terms of…
Descriptors: Screening Tests, Behavior, Comparative Analysis, Predictive Validity
Nicole B. Kersting; Bruce L. Sherin; James W. Stigler – Educational and Psychological Measurement, 2014
In this study, we explored the potential for machine scoring of short written responses to the Classroom-Video-Analysis (CVA) assessment, which is designed to measure teachers' usable mathematics teaching knowledge. We created naïve Bayes classifiers for CVA scales assessing three different topic areas and compared computer-generated scores to…
Descriptors: Scoring, Automation, Video Technology, Teacher Evaluation
Li, Hui – English Language Teaching, 2016
The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making

Peer reviewed
Direct link
