Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Glad, Johan; Kottorp, Anders; Jergeby, Ulla; Gustafsson, Carina; Sonnander, Karin – Research on Social Work Practice, 2014
Objectives: The aim of this pilot study was to explore psychometric properties of two versions of the Home Observation for Measurement of the Environment Inventory in a Swedish social service sample. Method: Social workers employed at 22 Swedish child protection agencies participated in the data collection. Both classic test theory approaches and…
Descriptors: Psychometrics, Item Response Theory, Foreign Countries, Social Services
Drogan, Robin R.; Kern, Lee – Topics in Early Childhood Special Education, 2014
A significant number of young children exhibit challenging behaviors in preschool settings. A tiered framework of intervention has documented effectiveness in elementary and secondary schools, and recently has been extended to preschool settings. Although there is emerging research to support the effectiveness of Tier 1 (universal) and Tier 3…
Descriptors: Intervention, Behavior Problems, Preschool Children, Child Development
Young, Karen; James, Kimberley; Noy, Sue – Asia-Pacific Journal of Cooperative Education, 2016
Work integrated learning (WIL) educators using reflective practice to facilitate student learning require a set of standards that works within the traditional assessment frame of Higher Education, to ascertain the level at which reflective practice has been demonstrated. However, there is a paucity of tested assessment instruments that provide…
Descriptors: Work Experience Programs, Reflection, Student Evaluation, Scoring Rubrics
Nan, Joshua Kin-man; Hinz, Lisa D. – Art Therapy: Journal of the American Art Therapy Association, 2012
Assessment is the foundation for conceptualizing effective interventions. Due to their nonverbal nature, art therapy assessments have an advantage over traditional verbal assessments in some populations and potentially across cultures. This pilot study provides preliminary reliability data to support the cross-cultural use of the Formal Elements…
Descriptors: Foreign Countries, Art Therapy, Measures (Individuals), Freehand Drawing
Artman, Kathleen; Wolery, Mark; Yoder, Paul – Remedial and Special Education, 2012
Most investigators using single-case experimental designs use interobserver agreement (IOA) checks to enhance the credibility of the collected data, and they report the results of those assessments using percentage of agreement estimates. An alternative is to graph both observers' records of the measured behavior on the primary study graphs. Such…
Descriptors: Research Methodology, Inspection, Interrater Reliability, Research Design
Saxton, Emily; Belanger, Secret; Becker, William – Assessing Writing, 2012
The purpose of this study was to investigate the intra-rater and inter-rater reliability of the Critical Thinking Analytic Rubric (CTAR). The CTAR is composed of 6 rubric categories: interpretation, analysis, evaluation, inference, explanation, and disposition. To investigate inter-rater reliability, two trained raters scored four sets of…
Descriptors: Scoring Rubrics, Critical Thinking, Interrater Reliability, Performance Based Assessment
Daigneault, Pierre-Marc; Jacob, Steve; Tremblay, Joel – Evaluation Review, 2012
Background: Stakeholder participation is an important trend in the field of program evaluation. Although a few measurement instruments have been proposed, they either have not been empirically validated or do not cover the full content of the concept. Objectives: This study consists of a first empirical validation of a measurement instrument that…
Descriptors: Stakeholders, Participation, Program Evaluation, Measurement Techniques
Larsen, Linda; Kohnen, Saskia; Nickels, Lyndsey; McArthur, Genevieve – Australian Journal of Learning Difficulties, 2015
Children who have difficulty learning to read are at increased risk for academic failure, poor self-esteem, anxiety and depression, and unemployment. To help reduce these risks, it is important to identify and treat weaknesses in a child's reading as early as possible. The aim of this study was to develop a valid and reliable comprehensive…
Descriptors: Phoneme Grapheme Correspondence, Reading Tests, Standardized Tests, Test Reliability
Ohtake, Yoshihisa – International Journal of Disability, Development and Education, 2015
The present pilot study investigated the impact of video hero modelling (VHM) on the daily living skills of an elementary-aged student with autism spectrum disorder. The VHM, in which a character much admired by the student exhibited a correct response, was shown to the participant immediately before the situation where he needed to exhibit the…
Descriptors: Foreign Countries, Daily Living Skills, Video Technology, Elementary School Students
Hayes, Suzanne; Smith, Sedef Uzuner; Shea, Peter – Online Learning, 2015
As the pivotal role of self-regulation has been widely accepted in online learning literature, much interest is focused on identifying pedagogical strategies to help foster regulatory behaviors in online learners. The authors of this article argue that the learning presence (LP) construct, a recently proposed addition to the Community of Inquiry…
Descriptors: Online Courses, Metacognition, Communities of Practice, Role
Hale, Chris C. – Language Testing in Asia, 2015
Student self-assessment has been heralded as a way of increasing student ownership of the learning process, enhancing metacognitive awareness of their learning progress as well as promoting learner autonomy. In a university setting, where a major aim is to promote critical thinking and attentiveness to one's responsibility in an academic…
Descriptors: Self Evaluation (Individuals), Learning Processes, Metacognition, Personal Autonomy
Camilleri, Bernard; Botting, Nicola – International Journal of Language & Communication Disorders, 2013
Background: Children's low scores on vocabulary tests are often erroneously interpreted as reflecting poor cognitive and/or language skills. It may be necessary to incorporate the measurement of word-learning ability in estimating children's lexical abilities. Aims: To explore the reliability and validity of the Dynamic Assessment of…
Descriptors: Receptive Language, Vocabulary, Language Tests, Test Reliability
Bernfeld, L. Elizabeth Shirley; Morrison, Timothy G.; Sudweeks, Richard R.; Wilcox, Brad – Literacy Research and Instruction, 2013
The purpose of this study was to rate oral retellings of fifth graders to determine how passages, raters, and rating occasions affect those ratings, and to identify what combination of those elements produces reliable retelling ratings. A group of 36 fifth grade students read and orally retold three contemporary realistic fiction passages. Two…
Descriptors: Elementary School Students, Grade 5, Story Telling, Reading Comprehension
Johnson, Sandra – Research Papers in Education, 2013
For a number of reasons, increasing reliance is being placed on teacher assessment in high-stakes contexts in many countries around the world. Simultaneously, countries that have for some time relied to greater or lesser degrees on teacher assessment for high-stakes purposes are in the process of questioning the validity of that reliance. In…
Descriptors: Reliability, Student Evaluation, High Stakes Tests, Evidence
Meacham, Paul Douglas, Jr. – ProQuest LLC, 2013
The purpose of this study was to explore the effect of instrument-specific rater training on interrater reliability (IRR) and counseling skills performance differentiation. Strong IRR is of primary concern to effective program evaluation (McCullough, Kuhn, Andrews, Valen, Hatch, & Osimo, 2003; Schanche, Nielsen, McCullough, Valen, &…
Descriptors: Counselor Training, Interrater Reliability, Measures (Individuals), Counseling Techniques

