Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Jones, Ian; Alcock, Lara – Studies in Higher Education, 2014
Peer assessment typically requires students to judge peers' work against assessment criteria. We tested an alternative approach in which students judged pairs of scripts against one another in the absence of assessment criteria. First year mathematics undergraduates (N?=?194) sat a written test on conceptual understanding of multivariable…
Descriptors: Peer Evaluation, Evaluation Criteria, Alternative Assessment, Undergraduate Students
Riley-Ayers, Shannon; Jung, Kwanghee; Quinn, Jorie – National Institute for Early Education Research, 2014
The Kindergarten Early Learning Scale (KELS) was developed as a concise observational assessment for young children. It examines three domains including (1) Math/Science, (2) Social Emotional/Social Studies, and (3) Language and Literacy, with a total of 10 items across the domains. Scores reported for each of the 10 items are based upon…
Descriptors: Kindergarten, Early Childhood Education, Rating Scales, Student Evaluation
Hallett, Victoria; Ronald, Angelica; Colvert, Emma; Ames, Catherine; Woodhouse, Emma; Lietz, Stephanie; Garnett, Tracy; Gillan, Nicola; Rijsdijk, Fruhling; Scahill, Lawrence; Bolton, Patrick; Happé, Francesca – Journal of Child Psychology and Psychiatry, 2013
Background: Although many children with autism spectrum disorders (ASDs) experience difficulties with anxiety, the manifestation of these difficulties remains unresolved. The current study assessed anxiety in a large population-based twin sample, aged 10-15 years. Phenotypic analyses were used to explore anxiety symptoms in children with ASDs,…
Descriptors: Anxiety, Symptoms (Individual Disorders), Twins, Pervasive Developmental Disorders
Bures, Eva Mary; Barclay, Alexandra; Abrami, Philip C.; Meyer, Elizabeth J. – Canadian Journal of Learning and Technology, 2013
This study explores electronic portfolios and their potential to assess student literacy and selfregulated learning in elementary-aged children. Assessment tools were developed and include a holistic rubric that assigns a mark from 1 to 5 to self-regulated learning (SRL) and a mark to literacy, and an analytical rubric measuring multiple…
Descriptors: Portfolio Assessment, Electronic Publishing, Elementary School Students, Literacy
Isaacs, Talia; Thomson, Ron I. – Language Assessment Quarterly, 2013
This mixed-methods study examines the effects of rating scale length and rater experience on listeners' judgments of second-language (L2) speech. Twenty experienced and 20 novice raters, who were randomly assigned to 5-point or 9-point rating scale conditions, judged speech samples of 38 newcomers to Canada on numerical rating scales for…
Descriptors: Foreign Countries, Adults, Second Language Learning, English (Second Language)
Strickland, Tricia K.; Maccini, Paula – Learning Disabilities: A Multidisciplinary Journal, 2013
The current study focuses on the effects of incorporating multiple visual representations on students' conceptual understanding of quadratic expressions embedded within area word problems and students' procedural fluency of transforming quadratic expressions in standard form to factored-form and vice versa. The intervention included the…
Descriptors: Mathematics Instruction, Algebra, Secondary School Mathematics, Learning Disabilities
Kelcey, Ben; Carlisle, Joanne F. – Reading Research Quarterly, 2013
The purpose of this study is to contribute to efforts to improve methods for gathering and analyzing data from classroom observations in early literacy. The methodological approach addresses current problems of reliability and validity of classroom observations by taking into account differences in teachers' uses of instructional actions (e.g.,…
Descriptors: Data Collection, Data Analysis, Emergent Literacy, Reliability
Scharf, Davida – ProQuest LLC, 2013
Purpose: The goal of the study was to test an intervention using a brief essay as an instrument for evaluating higher-order information literacy skills in college students, while accounting for prior conditions such as socioeconomic status and prior academic achievement, and identify other predictors of information literacy through an evaluation…
Descriptors: Information Literacy, Intervention, Student Evaluation, College Students
Matsugu, Sawako – ProQuest LLC, 2013
Understanding the sources of variance in speaking assessment is important in Japan where society's high demand for English speaking skills is growing. Three challenges threaten fair assessment of speaking. First, in Japanese university speaking courses, teachers are typically the only raters, but teachers' knowledge of their students may unfairly…
Descriptors: Foreign Countries, Oral Language, English (Second Language), Second Language Learning
Sheehan, Dwayne P.; Lafave, Mark R.; Katz, Larry – Measurement in Physical Education and Exercise Science, 2011
This study was designed to test the intra- and inter-rater reliability of the University of North Carolina's Balance Error Scoring System in 9- and 10-year-old children. Additionally, a modified version of the Balance Error Scoring System was tested to determine if it was more sensitive in this population ("raw scores"). Forty-six…
Descriptors: Elementary School Students, Interrater Reliability, Scoring, Raw Scores
Hudson, Shawna S.; Lewis, Tim; Stichter, Janine P.; Johnson, Nanci W. – Journal of Emotional and Behavioral Disorders, 2011
Corresponding with the 30th year following the passage of P.L. 94-142, a task force of the Council for Exceptional Children Division of Research published a set of articles outlining quality indicators (QIs) for research and evidence-based practice in special education. This article details the operationalization and application of QIs for…
Descriptors: Behavior Disorders, Special Education, Teaching Methods, Emotional Disturbances
Marrus, Natasha; Faughn, Carley; Shuman, Jeremy; Petersen, Steve E.; Constantino, John N.; Povinelli, Daniel J.; Pruett, John R., Jr. – Journal of the American Academy of Child & Adolescent Psychiatry, 2011
Objective: Comparative studies of social responsiveness, an ability that is impaired in autism spectrum disorders, can inform our understanding of both autism and the cognitive architecture of social behavior. Because there is no existing quantitative measure of social responsiveness in chimpanzees, we generated a quantitative, cross-species…
Descriptors: Animals, Social Behavior, Interrater Reliability, Measures (Individuals)
Gresham, Frank M. – School Psychology Review, 2011
The author was favorably impressed with the breadth, scope, and quality of the articles in this issue that dealt with the various aspects and correlates of social behavioral functioning as well as assessment and intervention considerations. Each of these articles dealt with a unique aspect of social behavioral functioning in children and youth and…
Descriptors: Intervention, School Psychologists, Social Behavior, Evaluation Methods
Drost, Ellen A. – Education Research and Perspectives, 2011
In this paper, the author aims to provide novice researchers with an understanding of the general problem of validity in social science research and to acquaint them with approaches to developing strong support for the validity of their research. She provides insight into these two important concepts, namely (1) validity; and (2) reliability, and…
Descriptors: Social Science Research, Validity, Reliability, Measurement Techniques
van Oorsouw, Wietske M. W. J.; Embregts, Petri J. C. M.; Sohier, Jody – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
It is common to use questionnaires and interviews to assess the emotions of staff who serve clients with intellectual disabilities. Remarkably, observations of actual staff behaviour and assessments of nonverbal expressions are usually not involved. In the present study, we have made a first start in the development of an observation instrument…
Descriptors: Expertise, Nonverbal Communication, Observation, Mental Retardation

Peer reviewed
Direct link
