Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Spooren, Pieter; Mortelmans, Dimitri – Educational Studies, 2006
The use of student evaluations of teaching performance has been an important but controversial tool in the improvement of teaching quality during the past few decades. Although student evaluations of teaching are implemented in many faculties, not everyone is convinced of the desirability and utility of these ratings. In this paper, we present the…
Descriptors: Student Attitudes, Teacher Effectiveness, Rating Scales, Student Evaluation of Teacher Performance
Apache, R. R. – Physical Educator, 2006
A behavioral assessment system for scoring the behaviors of parents and coaches at youth sports games is described within this paper. The Youth Sports Behavior Assessment System (YSBAS) contains nine behavioral categories describing behaviors commonly seen during youth sports. The developmental process of YSBAS and the observer-training program…
Descriptors: Evaluators, Training, Scoring, Parent Education
Ludtke, Oliver; Trautwein, Ulrich; Kunter, Mareike; Baumert, Jurgen – Learning Environments Research, 2006
In educational research, characteristics of the learning environment are generally assessed by asking students to evaluate features of their lessons. The student ratings produced by this simple and efficient research strategy can be analysed from two different perspectives. At the "individual level", they represent the individual student's…
Descriptors: Teacher Effectiveness, Educational Research, Student Evaluation of Teacher Performance, Classroom Environment
Cohen, Patricia; Kasen, Stephanie; Bifulco, Antonia; Andrews, Howard; Gordon, Kathy – International Journal of Behavioral Development, 2005
This methodological investigation examines the accuracy of narrative-based scaled ratings covering several post high school years. Guided narratives by young adults described developmentally relevant behaviour and context for each month between ages 17 and the mid-20s. "Prospective" narratives covered shorter time periods in three interviews…
Descriptors: Young Adults, Developmentally Appropriate Practices, Developmental Tasks, Developmental Psychology
England, Margaret; Tripp-Reimer, Toni – International Journal of Aging and Human Development, 2003
The purpose of this descriptive study was to generate information about imminent concerns of adult children that could serve as initial context for development of a meaningful framework for coping with an ongoing parent care situation. Ninety-two adult children pre-selected for self-reports of crisis were interviewed about their concerns and goals…
Descriptors: Caregivers, Coping, Interrater Reliability, Parent Child Relationship
Karnell, Michael P.; Rogus, Nicole M. – Journal of Speech, Language, and Hearing Research, 2005
Practicing clinicians frequently offer judgments about aspects of swallowing physiology rather than performing actual measurements. Little is known about the accuracy of those judgments. The purpose of this preliminary study was to explore agreement of clinicians' judgments of pharyngeal swallow response time (PSRT) with temporal measurements of…
Descriptors: Reaction Time, Physiology, Comparative Analysis, Equipment
Voaklander, Donald C.; Thommasen, Harvey V.; Michalos, Alex C. – Social Indicators Research, 2006
The objective of this study was to understand the relationship between health survey and medical chart based information. The study population consisted of adult patients (17 years of age and older) attending the Bella Coola Medical Clinic who also completed a detailed Health and Quality of Life Survey. A total of 674 adults completed the Health…
Descriptors: Rural Population, Adults, Health, Measurement Techniques
Van Moere, Alistair – Language Testing, 2006
This article investigates a group oral test as administered at a university in Japan to find if it is appropriate to use scores for higher stakes decision making. It is one component of an in-house English proficiency test used for placing students, evaluating their progress, and making informed decisions for the development of the English…
Descriptors: Foreign Countries, Generalizability Theory, Achievement Tests, English (Second Language)
Chang, Lei – 1996
It was hypothesized that, when compared to the Angoff method (W. H. Angoff, 1971), the Nedelsky method (L. Nedelsky, 1954) for standard setting had lower intrajudge inconsistency, lower cutscores, and lower cutscores especially for items presenting challenges to the judges. These hypotheses were tested and supported in a sample of 22 graduate…
Descriptors: Comparative Analysis, Cutting Scores, Difficulty Level, Distractors (Tests)
Koretz, Daniel; And Others – 1992
Since 1988, Vermont has been developing an innovative performance assessment program that relies substantially on portfolios of student work. This interim report presents findings about the reliability of scores from the first statewide implementation of the portfolio program in the 1991-92 school year. The focus is not on program impact as an…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Evaluation Utilization
Nasser, Ramzi; Carifio, James – 1993
The validation of key contextual features of algebra word problems was studied in two phases. In the first phase, five experts were asked to assess the appropriateness of the concepts in the problems and the adequacy of the assignment of the contextual features to the problems. In the second phase, construct validity was established by having 6…
Descriptors: Algebra, Analysis of Variance, Construct Validity, Context Effect
Klein, Gerald A. – 1991
In an effort to improve the management of the Georgia Department of Education innovation program, a concept paper process for selecting developmental projects was implemented in January of 1989. The first effort, in January of 1989, yielded 48 papers; the second effort, in November of 1990, yielded 56 papers. For the Novemeber effort, funding was…
Descriptors: Concept Formation, Developmental Programs, Educational Improvement, Educational Innovation
Griffin, Patrick – 1990
Results of the International English Language Testing System (IELTS) battery trials in Australia are reported. The IELTS tests of productive language skills use direct assessment strategies and subjective scoring according to detailed guidelines. The receptive skills tests use indirect assessment strategies and clerical scoring procedures.…
Descriptors: English (Second Language), Foreign Countries, Grammar, Interrater Reliability
Breland, Hunter M. – 1983
Direct assessment of writing skill, usually considered to be synonymous with assessment by means of writing samples, is reviewed in terms of its history and with respect to evidence of its reliability and validity. Reliability is examined as it is influenced by reader inconsistency, domain sampling, and other sources of error. Validity evidence is…
Descriptors: Essay Tests, Evaluation Needs, Higher Education, Interrater Reliability
Stolworthy, Reed L. – 1990
The degrees of variance among three groups of evaluators relative to their assessments of the teaching competencies of preservice teacher education students were studied. Subjects included groups of 23 and 32 undergraduates who were certified to teach by the teacher preparation program at Washburn University in Topeka (Kansas) in 1987 and in 1988,…
Descriptors: Cooperating Teachers, Elementary Secondary Education, Higher Education, Interrater Reliability

Peer reviewed
Direct link
