Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Ludtke, Oliver; Trautwein, Ulrich; Kunter, Mareike; Baumert, Jurgen – Learning Environments Research, 2006
In educational research, characteristics of the learning environment are generally assessed by asking students to evaluate features of their lessons. The student ratings produced by this simple and efficient research strategy can be analysed from two different perspectives. At the "individual level", they represent the individual student's…
Descriptors: Teacher Effectiveness, Educational Research, Student Evaluation of Teacher Performance, Classroom Environment
Cohen, Patricia; Kasen, Stephanie; Bifulco, Antonia; Andrews, Howard; Gordon, Kathy – International Journal of Behavioral Development, 2005
This methodological investigation examines the accuracy of narrative-based scaled ratings covering several post high school years. Guided narratives by young adults described developmentally relevant behaviour and context for each month between ages 17 and the mid-20s. "Prospective" narratives covered shorter time periods in three interviews…
Descriptors: Young Adults, Developmentally Appropriate Practices, Developmental Tasks, Developmental Psychology
England, Margaret; Tripp-Reimer, Toni – International Journal of Aging and Human Development, 2003
The purpose of this descriptive study was to generate information about imminent concerns of adult children that could serve as initial context for development of a meaningful framework for coping with an ongoing parent care situation. Ninety-two adult children pre-selected for self-reports of crisis were interviewed about their concerns and goals…
Descriptors: Caregivers, Coping, Interrater Reliability, Parent Child Relationship
Karnell, Michael P.; Rogus, Nicole M. – Journal of Speech, Language, and Hearing Research, 2005
Practicing clinicians frequently offer judgments about aspects of swallowing physiology rather than performing actual measurements. Little is known about the accuracy of those judgments. The purpose of this preliminary study was to explore agreement of clinicians' judgments of pharyngeal swallow response time (PSRT) with temporal measurements of…
Descriptors: Reaction Time, Physiology, Comparative Analysis, Equipment
Voaklander, Donald C.; Thommasen, Harvey V.; Michalos, Alex C. – Social Indicators Research, 2006
The objective of this study was to understand the relationship between health survey and medical chart based information. The study population consisted of adult patients (17 years of age and older) attending the Bella Coola Medical Clinic who also completed a detailed Health and Quality of Life Survey. A total of 674 adults completed the Health…
Descriptors: Rural Population, Adults, Health, Measurement Techniques
Van Moere, Alistair – Language Testing, 2006
This article investigates a group oral test as administered at a university in Japan to find if it is appropriate to use scores for higher stakes decision making. It is one component of an in-house English proficiency test used for placing students, evaluating their progress, and making informed decisions for the development of the English…
Descriptors: Foreign Countries, Generalizability Theory, Achievement Tests, English (Second Language)
Chang, Lei – 1996
It was hypothesized that, when compared to the Angoff method (W. H. Angoff, 1971), the Nedelsky method (L. Nedelsky, 1954) for standard setting had lower intrajudge inconsistency, lower cutscores, and lower cutscores especially for items presenting challenges to the judges. These hypotheses were tested and supported in a sample of 22 graduate…
Descriptors: Comparative Analysis, Cutting Scores, Difficulty Level, Distractors (Tests)
Koretz, Daniel; And Others – 1992
Since 1988, Vermont has been developing an innovative performance assessment program that relies substantially on portfolios of student work. This interim report presents findings about the reliability of scores from the first statewide implementation of the portfolio program in the 1991-92 school year. The focus is not on program impact as an…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Evaluation Utilization
Nasser, Ramzi; Carifio, James – 1993
The validation of key contextual features of algebra word problems was studied in two phases. In the first phase, five experts were asked to assess the appropriateness of the concepts in the problems and the adequacy of the assignment of the contextual features to the problems. In the second phase, construct validity was established by having 6…
Descriptors: Algebra, Analysis of Variance, Construct Validity, Context Effect
Klein, Gerald A. – 1991
In an effort to improve the management of the Georgia Department of Education innovation program, a concept paper process for selecting developmental projects was implemented in January of 1989. The first effort, in January of 1989, yielded 48 papers; the second effort, in November of 1990, yielded 56 papers. For the November effort, funding was…
Descriptors: Concept Formation, Developmental Programs, Educational Improvement, Educational Innovation
Griffin, Patrick – 1990
Results of the International English Language Testing System (IELTS) battery trials in Australia are reported. The IELTS tests of productive language skills use direct assessment strategies and subjective scoring according to detailed guidelines. The receptive skills tests use indirect assessment strategies and clerical scoring procedures.…
Descriptors: English (Second Language), Foreign Countries, Grammar, Interrater Reliability
Breland, Hunter M. – 1983
Direct assessment of writing skill, usually considered to be synonymous with assessment by means of writing samples, is reviewed in terms of its history and with respect to evidence of its reliability and validity. Reliability is examined as it is influenced by reader inconsistency, domain sampling, and other sources of error. Validity evidence is…
Descriptors: Essay Tests, Evaluation Needs, Higher Education, Interrater Reliability
Stolworthy, Reed L. – 1990
The degrees of variance among three groups of evaluators relative to their assessments of the teaching competencies of preservice teacher education students were studied. Subjects included groups of 23 and 32 undergraduates who were certified to teach by the teacher preparation program at Washburn University in Topeka (Kansas) in 1987 and in 1988,…
Descriptors: Cooperating Teachers, Elementary Secondary Education, Higher Education, Interrater Reliability
Hawk, Anne W.; Cross, James Logan – 1987
This study involved the selection and adaptation of a writing assessment procedure for teachers and researchers in the Duval County Public Schools (Florida) to use in assessing changes in writing ability among elementary grade students. Through a review of the literature, four writing assessment procedures (analytic, holistic, focused holistic,…
Descriptors: Elementary Education, Elementary School Teachers, Evaluators, Holistic Evaluation
Jansen, Hans P. – 1985
The Air Force Occupational Measurement Center conducts task-based occupational surveys of Air Force specialties that include supervisor ratings on recommended training emphasis for entry-level airmen. Priorities are input to the Instructional System Development training model, which guides the development and revision of specialty training…
Descriptors: Cluster Analysis, Computer Software, Evaluation Methods, Factor Analysis

