NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 2,656 to 2,670 of 3,124 results Save | Export
Peer reviewed Peer reviewed
Yarbrough, Cornelia; And Others – Bulletin of the Council for Research in Music Education, 1994
Reports on a study of 614 experienced music teachers, non-music teachers, college-level music students, and non-music students on the effect of sequential patterns and different modes of presentation of music teaching. Finds that experienced teachers' evaluations were significantly higher than those of university students. (CFR)
Descriptors: Educational Strategies, Evaluative Thinking, Evaluators, Higher Education
Peer reviewed Peer reviewed
Shaw, Darlene L.; And Others – Academic Medicine, 1995
A study found that interviewers of medical school applicants (n=471) were influenced in their ratings of applicants' noncognitive attributes by grade point average and Medical College Admission Test scores, when available, and by gender and race in accordance with affirmative action goals. Only moderate reliability across interviewers was found.…
Descriptors: Affirmative Action, College Admission, College Applicants, Higher Education
Peer reviewed Peer reviewed
Albanese, Mark A. – Academic Medicine, 1991
A study compared student and trained observer ratings of 15 high-rated and 15 low-rated lecturers in a multi-instructor medical course to identify distinguishing delivery characteristics. Student ratings were stable over three years; trained observers discriminated between students' highest- and lowest-rated lecturers. Voice presentation was the…
Descriptors: Faculty Evaluation, Higher Education, Interrater Reliability, Medical Education
Peer reviewed Peer reviewed
Ball, Martin J.; And Others – Journal of Communication Disorders, 1991
This study investigated two pragmatic profiles (the Pragmatic Profile and the Profile of Communicative Appropriateness) used to assess the language of two aphasic patients. The study examined interscorer reliability, scoring sensitivity, and diagnostic accuracy. Findings indicate that training in scoring these profiles must be uniform, and greater…
Descriptors: Adults, Aphasia, Behavior Rating Scales, Communication Disorders
Peer reviewed Peer reviewed
Pippin, David J.; Feil, Philip – Journal of Dental Education, 1992
Two studies investigated interrater agreement among 10 clinical dental examiners who scored residual subgingival calculus after student scaling on 4,160 real and 92 manikin tooth surfaces. Interrater reliability was low. Results suggest a need in periodontics for effective examiner calibration methods and objective subgingival calculus detection…
Descriptors: Dental Evaluation, Dental Schools, Dentistry, Evaluation Methods
Peer reviewed Peer reviewed
Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991
Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)
Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback
Peer reviewed Peer reviewed
Fehrmann, Melinda L.; And Others – Educational and Psychological Measurement, 1991
Two frame-of-reference rater training approaches were compared for effects on reliability and accuracy of cutoff scores generated by 21 raters using Angoff methods on tests taken by 155 undergraduates. Both approaches result in higher interrater reliability and more accuracy than does a non-frame-of-reference method. (SLD)
Descriptors: Cutting Scores, Evaluators, Generalizability Theory, Higher Education
Peer reviewed Peer reviewed
McCrae, Robert R. – Multivariate Behavioral Research, 1993
To assess cross-observer agreement on personality profiles, an Index of Profile Agreement and an associated coefficient are proposed that take into account both the difference between the ratings and the extremes of their mean. Data from the Revised NEO Personality Inventory for 250 peer ratings/self-reports and 68 spouse ratings/self-reports…
Descriptors: Adults, Comparative Analysis, Equations (Mathematics), Evaluation Methods
Peer reviewed Peer reviewed
Goldman, Ronald L. – Evaluation and the Health Professions, 1994
A meta-analysis of studies examining the interrater reliability of the standard practice of peer assessments of quality of care was conducted through the use of several databases. The mean weighted kappa of 21 findings from 13 studies was 0.31, which suggests that the interrater reliability of peer assessment is limited. (SLD)
Descriptors: Databases, Evaluation Methods, Health Services, Interrater Reliability
Peer reviewed Peer reviewed
Otto, Istvan – TESOL Quarterly, 1998
Discusses the rationale for, and end results of, a small-scale study of the effects of learners' creativity on language learning. The study was part of an ongoing, larger-scale project on individual differences at Eotvos University in Budapest, Hungary. (Author/VWL)
Descriptors: Academic Achievement, Cognitive Ability, Creativity, English (Second Language)
Peer reviewed Peer reviewed
Taylor, Roy E.; Davidson, Fred – World Englishes, 1996
This article cautions against complacency in "subjective" assessment, arguing that even tests designed to reflect the development of learner-centered, interactive and communicative approaches to teaching English may have cultural bias built into their assessment criteria. The reply article singles out as an unresolved issue whether or…
Descriptors: Cultural Context, English, Error of Measurement, Ethnic Groups
Peer reviewed Peer reviewed
Freeman, Mark – Assessment & Evaluation in Higher Education, 1995
A study compared evaluations of undergraduate students' oral presentations by groups of peers and faculty teams. Students (n=210), in assessment teams, rated quality of content and presentation using a 22-point guide. Comparison of peer and faculty scores showed no significant difference in averages but did show significant differences in standard…
Descriptors: College Students, Comparative Analysis, Evaluation Criteria, Evaluation Methods
Peer reviewed Peer reviewed
Oren, Thomas A.; Ruhl, Kathy L. – Early Childhood Education Journal, 2000
Investigated the reliability and item appropriateness, as discerned by adults affiliated with an infant center, of the Caregiver-Environment Scale (CES). Found the CES to be an easy to use, reliable instrument for evaluation. (Author/SD)
Descriptors: Caregiver Child Relationship, Child Caregivers, Child Development, Day Care
Peer reviewed Peer reviewed
Erdosy, M. Usman – International Journal of English Studies, 2001
Describes how some background factors influenced the way in which one experienced rater dealt with a number of operations involved in setting up and applying scoring criteria in the assessment of 60 Test of English as a Foreign Language essays. Implications are drawn for both future research into interrater variability and for rater training.…
Descriptors: English (Second Language), Evaluation Criteria, Interrater Reliability, Language Research
Manzo, Kathleen Kennedy – Education Week, 2005
This paper reports the results of a special urban study of the 2005 National Assessment of Education Progress which indicates that city school districts may be seeing some payoff from years of work to improve mathematics instruction. However, similar initiatives to raise reading achievement have not led to significant gains. While, most of the 11…
Descriptors: Urban Schools, School Districts, National Competency Tests, Mathematics Achievement
Pages: 1  |  ...  |  174  |  175  |  176  |  177  |  178  |  179  |  180  |  181  |  182  |  ...  |  209