ERIC - Search Results

Publication Date

In 2026	0
Since 2025	58
Since 2022 (last 5 years)	284
Since 2017 (last 10 years)	780
Since 2007 (last 20 years)	2042

Descriptor

Interrater Reliability	3124
Foreign Countries	655
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	231
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	180
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	86
Preschool Education	72
Junior High Schools	65
Adult Education	59
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	25
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,806 to 2,820 of 3,124 results Save | Export

Rater Effects in Clinical Performance Ratings of Surgery Residents

Download full text

Iramaneerat, Cherdsak; Myford, Carol M. – Online Submission, 2006

A multi-faceted Rasch measurement (MFRM) approach was used to analyze clinical performance ratings of 24 first-year residents in one surgery residency program in Thailand to investigate three types of rater effects: leniency, rater inconsistency, and restriction of range. Faculty from 14 surgical services rated the clinical performance of…

Descriptors: Foreign Countries, Measures (Individuals), Job Performance, Interrater Reliability

How Good Are Our Raters? Rater Errors in Clinical Skills Assessment

Download full text

Iramaneerat, Cherdsak; Yudkowsky, Rachel – Online Submission, 2006

A multi-faceted Rasch measurement (MFRM) model was used to analyze a clinical skills assessment of 173 fourth-year medical students in a Midwestern medical school to investigate four types of rater errors: leniency, inconsistency, halo, and restriction of range. Each student performed six clinical tasks with six standardized patients (SPs), who…

Descriptors: Patients, Physical Examinations, Medical Students, Clinical Experience

Minimum Competency Standards Set by Three Divergent Groups of Raters Using Three Judgmental Procedures: Implications for Validity.

Peer reviewed

Halpin, Gerald; And Others – Educational and Psychological Measurement, 1983

Although arbitrary, whenever multiple judgmental standard-setting procedures are utilized by different groups concurrently, stability across raters can be achieved and decisions can be made in a relatively judicious manner. Greater stability across methods (Ebel, Nedelsky, Angoff) may be effected by slightly modifying the Ebel approach. (Author/PN)

Descriptors: Admission Criteria, College Entrance Examinations, Cutting Scores, Higher Education

Interdependence and Interpersonal Attraction Among Heterogeneous and Homogeneous Individuals: A Theoretical Formulation and a Meta-analysis of the Research.

Peer reviewed

Johnson, David W.; And Others – Review of Educational Research, 1983

A theoretical model is presented with a review of supportive literature to establish the conditions under which desegregation and mainstreaming will result in constructive or destructive outcomes. Meta-analysis procedures examine all the available research relevant to the model, and point toward practical intergroup procedures based on the…

Descriptors: Desegregation Effects, Disabilities, Elementary Secondary Education, Ethnic Relations

A Study in Self-Assessment: Tutor and Students' Perceptions of Performance Criteria.

Peer reviewed

Orsmond, Paul; Merry, Stephen; Reiling, Kevin – Assessment & Evaluation in Higher Education, 1997

Reports on a study of a student self-assessment method in college biology, comparing students' self-evaluation, students' peer evaluation, and the teacher's evaluation criteria. Results illustrate potential problems in making assumptions about student ability to self-evaluate but also support previous findings about the instructional usefulness of…

Descriptors: Biology, College Faculty, College Instruction, College Students

The Importance of Marking Criteria in the Use of Peer Assessment.

Peer reviewed

Orsmond, Paul; And Others – Assessment & Evaluation in Higher Education, 1996

A study comparing peer and teacher evaluations of British university biology students' (n=39) performance found such comparison misleading as a guide to the validity of peer assessment. When individual criteria were analyzed, agreement of peers and teacher ranged from 31-62%, with specific areas of the criteria prone to over- and undervaluation.…

Descriptors: Bias, Biology, College Students, Comparative Analysis

Testing the Language Proficiency of Bilingual Teachers: Arizona's Spanish Proficiency Test.

Peer reviewed

Grant, Leslie – Language Testing, 1997

Describes current procedures used for testing bilingual teachers in the United States and focuses on one means of assessment used in Arizona. Examinee questionnaire responses, teacher questionnaire responses and test section analysis all contributed evidence for validity. (33 references) (Author/CK)

Descriptors: Bilingualism, Criterion Referenced Tests, Interrater Reliability, Language Teachers

Relational Aggression, Gender, and Peer Acceptance: Invariance across Culture, Stability over Time, and Concordance among Informants.

Peer reviewed

Tomada, Giovanna; Schneider, Barry H. – Developmental Psychology, 1997

Replicated and extended American research on overt and relational aggression with Italian children. Found that peer and teacher nominations for aggression and prosocial behavior were highly stable, although with very poor concordance between them. Peer nominations for overt and relational aggression were linked to peer rejection. Boys' scores were…

Descriptors: Aggression, Bullying, Child Behavior, Children

Pupil Talk in Biology Practical Work--A Preliminary Study.

Peer reviewed

Pugh, Malcolm; Lock, Roger – Research in Science and Technological Education, 1989

The development of a framework for analyzing pupil talk is described and the reliability of scoring transcribed conversions using the framework discussed. Definitions and examples of the terms used in the framework are appended. (Author/YP)

Descriptors: Biology, Foreign Countries, Group Discussion, Interrater Reliability

Evaluating Student Field Education: An Empirical Study.

Peer reviewed

Reid, William J.; And Others – Journal of Social Work Education, 1996

In a study with 13 social work and counseling interns, field supervisors' ratings of students' field performance were compared to an independent judge's content analysis of performance. Results revealed significant correlations between the evaluations, providing evidence of validity of the supervisors' assessments. Validity may have been enhanced…

Descriptors: Evaluation Methods, Field Experience Programs, Higher Education, Interrater Reliability

Respondent Agreement in Follow-Up Studies of Graduates of Special and Regular Education Programs.

Peer reviewed

Levine, Phyllis; Edgar, Eugene – Exceptional Children, 1994

High school graduates in regular (n=280) and special education (n=223) and their parents were interviewed. Parent-student agreement percentages were high for the variables of attending postsecondary school, employment status, type of residence, marital status, and number of children. Low agreement rates were obtained for salary level, hours…

Descriptors: Disabilities, Employment, Followup Studies, Graduate Surveys

Parent and Teacher Assessment of Receptive and Expressive Language in Preschool Children with Developmental Disabilities.

Peer reviewed

Sigafoos, Jeff; Pennell, Donna – Education and Training in Mental Retardation and Developmental Disabilities, 1995

Comparison using paired t-tests of parent and teacher ratings for 16 preschool children on the Receptive-Expressive Emergent Language Scale found no significant differences between parent and teacher ratings of expressive language, but a significant difference on the receptive language subscale. However, interrater reliability was relatively low…

Descriptors: Developmental Disabilities, Expressive Language, Interrater Reliability, Language Skills

A Study of Interrater Reliability of the ACTFL Oral Proficiency Interview in Five European Languages: Data from ESL, French, German, Russian, and Spanish.

Peer reviewed

Thompson, Irene – Foreign Language Annals, 1995

Considers the interrater reliability of certified testers in five European languages, the relationship between interviewer-assigned ratings and second ratings based on audio replay, interrater reliability as a function of proficiency level, effect of different languages on interrater agreement, and interrater disagreements with regard to…

Descriptors: Audiotape Recordings, English (Second Language), Evaluators, French

How Many Fidgets in a Pretty Much: A Critique of Behavior Rating Scales for Identifying Students with ADHD.

Peer reviewed

Reid, Robert; Maag, John W. – Journal of School Psychology, 1994

Article describes behavior rating scales and the difficulties in the use of cutoff scores to identify students as Attention-Deficit Hyperactivity Disorder. Also described are how problems with interobserver agreement hamper the validity of rating scales and the subsequent conclusions that can be drawn about students' behavior. (RJM)

Descriptors: Attention Deficit Disorders, Attention Span, Behavior Rating Scales, Children

The Triple-Jump Examination as an Assessment Tool in the Problem-Based Medical Curriculum at the University of Hawaii.

Peer reviewed

Smith, Richard Merrill – Academic Medicine, 1993

A University of Hawaii study compared objective and subjective assessments of the three-step triple jump examination which tests medical students' clinical problem-solving processes. Subjects were 58 first-year students. Results found the subjective assessments were more consistent across problems of varying difficulty level than were objective…

Descriptors: Case Studies, Difficulty Level, Higher Education, Interrater Reliability

« Previous Page | Next Page »

Pages: 1 | ... | 184 | 185 | 186 | 187 | 188 | 189 | 190 | 191 | 192 | ... | 209

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2555
Reports - Research	2243
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	25
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼