ERIC - Search Results

Publication Date

In 2026	0
Since 2025	58
Since 2022 (last 5 years)	284
Since 2017 (last 10 years)	780
Since 2007 (last 20 years)	2042

Descriptor

Interrater Reliability	3124
Foreign Countries	655
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	231
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	180
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	86
Preschool Education	72
Junior High Schools	65
Adult Education	59
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	25
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,626 to 2,640 of 3,124 results Save | Export

Implementing a Portfolio Assessment System for Chapter 1 Program Improvement: A Case Study.

Download full text

Leitner, David; Trevisan, Mike – 1993

This paper presents findings of a case study that documented the implementation of a portfolio assessment system in response to mandated program improvement and assessed its impact on teacher and student behaviors. The sample included elementary and middle school teachers and students from three Chapter 1 schools in a rural California school…

Descriptors: Educational Assessment, Educational Improvement, Elementary Education, Evaluation Criteria

Evaluating the Efficacy of Rater Self-Training.

Download full text

Kenyon, Dorry; Stansfield, Charles W. – 1993

This paper examines whether individuals who train themselves to score a performance assessment will rate acceptably when compared to known standards. Research on the efficacy of rater self-training materials developed by the Center for Applied Linguistics for the Texas Oral Proficiency Test (TOPT) is examined. Rater self-materials are described…

Descriptors: Bilingual Education, Comparative Analysis, Evaluators, Individual Characteristics

Setting Test-Level Standards for a Performance Assessment of Physicians' Clinical Skills: A Process Investigation.

Download full text

De Champlain, Andre F.; Margolis, Melissa J.; Ross, Linette P.; Macmillan, Mary K.; Klass, Daniel J. – 1998

The purpose of the present investigation was to address several critical issues relating to setting a performance standard on a nationally administered standardized patient examination (SPX). The specific goals of the study were to: (1) compare pass/fail rates from this exercise to those of past studies undertaken with the same examination; (2)…

Descriptors: Clinical Experience, Higher Education, Interrater Reliability, Medical Education

Management Assessment.

1998

This document contains three papers from a symposium on management assessment. In "The Air Force ROTC (Reserve Officer Training Corps) Selection System as a Predictor of Leadership" (Orlando V. Griego, George A. Morgan, Gary D. Geroy), 102 ROTC cadets rated their own leadership characteristics and were rated by subordinates; leaders and…

Descriptors: Administrator Evaluation, Adult Education, Employee Attitudes, Evaluation Methods

A Comparison of Effectiveness Ratings of Selected Principals and NASSP Assessment Center Ratings.

PDF pending restoration

Yates, Beverly J. – 1991

The predictive validity of the National Association of Secondary School Principals (NASSP) assessment center evaluation process for principals is compared with the perceived effectiveness of a selected population of principals. The NASSP assessment center approach includes a case study, a personal interview, two exercises, and a scholastic…

Descriptors: Administrator Evaluation, Assessment Centers (Personnel), Case Studies, Comparative Analysis

Observer Agreement on Judgments of Bilingualism in Deaf Children.

Download full text

Seal, Brenda C. – 1991

In order to better evaluate bilingualism in deaf children, this study examined whether observers (N=37) from different backgrounds would agree on deaf children's use of either American Sign Language (ASL) or English signing. Observers represented a range of background experience in a variety of schools and programs; 6 were deaf; 31 were hearing;…

Descriptors: American Sign Language, Bilingual Students, Bilingualism, Deafness

The Effect of Computers on the Test and Inter-Rater Reliability of Writing Tests of ESL Learners

Peer reviewed
PDF on ERIC

Download full text

Aydin, Selami – Turkish Online Journal of Educational Technology - TOJET, 2006

This research aimed to investigate the effect of computers on the test and inter-rater reliability of writing test scores of ESL learners. Writing samples of 20 pen-paper and 20 computer group students were scored in analytic scoring method by two scorers, and then the scores were analyzed in Alpha (Cronbach) model. The results showed that the…

Descriptors: Foreign Countries, College Students, Computer Assisted Testing, English (Second Language)

Severity of Grading across Time Periods.

Download full text

Lunz, Mary E.; Stahl, John A. – 1990

Three examinations administered to medical students were analyzed to determine differences among severities of judges' assessments and among grading periods. The examinations included essay, clinical, and oral forms of the tests. Twelve judges graded the three essays for 32 examinees during a 4-day grading session, which was divided into eight…

Descriptors: Clinical Diagnosis, Comparative Testing, Difficulty Level, Essay Tests

Three Selected Factors Affecting Performance Assessment of Teaching Using a Research-Based Observation Instrument.

Thomson, W. Scott – 1989

Three contextual factors, the gender of the principal, the choice of subject matter used for the demonstration of competence, and number of years of teaching experience, have been shown to have an effect on the outcome of teacher evaluation. The annual evaluations of 521 elementary personnel using the Florida state-mandated single assessment were…

Descriptors: Administrator Characteristics, Elementary Education, Evaluation Criteria, Interrater Reliability

Applying Empirical Analyses to the Evaluation of Test Content.

Download full text

Sireci, Stephen G.; And Others – 1990

Although some researchers have argued against use of the term "content validity," the ability of a test item to adequately represent the domain of knowledge tested continues to be an issue of paramount importance in test construction. The present paper reviews previous analyses of test content and proposes a new empirical method for…

Descriptors: Cluster Analysis, Content Analysis, Content Validity, Evaluators

Interjudge Consensus and Intrajudge Consistency: Is It Possible To Have Both in Standard Setting?

Friedman, Charles B.; Ho, Kevin T. – 1990

Eleven judges representing 11 different geographic regions in the United States participated in a standard-setting session designed to determine the possibility of obtaining interjudge consensus and intrajudge consistency simultaneously. Each judge had experience in the field for which standards were being set. The judges rated 65 multiple-choice…

Descriptors: Evaluators, Feedback, Interrater Reliability, Licensing Examinations (Professions)

Rating Format Effects on Rater Agreement and Reliability.

Download full text

Littlefield, John H.; Troendle, G. Roger – 1986

This study compares intra- and inter-rater agreement and reliability when using three different rating form formats to assess the same stimuli. One format requests assessment by marking detailed criteria without an overall judgement; the second format requests only an overall judgement without the use of detailed criteria; and the third format…

Descriptors: Cognitive Processes, Dental Evaluation, Dental Schools, Evaluation Criteria

Rater Stringency Error in Performance Rating: A Contrast of Three Models.

Download full text

Cason, Gerald J.; Cason, Carolyn L. – 1989

The use of three remedies for errors in the measurement of ability that arise from differences in rater stringency is discussed. Models contrasted are: (1) Conventional; (2) Handicap; and (3) deterministic Rater Response Theory (RRT). General model requirements, power, bias of measures, computing cost, and complexity are contrasted. Contrasts are…

Descriptors: Ability, Achievement Rating, Error of Measurement, Evaluation Methods

Total Score Reliability in Large-Scale Writing Assessment.

Download full text

Bunch, Michael B.; Littlefair, Wendy – 1988

A total of 2,000 essays written by 1,000 students was submitted to generalizability analyses for domain-referenced tests. Each student had written one essay on each of two prompts representing two models of discourse. Each essay was read by six readers and judged on a scale of from 1 to 4. No reader read essays from both prompts. Reader agreement…

Descriptors: Cutting Scores, Essay Tests, Generalizability Theory, Interrater Reliability

Practical and Theoretical Requirements for Controlling Rater Stringency in Peer Review.

Download full text

Cason, Gerald J.; Cason, Carolyn L. – 1987

This study describes a computer based, performance rating information processing system, performance rating theory, and programs for the application of the theory to obtain ratings free from the effects of reviewer stringency in reviewing abstracts of conference papers. Originally, the Performance Rating (PR) System was used to evaluate the…

Descriptors: Abstracts, Computer Oriented Programs, Conference Papers, Data Processing

« Previous Page | Next Page »

Pages: 1 | ... | 172 | 173 | 174 | 175 | 176 | 177 | 178 | 179 | 180 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	25
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2555
Reports - Research	2243
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼