Publication Date
| Range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Descriptor | Count |
| --- | --- |
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Audience | Count |
| --- | --- |
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Location | Count |
| --- | --- |
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Rating | Count |
| --- | --- |
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Taube, Kurt T.; Newman, Larry S. – 1996
A method of estimating Rasch-model difficulty calibrations from judges' ratings of item difficulty is described. The ability of judges to estimate item difficulty was assessed by correlating estimated and empirical calibrations on each of four examinations offered by the American Association of State Social Work Boards. Thirteen members of the…
Descriptors: Correlation, Cutting Scores, Difficulty Level, Estimation (Mathematics)
Beasley, T. Mark; Leitner, Dennis W. – 1993
The L statistic of E. B. Page (1963) tests the agreement of a single group of judges with an a priori ordering of alternative treatments. This paper extends the two-group test of D. W. Leitner and C. M. Dayton (1976), itself an extension of the L test, to analyze differences in consensus between two unequally sized groups of judges. Exact critical values…
Descriptors: Comparative Analysis, Equations (Mathematics), Estimation (Mathematics), Evaluators
Micceri, Theodore; And Others – 1987
Several issues relating to agreement estimates for different types of data from performance evaluations are considered. New indices of agreement are presented for ordinal level items and for summative scores produced by nominal or ordinal level items. Two sets of empirical data illustrate the performance of the two formulas derived to estimate…
Descriptors: Correlation, Data Analysis, Educational Research, Estimation (Mathematics)
Weare, Jane; And Others – 1987
This annotated bibliography was developed upon noting a deficiency of information in the literature regarding the training of raters for establishing agreement. The ERIC descriptor, "Interrater Reliability", was used to locate journal articles. Some of the 33 resulting articles focus on mathematical concepts and present formulas for computing…
Descriptors: Annotated Bibliographies, Cloze Procedure, Correlation, Essay Tests
Littlefield, Robert S. – 1986
Comparing the manner in which contestants' scores were tabulated at both the 1985 American Forensic Association National Individual Events Tournament (AFA-NIET) and National Forensic Association Individual Events Nationals (NFA-IEN), a study (1) examined whether a correlation exists between contestants placing in the quarterfinals with five…
Descriptors: Debate, Eligibility, Interrater Reliability, Judges
Kieren, Dianne K.; Munro, Brenda – 1985
Decision making about an observational recording system for family interaction research is crucial. Alternative coding-recording methods and combinations thereof are discussed, including: (1) paper-and-pencil on-site method; (2) video-tapes; (3) paper-and-pencil and mechanical coding devices; (4) transcripts; and (5) transcripts combined with…
Descriptors: Comparative Analysis, Decision Making, Family Life, Family Problems
Peer reviewed: Magnan, Sally Sieloff – Canadian Modern Language Review, 1987
Differences between the academic (American Council on the Teaching of Foreign Languages) and government (Foreign Service Institute) versions of the oral proficiency interview test are examined, and data from two studies of interrater reliability are presented and discussed. (MSE)
Descriptors: Evaluation Methods, Interrater Reliability, Language Proficiency, Language Tests
Peer reviewed: Scott, M. M.; Hatfield, James G. – Journal of Educational Measurement, 1985
Differences in agreement between observers and analysts of naturalistic narrative data cause problems in observation research. This paper discusses the advantages and disadvantages of several possible solutions. (Author/GDC)
Descriptors: Behavioral Science Research, Data Analysis, Data Collection, Interrater Reliability
Peer reviewed: Green, Kathy – Educational and Psychological Measurement, 1985
Five sets of paired comparison judgments were made concerning test item difficulty, in order to identify the most probable source of intrasensitivity in the data. The paired comparisons method was useful in providing information about sensitivity to stimulus differences, but less useful for assessing dimensionality of judgment criteria.…
Descriptors: Adults, Difficulty Level, Evaluative Thinking, Higher Education
Peer reviewed: Tarico, Valerie S.; And Others – Journal of Counseling Psychology, 1986
Compared three methods of rating thoughts: self-rating by subjects, rating by experts with thoughts presented randomly, and rating by experts with thoughts presented in context, among 107 students who listed their thoughts prior to giving a speech. Results indicated all three methods were equal in predicting speech anxiety and performance.…
Descriptors: Anxiety, Cognitive Measurement, Cognitive Processes, Comparative Analysis
Peer reviewed: Zeren, Andrea S.; Makosky, Vivian Parker – Teaching of Psychology, 1986
Presents an in-class activity which uses videotaped television shows to teach time sampling, event sampling, and trait rating techniques. Students responded favorably to this activity, and many reported that it increased their understanding of the different observation techniques. (Author/JDH)
Descriptors: Behavior Rating Scales, Higher Education, Instructional Improvement, Interrater Reliability
Peer reviewed: Montgomery, Barbara M. – Small Group Behavior, 1986
Investigates the relative and interactive effects of rater-, ratee-, relationship-, situational-, and group-level contingencies on peer assessments of open communication. Results suggest that, given certain procedural conditions, peer assessments are highly reliable and valid. Rater bias accounted for a relatively small amount of rating…
Descriptors: College Students, Group Dynamics, Higher Education, Interaction Process Analysis
Peer reviewed: Fuqua, Dale R.; And Others – Journal of Counseling Psychology, 1984
Compares peer ratings, supervisor ratings, and self-ratings of counseling performance. Earlier studies of the relationship among performance ratings from different sources have indicated some comparability across rating sources, particularly late in the training process. These results indicated considerable variability across sources of ratings…
Descriptors: Counselor Evaluation, Counselor Performance, Counselor Training, Graduate Students
Peer reviewed: Singletary, Michael W. – Journalism Quarterly, 1985
Reports that coders were able to judge adequately the difference between immediate reward and delayed reward in news stories but not the difference between subcategories. (FL)
Descriptors: Content Analysis, Interrater Reliability, Journalism, Mass Media
Peer reviewed: O'Hara, Michael W.; Rehm, Lynn P. – Journal of Consulting and Clinical Psychology, 1983
Used the intraclass correlation coefficient to estimate the interrater reliability of judgments by clinician and novice raters of depressed females (N=20) who took the Hamilton Rating Scale for Depression (HRSD). Both expert and student raters made reliable ratings on the HRSD. Criterion validity for student raters was also satisfactory.…
Descriptors: College Students, Comparative Testing, Cost Effectiveness, Counselor Role