ERIC - Search Results

Publication Date

In 2026	0
Since 2025	58
Since 2022 (last 5 years)	284
Since 2017 (last 10 years)	780
Since 2007 (last 20 years)	2042

Descriptor

Interrater Reliability	3124
Foreign Countries	655
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	231
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	180
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	86
Preschool Education	72
Junior High Schools	65
Adult Education	59
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	25
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,191 to 2,205 of 3,124 results Save | Export

Structured Interviewing: Raising the Psychometric Properties of the Employment Interview.

Peer reviewed

Campion, Michael A.; And Others – Personnel Psychology, 1988

Proposes a highly structured six-step employment interviewing technique which includes asking the same questions, consistently administering the process to all candidates, and having an interview panel. Results of a field study of 243 job applicants using this technique demonstrated interrater reliability, predictive validity, test fairness for…

Descriptors: Employment Interviews, Interrater Reliability, Job Applicants, Measures (Individuals)

A Performance-Based Cooperating Teacher Report.

Peer reviewed

Phelps, Le Adelle; And Others – Journal of Teacher Education, 1986

A performance-based student teacher evaluation process was investigated to see if halo and leniency errors could be eliminated. Results are presented. (MT)

Descriptors: Cooperating Teachers, Evaluation Criteria, Higher Education, Interrater Reliability

A Specific Investigation of Relative Performance of Examination Markers.

Peer reviewed

Collier, Michael – Assessment and Evaluation in Higher Education, 1986

A study revealing wide variation in the grading of electronics engineering test items by different evaluators has implications for evaluator and test item selection, analysis and manipulation of grades, and the use of numerical methods of assessment. (MSE)

Descriptors: Electronics, Engineering Education, Evaluation Methods, Evaluators

The Effects of Method and Comprehensiveness of Training on the Reliability and Validity of Ratings of Counselor Empathy.

Peer reviewed

Wilson, F. Robert; Griswold, Mary Lynn – Measurement and Evaluation in Counseling and Development, 1985

Type and comprehensiveness of training were experimentally manipulated (N=128) to study their effects on the reliability and validity of rated counselor empathy. Implications for observer training are discussed. (Author)

Descriptors: College Students, Counselor Characteristics, Empathy, Interrater Reliability

The Role of Deliberation Style in Standard Setting for Licensing and Certification Examinations.

Download full text

Hertz, Norman R.; Chinn, Roberta N. – 2002

Nearly all of the research on standard setting focuses on different standard setting methods rather than the interaction of group members and the instructions given to group members. This study explored the effect of deliberation style and the requirement to reach consensus on the passing score, on rater satisfaction, and on postdecision…

Descriptors: Decision Making, Evaluation Methods, Evaluators, Interaction

A Method To Compare Rater Severity across Several Administrations.

Download full text

O'Neill, Thomas R.; Lunz, Mary E. – 1997

This paper illustrates a method to study rater severity across exam administrations. A multi-facet Rasch model defined the ratings as being dominated by four facets: examinee ability, rater severity, project difficulty, and task difficulty. Ten years of data from administrations of a histotechnology performance assessment were pooled and analyzed…

Descriptors: Ability, Comparative Analysis, Equated Scores, Interrater Reliability

An Analysis of Rater Impact on Composite Scores Using the Multifaceted Rasch Model.

Download full text

Taherbhai, Husein; Young, Michael James – 2000

This empirical study used data from the Reading: Basic Understanding section of the New Standards English Language Arts Examination. Data were collected for 3,200 high school students randomly selected from those who took the examination. The resulting sample had 16 raters who scored 200 students each, with each student rated by only 1 rater. The…

Descriptors: Evaluators, High School Students, High Schools, Interrater Reliability

Use of the Rasch IRT Model in Standard Setting: An Item Mapping Method.

Download full text

Wang, Ning; Wiser, Randall F.; Newman, Larry S. – 2001

This paper provides both logical and empirical evidence to justify the use of an item mapping method for establishing passing scores for multiple-choice licensure and certification examinations. After describing the item-mapping standard setting process, the paper discusses the theoretical basis and rationale for this newly developed method and…

Descriptors: Certification, Cutting Scores, Interrater Reliability, Item Response Theory

Two-Unit Reliability Analysis of Questionnaires Used in a Regulatory System.

Peer reviewed

Fleishman, Rachel; And Others – Evaluation Review, 1996

An interjudge reliability test was conducted to evaluate questionnaires used in the surveillance of residential care institutions in Israel. Results from 32 institutions (evaluated by two surveyor teams--one social worker and 1 nurse per team) and the variance in reliability were used to improve the questionnaires and their administration. (SLD)

Descriptors: Evaluators, Foreign Countries, Institutional Characteristics, Interrater Reliability

Content Analysis in Mass Communication: Assessment and Reporting of Intercoder Reliability.

Peer reviewed

Lombard, Matthew; Snyder-Duch, Jennifer; Bracken, Cheryl Campanella – Human Communication Research, 2002

Reviews the importance of intercoder agreement for content analysis in mass communication research. Describes several indices for calculating this type of reliability (varying in appropriateness, complexity, and apparent prevalence of use). Presents a content analysis of content analyses reported in communication journals to establish how…

Descriptors: Communication Research, Content Analysis, Higher Education, Interrater Reliability

Evaluating Family Therapy: Divergent Methods, Divergent Findings.

Peer reviewed

Kolevzon, Michael S.; And Others – Journal of Marital and Family Therapy, 1988

Employed triangulation strategy for assessing family interaction, involving family members, therapist, and coders independently viewing videotapes. Found weak agreement between paired assessments within family triad, and within therapist-coder dyad. Findings suggest that methodological and/or scaling strategies designed to maximize agreement may…

Descriptors: Counselor Attitudes, Evaluation Criteria, Evaluation Methods, Evaluation Problems

Stability and Discriminant Validity of the Adult Attachment Interview: A Psychometric Study in Young Israeli Adults.

Peer reviewed

Sagi, Abraham; And Others – Developmental Psychology, 1994

Interviewed Israeli students to assess the Adult Attachment Interview's test-retest reliability and effects of the interviewers on the interview itself. Information about subjects' memory and intellectual abilities was obtained from external sources. Found a high degree of interrater and test-retest reliabilities, irrespective of interviewers.…

Descriptors: Foreign Countries, Intelligence, Interrater Reliability, Memory

Effects of Using Two or More Standardized Patients to Simulate the Same Case on Case Means and Case Failure Rates.

Peer reviewed

Colliver, Jerry R.; And Others – Journal of Academic Medicine, 1991

Case means and case failures in performance-based medical student evaluations were examined to evaluate the consistency of ratings made by two or more standardized patients (SPs) simulating the same case. Results demonstrate a need for caution in interpreting scores obtained from a case checklist completed by multiple SPs. (Author/MSE)

Descriptors: Evaluation Methods, Higher Education, Interrater Reliability, Medical Education

Establishing the Reliability and Developmental Validity of a Neurobehavioral Assessment for Preterm Infants: A Methodological Process.

Peer reviewed

Korner, Anneliese F.; And Others – Child Development, 1991

The Neurobehavioral Assessment of the Preterm Infant instrument was developed by means of pilot, exploratory, and validation studies. The validation study tested the generalizability of results for different cohorts, test versions, hospitals, and examiners. Seven stable functions were identified: motor development; scarf sign; popliteal angle;…

Descriptors: Behavior Development, Cluster Analysis, Cohort Analysis, Interrater Reliability

Sex Bias in Student Assessment Overlooked?

Peer reviewed

Bradley, Clare – Assessment and Evaluation in Higher Education, 1993

Analysis of a study of sex bias in undergraduate student project evaluations revealed evidence of bias that was overlooked by the researchers. Research methodology and interpretation are discussed further. (MSE)

Descriptors: College Students, Higher Education, Interrater Reliability, Research Methodology

« Previous Page | Next Page »

Pages: 1 | ... | 143 | 144 | 145 | 146 | 147 | 148 | 149 | 150 | 151 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	25
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2555
Reports - Research	2243
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼