Publication Date
| Date range | Results |
|---|---|
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Descriptor | Results |
|---|---|
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Audience | Results |
|---|---|
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Location | Results |
|---|---|
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Rating | Results |
|---|---|
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Peer reviewed: Weider-Hatfield, Deborah; Hatfield, John D. – Communication Quarterly, 1984
Evaluates approaches to measuring reliability in interaction analysis by (1) presenting criteria for a sound reliability estimate, (2) evaluating currently used tests against these criteria, and (3) discussing application of appropriate tests to interaction data. (PD)
Descriptors: Communication Research, Evaluation Criteria, Interaction Process Analysis, Interrater Reliability
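The abstract above refers to criteria for a sound reliability estimate and to "currently used tests" without listing them in this excerpt. Purely as an illustrative aside, not drawn from the article, the sketch below computes Cohen's kappa, one widely used chance-corrected agreement index, from hypothetical codes assigned by two observers; the category labels and data are invented.

```python
# Illustrative only: Cohen's kappa for two observers coding the same utterances.
# The data and category labels below are hypothetical, not from the article.
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters over nominal codes."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    # Agreement expected by chance, based on each rater's marginal code frequencies.
    expected = sum(freq_a[c] * freq_b.get(c, 0) for c in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical interaction codes assigned by two observers to ten utterances.
coder_1 = ["question", "answer", "answer", "question", "other",
           "question", "answer", "other", "question", "answer"]
coder_2 = ["question", "answer", "question", "question", "other",
           "question", "answer", "answer", "question", "answer"]
print(round(cohens_kappa(coder_1, coder_2), 2))  # about 0.68 for this made-up data
```

Simple percent agreement is 0.80 for these invented codes; kappa comes out lower because it discounts the agreement the two observers would reach by chance given their marginal code frequencies.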
Peer reviewed: Orwin, Robert G.; Cordray, David S. – Psychological Bulletin, 1985
Identifies three sources of reporting deficiency for meta-analytic results: quality (adequacy) of publicizing, quality of macrolevel reporting, and quality of microlevel reporting. Reanalysis of 25 reports from the Smith, Glass, and Miller (1980) psychotherapy meta-analysis established two sources of misinformation: interrater reliabilities and…
Descriptors: Confidence Testing, Interrater Reliability, Meta Analysis, Psychotherapy
Miller-Whitehead, Marie – 2001
A hypothetical case study provides examples of the inter-rater reliability issues involved in complex performance assessment, focusing on the Baldrige model. A hypothetical team of five evaluators was asked to rate a Baldrige model performance assessment along the seven defined criteria or performance dimensions that comprise the Baldrige model…
Descriptors: Case Studies, Criteria, Evaluators, Interrater Reliability
Fan, Xitao; Chen, Michael – 1999
It is erroneous to extend or generalize the inter-rater reliability coefficient estimated from only a (small) proportion of the sample to the rest of the sample data, where only one rater is used for scoring, although such generalization is often made implicitly in practice. It is shown that if the inter-rater reliability estimate from part of a sample…
Descriptors: Estimation (Mathematics), Generalizability Theory, Interrater Reliability, Sample Size
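Fan and Chen's caution concerns attaching a reliability coefficient estimated from a small double-scored subsample to a full sample in which everyone else is scored by a single rater. The simulation below is a hedged illustration of one aspect of that caution, not a reconstruction of the authors' analysis: estimates from small double-scored subsamples scatter widely around the large-sample value, so reporting one of them for the whole sample conveys unwarranted precision. All distributions and sample sizes are invented.

```python
# Hedged illustration (not the authors' analysis): instability of an inter-rater
# reliability coefficient estimated from a small double-scored subsample.
import random
import statistics

random.seed(1)

def simulate_ratings(n, true_sd=10.0, error_sd=6.0):
    """Two raters score the same n essays; each adds independent random error."""
    truth = [random.gauss(50, true_sd) for _ in range(n)]
    rater_1 = [t + random.gauss(0, error_sd) for t in truth]
    rater_2 = [t + random.gauss(0, error_sd) for t in truth]
    return rater_1, rater_2

def pearson(x, y):
    """Pearson correlation, used here as the inter-rater reliability coefficient."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

# Large-sample value the small estimates should be compared against (about 0.74 here).
big_1, big_2 = simulate_ratings(50_000)
print("large-sample reliability:", round(pearson(big_1, big_2), 2))

# Coefficients from double-scored subsamples of only 30 essays scatter widely,
# which is why attaching one of them to the full, singly scored sample is risky.
print("subsample estimates (n=30):",
      [round(pearson(*simulate_ratings(30)), 2) for _ in range(5)])
```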
Michaelides, Michalis P.; Haertel, Edward H. – Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2004
There is variability in the estimation of an equating transformation because common-item parameters are obtained from responses of samples of examinees. The most commonly used standard error of equating quantifies this source of sampling error, which decreases as the sample size of examinees used to derive the transformation increases. In a…
Descriptors: Test Items, Testing, Error Patterns, Interrater Reliability
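The Michaelides and Haertel summary notes that the usual standard error of equating shrinks as the examinee sample used to derive the transformation grows. As a hedged and greatly simplified sketch, plain mean equating on invented score distributions rather than the report's common-item design, the simulation below shows the familiar pattern: quadrupling the number of examinees roughly halves the empirical standard error of the equating constant.

```python
# Hedged sketch: mean equating on invented data, to show the usual 1/sqrt(n)
# behaviour of the standard error of equating. Not the method used in the report.
import random
import statistics

random.seed(2)

def equating_constant(n):
    """Mean-equating constant (difference of form means) from n examinees per form."""
    form_x = [random.gauss(50, 10) for _ in range(n)]
    form_y = [random.gauss(52, 10) for _ in range(n)]
    return statistics.fmean(form_y) - statistics.fmean(form_x)

for n in (100, 400, 1600):
    # Empirical standard error over repeated samples of examinees.
    replications = [equating_constant(n) for _ in range(1000)]
    print(n, "examinees per form -> empirical SE", round(statistics.stdev(replications), 3))
```

In this simplified case the equating constant is just a difference of two sample means, so its standard error is close to sigma * sqrt(2 / n), which is why the printed values roughly halve each time the number of examinees quadruples.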
Peer reviewed: Kennison, Monica Metrick; Misselwitz, Shirley – Nursing Education Perspectives, 2002
Samples from 17 reflective journals of nursing students were evaluated by 6 faculty. Results indicate a lack of consistency in grading reflective writing, lack of consensus regarding evaluation, and differences among faculty regarding their view of such exercises. (Contains 26 references.) (JOW)
Descriptors: Grading, Higher Education, Interrater Reliability, Nursing Education
Peer reviewed: Maurer, Steven D.; Fay, Charles – Personnel Psychology, 1988
Examined degree to which agreement in interviewer ratings may be influenced by training, use of structured conventional interviews, or situational interviews. Results from 42 managers experienced as interviewers revealed no training effect on rating agreement; impact of situational format on consistency in assessments of applicant suitability was…
Descriptors: Administrators, Employment Interviews, Examiners, Experimenter Characteristics
Peer reviewed: Cordes, Anne K. – Journal of Speech and Hearing Research, 1994
This paper contends that behavior observation data relating to speech-language pathology are reliable if they are not affected by differences among observers or other variations in the recording context. The theoretical bases of methods used to estimate reliability for observational data are reviewed, and suggestions are provided for improving the…
Descriptors: Data Collection, Interrater Reliability, Observation, Reliability
Readers' Responses to the Rating of Non-Uniform Portfolios: Are There Limits on Portfolios' Utility?
Peer reviewed: Despain, LaRene; Hilgers, Thomas L. – WPA: Writing Program Administration, 1992
Describes readers' responses to the task of assigning scores to nonuniform portfolios of student writing. Suggests that reaching the goal of reliability in reading practices will not be easy. Concludes that writing program administrators should greet suggestions for the use of nonuniform portfolios with questioning restraint. (RS)
Descriptors: Higher Education, Interrater Reliability, Portfolios (Background Materials), Student Evaluation
Peer reviewed: Kreiman, Jody; And Others – Journal of Speech and Hearing Research, 1992
Sixteen listeners (10 expert, 6 naive) judged the dissimilarity of pairs of voices drawn from pathological and normal populations. Only parameters that showed substantial variability were perceptually salient across listeners. Results suggest that traditional means of assessing listener reliability in voice perception tasks may not be appropriate.…
Descriptors: Evaluation Methods, Individual Differences, Interrater Reliability, Perception
Peer reviewed: Tinsley, Howard E. A.; And Others – Career Development Quarterly, 1994
Describes investigation employing within-counselor design. Investigators analyzed audio recordings of career counseling interviews with clients who held either relatively negative expectations or relatively positive expectations regarding counseling. Clients who held relatively positive expectations were rated significantly higher on global…
Descriptors: Career Counseling, Expectation, Higher Education, Interrater Reliability
Peer reviewed: Ingham, Roger J.; And Others – Journal of Speech and Hearing Research, 1993
Two experiments investigating interval-by-interval interjudge and intrajudge agreement for stuttered and nonstuttered speech intervals found that training of judges could improve reliability levels; judges with relatively high intrajudge agreement also showed relatively higher interjudge agreement; and interval-by-interval interjudge agreement was…
Descriptors: Evaluation Methods, Interrater Reliability, Performance Factors, Speech Evaluation
Peer reviewed: Cox, Maureen V.; Perara, Julian – Educational Psychology: An International Journal of Experimental Educational Psychology, 1998
Devises a nine-point scale for scoring drawings of a cube. Provides detailed criteria and examples for each category. Shows that interrater reliability of the scale is high and that scores trace a linear trend across the sampled age range. Suggests that the scale is suitable for use as a diagnostic or assessment tool. (DSK)
Descriptors: Art Education, Evaluation Methods, Foreign Countries, Geometric Constructions
Peer reviewed: Dyson, Maree; Allen, Felicity; Duckett, Stephen – Evaluation and Program Planning, 2000
Reports on the interrater reliability of the Educational Needs Questionnaire (Victoria Department of Education, Australia), which was applied to 70 school-age children by their parents and 2 therapists. Results indicate that six of the subscales are reliable when evaluated by therapists and parents, but three subscales did not achieve the…
Descriptors: Children, Disabilities, Foreign Countries, Interrater Reliability
Peer reviewed: MacMillan, Peter D. – Journal of Experimental Education, 2000
Compared classical test theory (CTT), generalizability theory (GT), and multifaceted Rasch model (MFRM) approaches to detecting and correcting for rater variability using responses of 4,930 high school students graded by 3 raters on 9 scales. The MFRM approach identified far more raters as different than did the CTT analysis. GT and Rasch…
Descriptors: Generalizability Theory, High School Students, High Schools, Interrater Reliability


