Showing 2,131 to 2,145 of 3,124 results
Peer reviewed
Meier, Augustine; Boivin, Micheline – Journal of Consulting and Clinical Psychology, 1986
The Client Verbal Response Category System classifies client responses into Temporal, Directional, and Experiential categories. The categories with their subcategories are defined, interjudge reliability data are presented, and the instrument's utility in psychotherapy process research is demonstrated. Initial results indicate that the instrument is…
Descriptors: Client Characteristics (Human Services), Interrater Reliability, Psychotherapy, Research Tools
Peer reviewed
Hirsh, Hannah Rothstein; And Others – Personnel Psychology, 1986
Examined whether less experienced judges could also produce accurate estimates of the validity of cognitive tests. Shows that the estimates of less experienced judges contain less information than those of experts, but also that averaged estimates from several less experienced judges are as accurate as those obtained from small-sample empirical…
Descriptors: Cognitive Tests, Educational Experience, Interrater Reliability, Judges
Peer reviewed
Towstopiat, Olga – Contemporary Educational Psychology, 1984
The present article reviews the procedures that have been developed for measuring the reliability of human observers' judgments when making direct observations of behavior. These include the percentage of agreement, Cohen's Kappa, phi, and univariate and multivariate agreement measures that are based on quasi-equiprobability and quasi-independence…
Descriptors: Interrater Reliability, Mathematical Models, Multivariate Analysis, Observation
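Two of the measures reviewed above are simple to compute directly. Below is a minimal Python sketch of percentage of agreement and Cohen's kappa for two observers coding the same items; the data and function names are illustrative, not taken from the article.

    import numpy as np

    def percent_agreement(r1, r2):
        """Proportion of items on which two observers assign the same category."""
        return np.mean(np.asarray(r1) == np.asarray(r2))

    def cohens_kappa(r1, r2):
        """Cohen's kappa: observed agreement corrected for chance,
        kappa = (p_o - p_e) / (1 - p_e)."""
        r1, r2 = np.asarray(r1), np.asarray(r2)
        p_o = np.mean(r1 == r2)                        # observed agreement
        p_e = sum(np.mean(r1 == c) * np.mean(r2 == c)  # chance agreement from
                  for c in np.union1d(r1, r2))         # the observers' marginals
        return (p_o - p_e) / (1 - p_e)

    # Two observers coding ten behavior intervals (hypothetical data)
    obs1 = ["on", "off", "on", "on", "off", "on", "off", "on", "on", "off"]
    obs2 = ["on", "on",  "on", "on", "off", "on", "off", "off", "on", "off"]
    print(percent_agreement(obs1, obs2))   # 0.8
    print(cohens_kappa(obs1, obs2))        # ~0.58

Note that kappa falls well below raw agreement here because the two categories are unbalanced, which is exactly the chance-agreement problem kappa is designed to correct.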
Peer reviewed
Cooper, Harris – Journal of Research and Development in Education, 1985
Questions repeatedly arise about whether evaluations of research papers are systematically influenced by factors unrelated to quality, such as gender or prestige of the authors or reviewers. This study examined the reliability of reviews of proposals submitted for inclusion in the 1984 American Educational Research Association annual meeting…
Descriptors: Conference Papers, Evaluation Criteria, Interrater Reliability, Prestige
Peer reviewed
Kasambira, K. Paul – Teacher Educator, 1984
Due to grade inflation, transcripts reveal little more than the courses a student has completed. Recommendation letters have therefore become an important criterion in teacher candidate selection. Suggestions for writing a subjective recommendation are offered. (DF)
Descriptors: Academic Standards, Evaluation Methods, Interrater Reliability, Portfolios (Background Materials)
Peer reviewed
DeSanti, Roger J.; Sullivan, Vicki Gallo – Reading Psychology, 1984
Concludes that the Cloze Reading Inventory and its coding form can be reliably employed by a variety of teachers for a variety of grade levels and passages. (FL)
Descriptors: Cloze Procedure, Elementary Secondary Education, Interrater Reliability, Reading Comprehension
Wilson, F. Robert; And Others – Measurement and Evaluation in Guidance, 1984
Assessed the adequacy of reporting the use of observers in counseling research. Each sampled article was classified according to the use made of the observer, the type and length of training given the observer, and the assessments made of the observers' reliability and validity. (Author/JAC)
Descriptors: Counseling, Evaluators, Experimenter Characteristics, Interrater Reliability
Matsumura, Lindsay Clare – 2003
This report describes 4 years of research by the National Center for Research on Evaluation, Standards, and Student Testing (CRESST) on developing indicators of classroom practice that have the potential to be used in large-scale settings and that draw attention to important aspects of standards-based learning and instruction. CRESST's method was…
Descriptors: Academic Achievement, Assignments, Educational Practices, Elementary Secondary Education
Chiu, Christopher W. T. – 2000
A procedure was developed to analyze data with missing observations by extracting data from a sparsely filled data matrix into analyzable smaller subsets of data. This subdividing method, based on the conceptual framework of meta-analysis, was accomplished by creating data sets that exhibit structural designs and then pooling variance components…
Descriptors: Difficulty Level, Error of Measurement, Generalizability Theory, Interrater Reliability
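The abstract describes only the general idea, but the subdividing step can be illustrated: a sparse persons x raters score matrix is split into fully crossed blocks, one per observed pattern of raters, and each block can then be analyzed separately before variance components are pooled. The Python sketch below shows just that subdividing step; the function name and toy design are invented, not Chiu's.

    import numpy as np
    from collections import defaultdict

    def crossed_subsets(scores):
        """Split a sparse persons x raters matrix (NaN = missing) into
        fully crossed blocks, one per observed pattern of raters; every
        person in a block was scored by every rater in that block."""
        scores = np.asarray(scores, dtype=float)
        groups = defaultdict(list)
        for person, row in enumerate(scores):
            raters = tuple(np.flatnonzero(~np.isnan(row)))
            if raters:
                groups[raters].append(person)
        return {raters: scores[np.ix_(persons, raters)]
                for raters, persons in groups.items()}

    # Hypothetical sparse design: raters 0-1 scored the first two
    # persons, raters 2-3 the last two
    nan = float("nan")
    sparse = [[4, 5, nan, nan],
              [3, 3, nan, nan],
              [nan, nan, 2, 3],
              [nan, nan, 5, 4]]
    for raters, block in crossed_subsets(sparse).items():
        print(raters, block.tolist())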
Cook, Colleen – 2000
Against an historical backdrop, this paper summarizes four uses of intraclass correlation of importance to contemporary researchers in the behavioral sciences. First, it shows how the intraclass correlation coefficient can be used to adjust confidence intervals for statistical significance testing when data are intracorrelated and the independence…
Descriptors: Association (Psychology), Behavioral Sciences, Correlation, Interrater Reliability
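Of the uses surveyed in the paper, the most familiar to reliability researchers is the one-way intraclass correlation, ICC(1) = (MS_between - MS_within) / (MS_between + (k - 1) * MS_within). A minimal Python sketch of that form follows; the data are invented for illustration, and the paper itself covers other uses and forms as well.

    import numpy as np

    def icc_oneway(ratings):
        """ICC(1), the one-way random-effects intraclass correlation,
        for an n_targets x k_judges matrix of ratings."""
        ratings = np.asarray(ratings, dtype=float)
        n, k = ratings.shape
        row_means = ratings.mean(axis=1)
        # between-target and within-target mean squares from one-way ANOVA
        ms_between = k * np.sum((row_means - ratings.mean()) ** 2) / (n - 1)
        ms_within = np.sum((ratings - row_means[:, None]) ** 2) / (n * (k - 1))
        return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

    # Five essays each scored by three raters (hypothetical data)
    scores = [[4, 5, 4], [2, 2, 3], [5, 5, 5], [3, 4, 3], [1, 2, 2]]
    print(icc_oneway(scores))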
Peer reviewed
Dorn, Lorah D.; Susman, Elizabeth J.; Ponirakis, Angelo – Journal of Youth and Adolescence, 2003
Studied whether pubertal timing by self-report (SR), parent report (PR), or physical examination predicted the same aspects of adjustment and behavior problems. Findings for 52 girls, 56 boys, and their parents show that pubertal timing by SR and PR did not always provide the same level of prediction as did physical examination. (SLD)
Descriptors: Adjustment (to Environment), Adolescents, Behavior Patterns, Interrater Reliability
Peer reviewed
Walter, Richard A.; Kapes, Jerome T. – Journal of Industrial Teacher Education, 2003
To identify a procedure for establishing cut scores for National Occupational Competency Testing Institute examinations in Pennsylvania, an expert panel assessed written and performance test items for minimally competent workers. Recommendations about the number, type, and training of judges used were made. (Contains 18 references.) (SK)
Descriptors: Cutting Scores, Interrater Reliability, Occupational Tests, Teacher Competency Testing
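The abstract does not name the standard-setting method used, but panel judgments about minimally competent workers are commonly combined Angoff-style: each judge estimates, per item, the probability that a minimally competent examinee answers correctly, and the averaged per-judge sums yield a recommended cut score. The sketch below is that generic computation with hypothetical data, not necessarily the NOCTI procedure.

    import numpy as np

    # ratings[j][i]: judge j's estimated probability that a minimally
    # competent worker answers item i correctly (hypothetical data)
    ratings = np.array([
        [0.6, 0.8, 0.5, 0.9, 0.7],   # judge 1
        [0.5, 0.7, 0.6, 0.8, 0.6],   # judge 2
        [0.7, 0.9, 0.5, 0.9, 0.8],   # judge 3
    ])

    cut_score = ratings.sum(axis=1).mean()   # mean of per-judge expected scores
    print(f"Recommended cut score: {cut_score:.2f} of {ratings.shape[1]} items")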
Peer reviewed
Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1997
Describes a FORTRAN software program that calculates the probability of an observed difference between agreement measures obtained from two independent sets of raters. An example illustrates the use of the DIFFER program in evaluating undergraduate essays. (Author/SLD)
Descriptors: Comparative Analysis, Computer Software, Evaluation Methods, Higher Education
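The abstract does not reproduce DIFFER's algorithm, and the sketch below is only an illustrative analogue in Python rather than the FORTRAN program's method: the hypothesis that two independent rater pairs agree equally often can be tested with a randomization test on a simple agreement measure. Function names and data are hypothetical.

    import numpy as np

    rng = np.random.default_rng(0)

    def agreement(x, y):
        """Proportion of items given the same score by two raters."""
        return np.mean(np.asarray(x) == np.asarray(y))

    def randomization_test(pair1, pair2, n_perm=10_000):
        """Two-sided p-value for the difference in agreement between two
        independent rater pairs, each given as (ratings_a, ratings_b).
        Items, with both ratings attached, are reshuffled between pairs."""
        block1 = np.column_stack(pair1)          # items x 2 ratings
        block2 = np.column_stack(pair2)
        observed = abs(agreement(*block1.T) - agreement(*block2.T))
        pooled, n1 = np.vstack([block1, block2]), len(block1)
        hits = 0
        for _ in range(n_perm):
            rng.shuffle(pooled)                  # permute items between pairs
            d = abs(agreement(*pooled[:n1].T) - agreement(*pooled[n1:].T))
            hits += d >= observed
        return hits / n_perm

    # Hypothetical essay scores (1-5) from two independent pairs of graders
    p = randomization_test(([3, 4, 4, 2, 5, 3], [3, 4, 5, 2, 5, 3]),
                           ([1, 2, 4, 3, 5, 2], [2, 3, 4, 2, 5, 4]))
    print(p)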
Peer reviewed
Peter, Jochen; Lauf, Edmund – Journalism and Mass Communication Quarterly, 2002
Investigates how coder characteristics such as language skills, political knowledge, coding experience, and coding certainty affected inter-coder and coder-training reliability. Shows that language skills influenced both reliability types. Suggests that cross-national researchers should pay more attention to cross-national assessments of…
Descriptors: Coding, Communication Research, Evaluation Methods, Higher Education
Peer reviewed
Moore, Sulyn Elliot; Perkins, William H. – Journal of Speech and Hearing Disorders, 1990
Eighteen adult listeners assessed whether stuttering samples were authentic or simulated. Results support the concepts that the production of stuttered and nonstuttered speech disruptions is experienced as qualitatively different; that only stutterers can validly recognize the difference, and only when it occurs; and that stuttering is a…
Descriptors: Auditory Perception, Evaluation, Handicap Identification, Interrater Reliability