Publication Date
| Date Range | Records |
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Descriptor | Records |
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Audience | Records |
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Location | Records |
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Rating | Records |
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does Not Meet Standards | 3 |
Peer reviewed: Serlin, Ronald C.; Marascuilo, Leonard A. – Journal of Educational Statistics, 1983
Two alternatives to the problems of conducting planned and post hoc comparisons in tests of concordance and discordance for G groups of judges are examined. The two models are illustrated using existing data. (Author/JKS)
Descriptors: Attitude Measures, Comparative Analysis, Interrater Reliability, Mathematical Models
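For readers unfamiliar with concordance measures, Kendall's coefficient of concordance W is the usual starting point for tests of agreement among G judges; the sketch below is an illustration of the basic statistic only (no tie correction), not the planned/post hoc comparison procedures the paper develops, and the function name and example data are invented.

```python
import numpy as np

def kendalls_w(ratings):
    """Kendall's W for an (m judges x n items) matrix of ratings (no tie correction)."""
    ratings = np.asarray(ratings, dtype=float)
    m, n = ratings.shape
    # Convert each judge's ratings to within-judge ranks 1..n
    ranks = np.apply_along_axis(lambda r: r.argsort().argsort() + 1, 1, ratings)
    rank_sums = ranks.sum(axis=0)
    # Sum of squared deviations of item rank sums from their mean
    s = ((rank_sums - rank_sums.mean()) ** 2).sum()
    return 12 * s / (m ** 2 * (n ** 3 - n))

# Three judges whose rank orders agree perfectly give W = 1.0
print(kendalls_w([[1, 2, 3, 4], [10, 20, 30, 40], [0.1, 0.2, 0.3, 0.4]]))  # → 1.0
```

W ranges from 0 (no agreement) to 1 (identical rank orders across all judges).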
Peer reviewed: Harvey, Robert J.; Hayes, Theodore L. – Personnel Psychology, 1986
Showed that reliabilities in the .50 range can be obtained when raters rule out only 15-20% of the items on the Position Analysis Questionnaire as "Does Not Apply" and respond randomly to the remainder. (Author/ABB)
Descriptors: Interrater Reliability, Job Analysis, Monte Carlo Methods, Occupational Information
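The effect the authors report can be reproduced in spirit with a small Monte Carlo sketch. All parameters here are assumptions for illustration (a 187-item instrument, an 18% shared "Does Not Apply" rate, random 1-5 ratings elsewhere), not the authors' exact design:

```python
import numpy as np

rng = np.random.default_rng(0)
n_items, n_pairs = 187, 500  # assumed: PAQ-sized instrument, 500 simulated rater pairs

r_values = []
for _ in range(n_pairs):
    # Both raters rule out the same ~18% of items as "Does Not Apply" (scored 0)
    dna = rng.random(n_items) < 0.18
    # ...and respond randomly (1-5) to everything else, independently of each other
    a = np.where(dna, 0, rng.integers(1, 6, n_items))
    b = np.where(dna, 0, rng.integers(1, 6, n_items))
    r_values.append(float(np.corrcoef(a, b)[0, 1]))

print(round(float(np.mean(r_values)), 2))
```

Under these assumptions the average interrater correlation lands in roughly the .4-.5 range, echoing the paper's point: agreement on which items do not apply can by itself manufacture apparently respectable reliability.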
Peer reviewed: Conger, Rand D.; And Others – Journal of Marriage and the Family, 1986
Examined the comparability of three techniques that are used to assess the dependability of family observational measures: analyses of observer agreement, reliability, and generalizability. Results indicated no single evaluative technique will always be most conservative in estimating the quality of observations. Suggests that multiple assessments…
Descriptors: Family Involvement, Generalization, Interrater Reliability, Measurement Techniques
Peer reviewed: O'Sullivan, Sean; And Others – Journal of Marital and Family Therapy, 1984
Explores the reliability of the categories used to describe family structure in structural family therapy. Five clinicians independently rated three initial conjoint family interviews. Results are discussed in terms of their demonstration of the utility of the structural nomenclature, some conceptual problems in the structural nomenclature, and…
Descriptors: Cocounseling, Family Counseling, Family Problems, Family Structure
Peer reviewed: Morris, Woodrow W.; Boutelle, Sandra – Gerontologist, 1985
Examines the feasibility of making multidimensional functional assessments among 22 older persons by using a questionnaire. Analysis of ratings and objective scores suggests that among relatively independent, well elderly individuals, self-administered assessment should be the mode of choice. Clinical and survey research applications are…
Descriptors: Interrater Reliability, Older Adults, Research Methodology, Scoring
Peer reviewed: Goodwin, Laura D.; Goodwin, William L. – Evaluation and the Health Professions, 1984
The views of prominent qualitative methodologists on the appropriateness of validity and reliability estimation for the measurement strategies employed in qualitative evaluations are summarized. A case is made for the relevance of validity and reliability estimation. Definitions of validity and reliability for qualitative measurement are presented…
Descriptors: Evaluation Methods, Experimenter Characteristics, Interrater Reliability, Reliability
Peer reviewed: Cornelius, Edwin T.; And Others – Personnel Psychology, 1984
Questions the observed correlation between job experts and naive raters using the Position Analysis Questionnaire (PAQ) and replicates the Smith and Hakel (1979) study with college students (N=39). Concludes that PAQ ratings from job experts and college students are not equivalent and therefore not interchangeable. (LLL)
Descriptors: College Students, Higher Education, Interrater Reliability, Job Analysis
van der Linden, Wim J.; Vos, Hans J.; Chang, Lei – 2000
In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of performance. Methods to check standard setting…
Descriptors: Interrater Reliability, Judges, Probability, Standard Setting
De Champlain, Andre F.; Gessaroli, Marc E.; Floreck, Lisa M. – 2000
The purpose of this study was to estimate the extent to which recording variability among standardized patients (SPs) has an impact on classification consistency with data sets simulated to reflect performances on a large-scale clinical skills examination. SPs are laypersons trained to portray patients in clinical encounters (cases) and to record…
Descriptors: Classification, Interrater Reliability, Licensing Examinations (Professions), Medical Education
Peer reviewed: Sandburg, Jorgen – Higher Education Research and Development, 1997
Argues that interrater reliability, as traditionally used in phenomenographic research, is inadequate for establishing the reliability of research results: it does not take into account the researcher's procedures for achieving fidelity to the individuals' conceptions investigated, and use of interrater reliability based on objectivist epistemology…
Descriptors: Educational Research, Epistemology, Interrater Reliability, Qualitative Research
Peer reviewed: Lewis, Chad T.; Stevens, Cynthia Kay – Public Personnel Management, 1990
A total of 204 business students organized in committees evaluated jobs for accountability, knowledge and skills, and mental demands. The same position was rated more highly when held by a male rather than a female, regardless of whether the committee was predominantly male or female. The importance of anonymity of job holders when conducting job…
Descriptors: College Students, Interrater Reliability, Job Analysis, Sex Bias
Peer reviewed: Umesh, U. N.; And Others – Educational and Psychological Measurement, 1989
An approach is provided for calculating maximum values of the Kappa statistic of J. Cohen (1960) as a function of observed agreement proportions between evaluators. Separate calculations are required for different matrix sizes and observed agreement levels. (SLD)
Descriptors: Equations (Mathematics), Evaluators, Heuristics, Interrater Reliability
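The idea of a marginal-constrained maximum for kappa can be sketched as follows: given the two raters' marginal proportions, the best possible diagonal of the agreement matrix is the category-wise minimum of the two marginals. This is a minimal illustration, not the authors' published calculations, and the confusion matrix is made up.

```python
import numpy as np

def kappa_and_max(confusion):
    """Cohen's kappa plus the maximum kappa attainable given the observed marginals."""
    m = np.asarray(confusion, dtype=float)
    n = m.sum()
    po = np.trace(m) / n                        # observed agreement
    row, col = m.sum(axis=1) / n, m.sum(axis=0) / n
    pe = float(row @ col)                       # chance agreement from marginals
    p_max = float(np.minimum(row, col).sum())   # best diagonal the marginals allow
    return (po - pe) / (1 - pe), (p_max - pe) / (1 - pe)

# Hypothetical 2x2 agreement matrix for two raters
k, k_max = kappa_and_max([[20, 5], [10, 15]])  # k = 0.4, k_max = 0.8
```

Reporting k alongside k_max shows how much of the shortfall from perfect agreement is even achievable once the raters' differing base rates are taken into account.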
Peer reviewed: Cordes, Anne K.; Ingham, Roger J. – Journal of Speech and Hearing Research, 1994
This paper reviews the prominent concepts of the stuttering event and concerns about the reliability of stuttering event measurements, specifically interjudge agreement. Recent attempts to resolve the stuttering measurement problem are reviewed, and the implications of developing an improved measurement system are discussed. (Author/JDD)
Descriptors: Data Collection, Interrater Reliability, Measurement Techniques, Observation
Peer reviewed: Marcoulides, George A.; Simkin, Mark G. – Journal of Education for Business, 1995
Each paper written by 60 sophomores in computer classes received 3 peer evaluations using a structured evaluation process. Overall, students were able to grade efficiently and consistently in terms of overall score and selected criteria (subject matter, content, and mechanics). (SK)
Descriptors: Higher Education, Interrater Reliability, Peer Evaluation, Undergraduate Students
Peer reviewed: Driessen, Marie-Jose; And Others – Occupational Therapy Journal of Research, 1995
Two occupational therapists in an interrater test and 9 in an intrarater test used a form based on the International Classification of Impairments, Disabilities, and Handicaps to evaluate 50 patients in a psychiatric hospital and 50 in a rehabilitation center. Based on percentage of agreement and Cohen's kappa, the reliability of the diagnoses was…
Descriptors: Clinical Diagnosis, Disabilities, Interrater Reliability, Occupational Therapy
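The two statistics this study relies on, percentage of agreement and Cohen's kappa, can be computed from two raters' category labels as in this minimal sketch (the rating data are invented, not the study's):

```python
from collections import Counter

def agreement_stats(r1, r2):
    """Percent agreement and Cohen's kappa for two raters' category labels."""
    n = len(r1)
    po = sum(a == b for a, b in zip(r1, r2)) / n           # raw agreement
    c1, c2 = Counter(r1), Counter(r2)
    # Chance agreement: product of each rater's marginal proportion per category
    pe = sum(c1[c] * c2[c] for c in set(r1) | set(r2)) / n ** 2
    return po, (po - pe) / (1 - pe)

# Two raters classifying five hypothetical cases into categories A/B
po, kappa = agreement_stats(list("AABBA"), list("AABAA"))  # po = 0.8, kappa ≈ 0.55
```

The gap between the two numbers is the point of reporting both: percent agreement counts chance hits, while kappa discounts them.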


