ERIC - Search Results

Publication Date

In 2026	0
Since 2025	56
Since 2022 (last 5 years)	282
Since 2017 (last 10 years)	778
Since 2007 (last 20 years)	2040

Descriptor

Interrater Reliability	3122
Foreign Countries	654
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	230
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	179
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	85
Preschool Education	72
Junior High Schools	65
Adult Education	58
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	24
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,446 to 2,460 of 3,122 results Save | Export

The Use of Paired Comparisons for Developing Criteria for the Observation of Student Clinical Performance.

Peer reviewed

Rippey, Robert M.; Krutchkoff, David J. – Evaluation and the Health Professions, 1984

The method of paired comparisons was used to rank 42 dental students on their performance in emergency and screening clinic rotations. Results suggest this methodology may provide more internally consistent student assessments on more subtle aspects of clinical performance than those assessed by multiple-choice tests or written performance…

Descriptors: Clinical Teaching (Health Professions), College Faculty, Computer Software, Dental Students

Marital Conflict Resolution: Factors Influencing Concordance between Partners and Trained Coders.

Peer reviewed

Birchler, Gary R.; And Others – American Journal of Family Therapy, 1984

Examined factors that influenced the concordant perceptions of 28 distressed and 28 nondistressed husbands and wives and trained coders who observed samples of their own and another couple's problem solving. Correlational analyses suggested greater insider-outsider perceptual agreement for distressed than nondistressed couples and for negative…

Descriptors: Behavior Patterns, Conflict Resolution, Congruence (Psychology), Interaction Process Analysis

Reliability of the Conners Abbreviated Teacher Rating Scale Across Raters and Across Time: Use with Learning Disabled Students.

Peer reviewed

Epstein, Michael H.; Nieminen, Gayla S. – School Psychology Review, 1983

Teachers and classroom aides of learning disabled pupils were asked to complete the Conners Abbreviated Teacher Rating Scale (CATRS) on two separate occasions, one month apart. Inter-rater reliability for teachers (.866) and for aides (.602), and reliability across time for teachers (.866) and aides (.603) achieved acceptable levels. (Author/BW)

Descriptors: Elementary Education, Elementary School Teachers, Hyperactivity, Interrater Reliability

Using FACETS To Model Rater Training Effects. Draft.

Download full text

Weigle, Sara Cushing – 1994

This paper describes a study on rater training that involved the analysis of ratings given to English-as-a-Second-Language (ESL) compositions by 8 inexperienced and 8 experienced raters both before and after rater training, using FACETS (Linacre, 1990, 1993), which provides measures of rater severity and consistency. The testing text was a…

Descriptors: English (Second Language), Essay Tests, Evaluation Criteria, Evaluators

Interviewer Validity and Reliability: An Individual Analysis Approach.

Peer reviewed

Zedeck, Sheldon; And Others – Personnel Psychology, 1983

Studied interviewer reliability, validity, and strategy for information integration. Candidates (N=412) for selection to a military division were interviewed and assessed. Results indicated that interviewers functioned in a similar fashion. Analyses of individual interviewers indicated higher reliability and individual differences among…

Descriptors: Cognitive Processes, Employment Interviews, Evaluation Criteria, Evaluation Methods

Scoring and Analysis of Performance Examinations: A Comparison of Methods and Interpretations.

Peer reviewed

Lunz, Mary E.; Schumacker, Randall E. – Journal of Outcome Measurement, 1997

Results and interpretations of the data from a performance examination were compared for four methods of analysis for 74 medical specialty certification candidates: (1) traditional summary statistics; (2) inter-judge correlations; (3) generalizability theory; and (4) the multifaceted Rasch model. Advantages of the Rasch model are outlined. (SLD)

Descriptors: Comparative Analysis, Data Analysis, Generalizability Theory, Interrater Reliability

Assessing Agreement: An Examination of the Interrater Reliability of Portfolio Assessment in Rochester, New York.

Peer reviewed

Supovitz, Jonathan A.; MacGowan, Andrew, III; Slattery, Jean – Educational Assessment, 1997

Reports on the interrater reliability of a language arts portfolio assessment in the primary grades of the Rochester (New York) school system. Results from approximately 400 primary grade portfolios rated by 2 raters show that teachers can assess their own students' work reliably. (SLD)

Descriptors: Evaluation Methods, Evaluators, Interrater Reliability, Portfolio Assessment

The Design of an Instrument to Assess Problem Solving Activities in Technology Education.

Peer reviewed
PDF on ERIC

Download full text

Hill, Roger B. – Journal of Technology Education, 1997

The Observation Procedure for Technology Education Mental Processes, a computerized assessment tool, was based on duration and frequency of mental processes needed for problem solving. Videotapes of students completing problem-solving activities were used to identify the processes. Interrater reliability tests validated the program. (SK)

Descriptors: Cognitive Processes, Computer Software Development, Interrater Reliability, Measures (Individuals)

Information Level and Young Children's Phonological Accuracy.

Peer reviewed

Goffman, Lisa; And Others – Journal of Child Language, 1996

The influence of information level on the production of accuracy of 20 children was examined. Data were children's productions of nouns in sets of utterances referring to triplets of pictures representing noun-verb-noun utterances. (Author/JL)

Descriptors: Acoustic Phonetics, Child Language, Cognitive Processes, Grammar

The Use of Criteria-Based Grading Profiles in Formative and Summative Assessment.

Peer reviewed

Milligan, Frank – Nurse Education Today, 1996

Grading profiles for formative and summative assessment in a British nursing school were designed with criterion referencing to improve validity and interrater and intercourse reliability. Assessment was conceptualized as an ethical activity that clarifies expectations through specification of criteria. (SK)

Descriptors: Criterion Referenced Tests, Evaluation Criteria, Foreign Countries, Formative Evaluation

Mark My Words, Part 1: Teachers.

Peer reviewed

Miller, Ronald – South African Journal of Higher Education, 1996

In a study of criteria for and reliability of grading of college essays in introductory psychology, 16 essays were marked by 12 faculty and 20 graduate students. Analysis found that two content attributes (facts, examples) accounted for 82% of variance in grading by faculty, while five stylistic measures accounted for the remainder. Both faculty…

Descriptors: College Instruction, Essays, Evaluation Criteria, Grading

Sensitivity and Specificity of the Autism Diagnostic Inventory-Telephone Screening in Spanish.

Peer reviewed

Vrancic, Daniela; Nanclares, Valeria; Soares, Delfina; Kulesz, Analia; Mordzinski, Claudia; Plebst, Christian; Starkstein, Sergio – Journal of Autism and Developmental Disorders, 2002

A study involving 30 Argentineans with autism evaluated the validity of the Autism Diagnostic Inventory-Telephone Screening in Spanish (ADI-TSS). The final version of the ADI-TSS could be assessed in 20 to 40 minutes and demonstrated a high validity, high interrater reliability, and high internal consistency. (Contains references.) (Author/CR)

Descriptors: Adults, Autism, Disability Identification, Foreign Countries

Is the MEAP Writing Test Reliable? A Case Study.

Peer reviewed

Anderson, Stephen A. – Michigan Reading Journal, 2002

Considers the development of an inter-rater reliability correlation comparing the judgments, or scores, or each judge to see if their observations are similar. Presents a case study of the Northville Public Schools' data for the 2000 MEAP (Michigan Educational Assessment Program) Writing Test. Concludes that in this case study the state fails both…

Descriptors: Case Studies, Elementary Education, Evaluation Research, Interrater Reliability

Interviewer Variation and the Co-construction of Speaking Proficiency.

Peer reviewed

Brown, Annie – Language Testing, 2003

Examines the question of variation among interviewers of oral language proficiency interviews in the ways that they elicit demonstrations of communicative ability and the impact of this variation on candidate performance and raters' perceptions of candidate ability. A discourse analysis of two interviews involving the same candidate with two…

Descriptors: Discourse Analysis, Interrater Reliability, Interviews, Language Proficiency

Judge Consistency and Severity across Grading Periods.

Peer reviewed

Lunz, Mary E.; Stahl, John A. – Evaluation and the Health Professions, 1990

Examinations were analyzed using the Rasch model to determine differences in judge severity and grading period stringency for (1) essay examination (subjects were 12 judges and 32 examinees); (2) clinical examination (subjects were 18 judges and 217 examinees); and (3) oral examination (subjects were 46 judges and 270 examinees). (SLD)

Descriptors: Certification, Essay Tests, Evaluators, Examiners

« Previous Page | Next Page »

Pages: 1 | ... | 160 | 161 | 162 | 163 | 164 | 165 | 166 | 167 | 168 | ... | 209

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2553
Reports - Research	2241
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼