ERIC - Search Results

Publication Date

In 2026	0
Since 2025	56
Since 2022 (last 5 years)	282
Since 2017 (last 10 years)	778
Since 2007 (last 20 years)	2040

Descriptor

Interrater Reliability	3122
Foreign Countries	654
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	230
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	179
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	85
Preschool Education	72
Junior High Schools	65
Adult Education	58
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	24
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,926 to 2,940 of 3,122 results Save | Export

Construct Validity of Measures of College Teaching Effectiveness.

Peer reviewed

Howard, George S.; And Others – Journal of Educational Psychology, 1985

The accuracy of various evaluation methods for assessing teacher effectiveness was investigated. College instructors (n=43) were rated by students, colleagues, trained classroom raters, former students, and themselves. Results indicate these methods to be more valid than prior research would suggest. (BS)

Descriptors: College Faculty, Evaluation Methods, Higher Education, Interrater Reliability

Behaviorally Anchored Rating Scales vs. Summated Rating Scales: Psychometric Properties and Susceptibility to Rating Bias.

Peer reviewed

Kinicki, Angelo J.; And Others – Educational and Psychological Measurement, 1985

Using both the Behaviorally Anchored Rating Scales (BARS) and the Purdue University Scales, 727 undergraduates rated 32 instructors. The BARS had less halo effect, more leniency error, and lower interrater reliability. Both formats were valid. The two tests did not differ in rate discrimination or susceptibility to rating bias. (Author/GDC)

Descriptors: Behavior Rating Scales, College Faculty, Comparative Testing, Higher Education

The Effect of Year-to-Year Rater Variation on IRT Linking

Download full text

Yen, Shu Jing; Ochieng, Charles; Michaels, Hillary; Friedman, Greg – Online Submission, 2005

Year-to-year rater variation may result in constructed response (CR) parameter changes, making CR items inappropriate to use in anchor sets for linking or equating. This study demonstrates how rater severity affected the writing and reading scores. Rater adjustments were made to statewide results using an item response theory (IRT) methodology…

Descriptors: Test Items, Writing Tests, Reading Tests, Measures (Individuals)

A Study of Raters' Scoring Tendency of Speaking Ability through Verbal Report Methods and Questionnaire Analysis.

Download full text

Nakamura, Yuji – Journal of Communication Studies, 1996

To find ways to improve rater reliability of a tape-mediated speaking test for Japanese university students of English as a Second Language, two studies gathered information on: how raters actually made their choices on rating sheets of students' speaking ability; determined what criteria teachers think they use and actually use in rating…

Descriptors: English (Second Language), Evaluation Criteria, Foreign Countries, Interrater Reliability

The Accuracy of Pre-Service Teachers' Assessments of Their Classroom Behaviors.

Peer reviewed

Irvine, Jacqueline Jordan – Journal of Research and Development in Education, 1983

The concurrence between preservice teachers' self-evaluations and the ratings of their supervisors was investigated, after both student teachers and supervisors completed training designed to facilitate self-assessment and collegiate relationships. Self-reports of the trained teachers were in moderate agreement with ratings of supervisors. (PP)

Descriptors: Competency Based Teacher Education, Evaluation Methods, Higher Education, Interrater Reliability

An Investigation of Planning Time and Proficiency Level on Oral Test Discourse.

Peer reviewed

Wigglesworth, Gillian – Language Testing, 1997

In this study, planning time was manipulated as a variable in a trial administration of a semi-direct oral interaction test. Discourse analytic techniques were used to determine the nature and/or significance of difference in the elicited discourse across two conditions in terms of complexity and accuracy. Findings suggest that planning time may…

Descriptors: Cognitive Development, Communicative Competence (Languages), Comparative Analysis, Discourse Analysis

The Child Development Milestone Chart: An Approach to Low Cost Developmental Programming in Indonesia.

Peer reviewed

Colletta, Nancy Donahue; And Others – Early Child Development and Care, 1993

Discusses the development of the Indonesian Chart of Developmental Milestones, designed for use with existing nutrition and mother-child welfare programs to monitor children's development. A reliability and validity study using 108 Indonesian children from birth to 36 months of age established a tester-observer reliability of 0.97 and a…

Descriptors: Charts, Child Development, Child Health, Child Welfare

A Latent-Variable Modeling Approach to Assessing Interrater Reliability, Topic Generalizability, and Validity of a Content Assessment Scoring Rubric.

Peer reviewed

Abedi, Jamal; Baker, Eva L. – Educational and Psychological Measurement, 1995

Results from a performance assessment in which 68 high school students wrote essays support the use of latent variable modeling for estimating reliability, concurrent validity, and generalizability of a scoring rubric. The latent variable modeling approach overcomes the limitations of certain conventional statistical techniques in handling…

Descriptors: Criteria, Essays, Estimation (Mathematics), Generalizability Theory

Generalizability Analyses of Work Keys Listening and Writing Tests.

Peer reviewed

Brennan, Robert L.; And Others – Educational and Psychological Measurement, 1995

Generalizability theory is used to examine the psychometric characteristics of the Listening and Writing Tests developed by American College Testing for its Work Keys program. Results with samples of 50 suggest the desirability of a minimum number of the tests' tape-recorded messages and the use of at least 2 raters. (SLD)

Descriptors: Audiotape Recordings, Error of Measurement, Generalizability Theory, Interaction

The Work Sampling System: Reliability and Validity of a Performance Assessment for Young Children.

Peer reviewed

Meisels, Samuel J.; And Others – Early Childhood Research Quarterly, 1995

Examined the reliability and validity of the Work Sampling System (WSS) for evaluating the schoolwork of 100 kindergarten children. Results indicated that the WSS checklist and summary report had very high internal and moderately high interrater reliability. The WSS accurately predicted the performance of the children on a norm-referenced…

Descriptors: Academic Achievement, Achievement Tests, Check Lists, Early Childhood Education

Adjustments for Rater Effects in Performance Assessment.

Peer reviewed

Houston, Walter M.; And Others – Applied Psychological Measurement, 1991

The effectiveness of alternative procedures to correct for rater leniency/stringency effects was studied when true scores were known. Ordinary least squares, weighted least squares, and imputation of the missing data consistently outperformed averaging the observed ratings; and the imputation technique was superior to the least squares methods.…

Descriptors: Comparative Analysis, Computer Simulation, Educational Assessment, Equations (Mathematics)

Quality Control in the Development and Use of Performance Assessments.

Peer reviewed

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991

Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)

Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques

Family Day Care: A Theoretical Basis for Improving Quality.

Peer reviewed

Fischer, Jan Lockwood; Krause Eheart, Brenda – Early Childhood Research Quarterly, 1991

Providers' demographic characteristics, training, support networks, business practices, and stability of services were examined relative to their caregiving practices. Results from a schematic model approach suggest correlations between some of these factors and variances in ratings of caregiver practices. (LB)

Descriptors: Behavior Rating Scales, Child Caregivers, Comparative Analysis, Data Analysis

Instruction and Exposure: How Do They Contribute to Second Language Acquisition?

Peer reviewed

Shresta, Tej B. – Foreign Language Annals, 1998

Describes how instruction and exposure contributed to the development of oral proficiency in English as a Second Language in mutually exclusive learning situations in Nepal. This study finds that both instruction and exposure contribute to second-language acquisition, the former promoting accuracy, the latter promoting fluency. (Author/VWL)

Descriptors: English (Second Language), Experiential Learning, Foreign Countries, Grammar

Rater Reliability in Language Assessment: The Bug of All Bears.

Peer reviewed

Gamaroff, Raphael – System, 2000

To test how to achieve a reliable score on an essay test, based on judgments of specific criteria such as grammatical accuracy or topic relevance, a workshop was conducted on interrater reliability at a teacher educators conference in South Africa. Experienced English teacher educators assessed two essay protocols. Results showed substantial…

Descriptors: English (Second Language), Essays, Evaluation Criteria, Foreign Countries

« Previous Page | Next Page »

Pages: 1 | ... | 192 | 193 | 194 | 195 | 196 | 197 | 198 | 199 | 200 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2553
Reports - Research	2241
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼