ERIC - Search Results

Publication Date

In 2026	0
Since 2025	56
Since 2022 (last 5 years)	282
Since 2017 (last 10 years)	778
Since 2007 (last 20 years)	2040

Descriptor

Interrater Reliability	3122
Foreign Countries	654
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	230
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	179
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	85
Preschool Education	72
Junior High Schools	65
Adult Education	58
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	24
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,536 to 2,550 of 3,122 results Save | Export

The Peer Review Process Used to Evaluate Manuscripts Submitted to Academic Journals: Interjudgmental Reliability.

Peer reviewed

Marsh, Herbert W.; Ball, Samuel – Journal of Experimental Education, 1989

Agreement between two independent reviews of each of 278 manuscripts was compared on an overall recommendation and on specific rating items. Agreement between reviewers on separate dimensions, the unweighted sum of the dimensions, and various weighted sums was no better than that for the overall recommendation itself. (SLD)

Descriptors: Evaluation Methods, Factor Analysis, Interrater Reliability, Manuscripts

Story-Telling: A Method for Assessing Children's Creativity.

Peer reviewed

Hennessey, Beth Ann; Amabile, Teresa M. – Journal of Creative Behavior, 1988

The subjective judgment of observers was used to assess verbal creativity. Students, aged 5-10, told a story to accompany a picture series. Teachers rated the stories relative to one another. Interjudge reliability of the creativity measure was highly satisfactory. Two subsequent studies affirmed the results, with slightly lower interjudge…

Descriptors: Creativity, Creativity Tests, Elementary Education, Evaluation Methods

Competency Judgments in the Training and Evaluation of Psychotherapists.

Peer reviewed

Shaw, Brian F.; Dobson, Keith S. – Journal of Consulting and Clinical Psychology, 1988

Reviews several scales used to evaluate competency of psychotherapists. Discusses concerns about interrater reliability and predictive validity of scales. Considers competency a state-like variable, with therapists demonstrating higher competence when they skillfully treat patients across range of difficulty levels. Contends that development of…

Descriptors: Competence, Counselor Evaluation, Counselor Qualifications, Evaluation Criteria

The Triple Jump Exercise in Inquiry-Based Learning: A Case Study Showing Directions for Further Research.

Peer reviewed

Feletti, Grahame; Ryan, Greg – Assessment & Evaluation in Higher Education, 1994

The Triple Jump, a procedure for assessing students' problem-based learning, is applied to assessment of inquiry-based learning in a graduate course. Results suggest the need for more research into interrater reliability and other characteristics of the exercise. Some simple strategies for making the instrument cost effective are offered. (MSE)

Descriptors: Evaluation Methods, Graduate Study, Higher Education, Independent Study

Setting Performance Standards through Two-Stage Judgmental Policy Capturing.

Peer reviewed

Jaeger, Richard M. – Applied Measurement in Education, 1995

A performance-standard setting procedure termed judgmental policy capturing (JPC) and its application are described. A study involving 12 panelists demonstrated the feasibility of the JPC method for setting performance standards for classroom teachers seeking certification from the National Board for Professional Teaching Standards. (SLD)

Descriptors: Decision Making, Educational Assessment, Evaluation Methods, Evaluators

Professionals' Standards of "Normal" Behavior with Anatomical Dolls and Factors That Influence These Standards.

Kendall-Tackett, Kathleen A. – Child Abuse and Neglect: The International Journal, 1992

Professionals (n=201) working with child sexual abuse victims rated the normalcy of various behaviors with anatomical dolls for children ages two to five. Respondents agreed that overtly sexual behaviors were abnormal for nonabused children, but ratings of ambiguous behaviors varied depending on respondent's profession, gender, and years of…

Descriptors: Behavior Patterns, Behavior Rating Scales, Behavior Standards, Child Abuse

Inter-rater Reliability of the Modified Ashworth Scale for Spasticity in Hemiplegic Patients.

Peer reviewed

Sloan, R. L.; And Others – International Journal of Rehabilitation Research, 1992

This study tested the interrater reliability of the Modified Ashworth Scale in measuring upper and lower limb spasticity in 34 hemiplegic adult patients examined by 2 physiotherapists and 2 doctors. Findings indicated satisfactory reliability for upper limb spasticity but less satisfactory results for lower limb spasticity. (DB)

Descriptors: Adults, Behavior Rating Scales, Evaluation Methods, Interrater Reliability

A Demonstration of Validity for Certification by the American Board of Anesthesiology.

Peer reviewed

Slogoff, Stephen; And Others – Academic Medicine, 1994

To investigate the validity of anesthesiologist certification, 146 anesthesiology program directors were asked whether they would permit each of their graduating residents to complete 3 increasingly complex anesthetic regimens to the directors themselves and rate residents on specific skills. Director responses generally correspond to…

Descriptors: Administrator Attitudes, Anesthesiology, Certification, Graduate Medical Education

Interpretations of Colposcopic Photographs: Evidence for Competence in Assessing Sexual Abuse?

Brayden, Robert M.; And Others – Child Abuse and Neglect: The International Journal, 1991

Seventy physicians and two nurse practitioners rated colposcopic photographs. Results showed that leaders in the field of child sexual abuse assessment made significantly more accurate assessments than pediatricians, pediatric and family practice residents, and intern physicians. Predictors of agreement with standard assessments, although weak,…

Descriptors: Child Abuse, Competence, Evaluation Methods, Evaluators

A Modification of Feldt's Test of the Equality of Two Dependent Alpha Coefficients.

Peer reviewed

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Psychometrika, 1994

A modification of a test of the equality of nonindependent alpha reliability coefficients is proposed. It avoids the limitation that the product of the number of test parts times the number of subjects be quite large. Monte Carlo studies indicate that this test can be used in comparing interrater reliabilities. (SLD)

Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Interrater Reliability

Identification of Conduct and Emotional Syndromes Using the Adelaide Behaviour Disorder Scale.

Peer reviewed

Bond, Malcolm J.; Tustin, R. Don – Journal of Intellectual and Developmental Disability, 1999

This study assessed the psychometric properties of two subscales of the Adelaide Behaviour Disorder Scale that have been hypothesized to describe conduct problems and emotional problems of adults with intellectual disability. Criterion scores for identifying individuals needing clinical intervention were established and validated against…

Descriptors: Adults, Behavior Problems, Disability Identification, Eligibility

Validity of Multiple Ratings of Business Student Performance in a Management Simulation.

Peer reviewed

McEnery, Jean M.; Blanchard, P. Nick – Human Resource Development Quarterly, 1999

Business undergraduates (n=261) participating in an assessment center simulation were evaluated by graduate students and faculty. Assessor-peer and assessor-self ratings lacked convergent and divergent validity, but self-peer ratings had both. (SK)

Descriptors: Assessment Centers (Personnel), Business Administration Education, Higher Education, Interrater Reliability

Controlling the Judge Variable in Grading Essay-Type Items: An Application of Rasch Analyses to the Recruitment Exam for Korean Public School Teachers.

Peer reviewed

Chae, Sunhee – Journal of Outcome Measurement, 1998

Using a recruitment test for Korean teachers, the use of the Rasch measurement model to control the effects of judge variable on the grading of essay-type items is examined. Ways of minimizing the variation of grading due to judge severity and reducing the number of judges without threatening objectivity of ability measurements are presented.…

Descriptors: Ability Identification, Achievement Tests, Essay Tests, Foreign Countries

The Environmental Rating Scale (ERS): A Measure of the Quality of the Residential Environment for Adults with Autism.

Peer reviewed

Van Bourgondien, Mary E.; Reichle, Nancy C.; Campbell, Duncan G.; Mesibov, Gary B. – Research in Developmental Disabilities, 1998

This study assessed the psychometric properties of the Environmental Rating Scale, a measure specifically designed to assess residential treatment programs for individuals with autism. The measure's reliability was demonstrated by assessments of the internal consistency, stability, and interrater reliability. Preliminary analysis of validity…

Descriptors: Adults, Autism, Evaluation Methods, Interrater Reliability

Performance-Based Assessment: Implications of Task Specificity.

Peer reviewed

Linn, Robert L.; Burton, Elizabeth – Educational Measurement: Issues and Practice, 1994

Generalizability of performance-based assessment scores across raters and tasks is examined, focusing on implications of generalizability analyses for specific uses and interpretations of assessment results. Although it seems probable that assessment conditions, task characteristics, and interactions with instructional experiences affect the…

Descriptors: Educational Assessment, Educational Experience, Generalizability Theory, Interaction

« Previous Page | Next Page »

Pages: 1 | ... | 166 | 167 | 168 | 169 | 170 | 171 | 172 | 173 | 174 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2553
Reports - Research	2241
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼