ERIC - Search Results

Publication Date

In 2026	0
Since 2025	58
Since 2022 (last 5 years)	284
Since 2017 (last 10 years)	780
Since 2007 (last 20 years)	2042

Descriptor

Interrater Reliability	3124
Foreign Countries	655
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	231
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	180
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	86
Preschool Education	72
Junior High Schools	65
Adult Education	59
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	25
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,506 to 2,520 of 3,124 results Save | Export

Evaluating Rater Responses to an Online Training Program for L2 Writing Assessment

Peer reviewed

Direct link

Elder, Catherine; Barkhuizen, Gary; Knoch, Ute; von Randow, Janet – Language Testing, 2007

The use of online rater self-training is growing in popularity and has obvious practical benefits, facilitating access to training materials and rating samples and allowing raters to reorient themselves to the rating scale and self monitor their behaviour at their own convenience. However there has thus far been little research into rater…

Descriptors: Writing Evaluation, Writing Tests, Scoring Rubrics, Rating Scales

Improving the Reliability of a Direct Writing Skills Assessment.

Download full text

Schwarz, Julie A.; Collins, Michelle L. – 1995

Behaviorally Anchored Rating Scales (BARS) were developed to score responses from a previously designed police written communication test that lacked reliability. Rating scales for each of the 9 dimensions of the test consisted of the scale definition and a 5-point continuum, with the scores of 5, 3, and 1 defined by specified behavioral…

Descriptors: Graduate Students, Graduate Study, Higher Education, Interrater Reliability

Debate Philosophy Statements as Predictors of Critic Attitudes: A Summary and Direction of Research.

Download full text

Dudczak, Craig; Day, Donald – 1991

Philosophy statements have been used in the National Debate Tournament (NDT) since the mid-1970s and the Cross Examination Debate Association (CEDA) National Tournament since its 1986 inception. The statements should help debaters adapt to critics' expressed preferences. Moreover, philosophy statements can guide the study of argumentation theory…

Descriptors: Comparative Analysis, Content Analysis, Debate, Higher Education

Using the Microcomputer To Equate Ratings of Student Writing Samples.

Download full text

Brown, William L.; Stevens, Betty L. – 1992

The objectives of this study were to determine whether student writing portfolios could be rated reliably by trained judges; study the effects on student ratings of the differential leniency of the judges; and ascertain the effects of writing-prompt difficulty and its interactions with rater leniency. Writing samples from 127 students in grades 3,…

Descriptors: Elementary Education, Evaluation Methods, Interrater Reliability, Judges

"Naive" Native Speakers and Judgments of Oral Proficiency in Spanish.

Download full text

Barnwell, David – 1989

A study addressed issues of concern in the use of the American Council on the Teaching of Foreign Languages (ACTFL)/Educational Testing Service (ETS) Language Proficiency Guidelines commonly used in determination of oral language proficiency. Specifically, potential discrepancies between the judgments of trained raters and "naive" native…

Descriptors: Interrater Reliability, Interviews, Language Proficiency, Language Tests

A New Approach to Activity Structure Analysis.

Woolever, Roberta – 1990

A protocol for analyzing the activity structure of a classroom lesson was developed and field tested. The protocol can be completed on site by a person serving as both observer and analyst. Validity of the protocol was established by reference to the large body of qualitative research on activity structure and the analysis of teaching. The…

Descriptors: Classroom Observation Techniques, Elementary Secondary Education, Evaluators, Graduate Students

Temperamental Characteristics of Infants Born to Drug Abusers: Do Maternal and Observer Ratings Agree?

Johnson, Helen L.; Rosen, Tove S. – 1986

The study compared maternal and trained observer evaluations of infant temperamental characteristics, to determine how closely the ratings correspond, and to analyze the impact of maternal drug abuse habits on maternal ratings of infant temperament. In relating observer to maternal ratings of infant temperament, seven dimensions were compared:…

Descriptors: Child Rearing, Drug Abuse, Infant Behavior, Infants

The Potential Dual Effect of Context Effects and Score Level Effects on the Assignment of Scores to Essays.

Download full text

Paden, Patricia A. – 1986

Two factors which may affect the ratings assigned to an essay test are investigated: (1) context effects; and (2) score level effects. Context effects exist in essay scoring if an essay is rated higher when preceded by poor quality essays than when preceded by high quality essays. A score level effect is defined as a change in the score (value)…

Descriptors: Context Effect, Essay Tests, Holistic Evaluation, Interrater Reliability

The Analysis of Ratings Using Generalizability Theory for Student Outcome Assessment. AIR 1988 Annual Forum Paper.

Erwin, T. Dary – 1988

Rating scales are a typical method for evaluating a student's performance in outcomes assessment. The analysis of the quality of information from rating scales poses special measurement problems when researchers work with faculty in their development. Generalizability measurement theory offers a set of techniques for estimating errors or…

Descriptors: Educational Assessment, Generalizability Theory, Higher Education, Institutional Research

Factors Influencing the Degree of Intrajudge Consistency during the Standard Setting Process.

Plake, Barbara S.; And Others – 1989

The accuracy of standards obtained from judgmental methods is dependent on the quality of the judgments made by experts throughout the standard setting process. One important dimension of the quality of these judgments is the consistency of the judges' perceptions with item performance of minimally competent candidates. Several interrelated…

Descriptors: Cutting Scores, Evaluation Methods, Evaluative Thinking, Evaluators

Reliability and Generalizability of Ratings of Compositions.

Download full text

Lehmann, Rainer H. – 1987

A total of 1,487 eleventh grade students from the Hamburg (West Germany) school system were asked to complete four writing assignments used in an International Association for the Evaluation of Educational Achievement (IEA) study of writing assessment. In analyzing the writing samples, the study focused on: (1) between-rater effects; (2)…

Descriptors: Evaluation Problems, Foreign Countries, High Schools, International Programs

A Taxonomy of Literature Reviews.

Download full text

Cooper, Harris M. – 1985

A taxonomy for literature reviews in education and psychology is presented. The increased use of the descriptor "literature review" in ERIC and Psychological Abstracts documents between 1969 and 1983 is cited as creating the need for categorization. The taxonomy categorizes reviews according to focus, goal, perspective, coverage,…

Descriptors: Classification, Content Analysis, Databases, Educational Research

Development and Validation of a Scale to Measure Metaphoric Complexity.

Congleton, Donna McKinley – 1982

A scale was developed and tested to measure metaphoric complexity in order to aid teachers in the selection and sequencing of young adult (YA) novels in the English curriculum. The development and testing of the scale involved two stages: (1) the development of a questionnaire to determine if differences in the metaphoric complexity of examples…

Descriptors: Adolescent Literature, Content Analysis, Difficulty Level, English Curriculum

Function Analysis as a Way of Subgrouping the Reading Disabled: Clinical and Statistical Analyses.

Peer reviewed

Gjessing, Hans-Jorgen – Scandinavian Journal of Educational Research, 1986

The terms reading disability and dyslexia are discussed, as well as the meaning of function analysis as a way of diagnosing behavior and difficulties in reading and spelling. The author's model classifies reading disability and dyslexia as auditory, auditory-visual, visual, emotional, or pedagogic. The Bergen Study is described. (Author/LMO)

Descriptors: Auditory Discrimination, Dyslexia, Elementary Education, Foreign Countries

Assessment of Solo Musical Performance: A Preliminary Study.

Peer reviewed

Mills, Janet – Bulletin of the Council for Research in Music Education, 1987

Questions the extent to which assessment of solo musical performance can be made under the General Certificate of School Education exam in England and Wales. Discusses performances as criterion. Reports on experiment which attempted to assess a student's overall music performance. Offers a model which can be used to better measure solo music…

Descriptors: Educational Research, Educational Testing, Foreign Countries, Interrater Reliability

« Previous Page | Next Page »

Pages: 1 | ... | 164 | 165 | 166 | 167 | 168 | 169 | 170 | 171 | 172 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	25
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2555
Reports - Research	2243
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼