ERIC - Search Results

Publication Date

In 2026	0
Since 2025	56
Since 2022 (last 5 years)	282
Since 2017 (last 10 years)	778
Since 2007 (last 20 years)	2040

Descriptor

Interrater Reliability	3122
Foreign Countries	654
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	230
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	179
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	85
Preschool Education	72
Junior High Schools	65
Adult Education	58
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	24
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 2,221 to 2,235 of 3,122 results Save | Export

In the Eye of the Beholder: Reply to Wilson and Shadish (2006) and Radin, Nelson, Dobyns, and Houtkooper (2006)

Peer reviewed

Direct link

Bosch, Holger; Steinkamp, Fiona; Boller, Emil – Psychological Bulletin, 2006

H. Bosch, F. Steinkamp, and E. Boller's (see record 2006-08436-001) meta-analysis, which demonstrated (a) a small but highly significant overall effect, (b) a small-study effect, and (c) extreme heterogeneity, has provoked widely differing responses. After considering D. B. Wilson and W. R. Shadish's (see record 2006-08436-002) and D. Radin, R.…

Descriptors: Meta Analysis, Publications, Bias, Models

A Reliability Study of BDAE-3 Discourse Coding

Peer reviewed

Direct link

Powell, Thomas W. – Clinical Linguistics & Phonetics, 2006

The third edition of the "Boston Diagnostic Aphasia Examination" (Goodglass, Kaplan, and Barresi) introduced standardized procedures for coding discourse samples elicited using the well known Cookie Theft illustration. To evaluate the reliability of this discourse coding procedure, a transcribed sample was coded by 14 novice examiners…

Descriptors: Examiners, Interrater Reliability, Test Reliability, Aphasia

Judging Text Presented on Screen: Implications for Validity

Peer reviewed

Direct link

Johnson, Martin; Greatorex, Jackie – E-Learning, 2008

Technological innovation undoubtedly offers many potential benefits for education and the assessment of learning, which have been acknowledged elsewhere. One area that is relatively under-researched relates to the practice of how assessors interact with longer texts that are presented on screen. This is an important area of study because there…

Descriptors: Foreign Countries, Innovation, Technological Advancement, Technology Uses in Education

The Effects of Observation Coaching on Children's Graphic Representations

Peer reviewed
PDF on ERIC

Download full text

Vlach, Haley A.; Carver, Sharon M. – Early Childhood Research & Practice, 2008

Education programs have fostered advanced levels of graphic representation ability in young children but have not detailed the specific mechanisms responsible for the accelerated growth. Research suggests that between 6 and 8 years of age children begin to observe more carefully before drawing and that observation prompts aid children's…

Descriptors: Childrens Art, Observation, Scores, Early Childhood Education

Reliability and Confidence in Using a Paired Comparison Paradigm in Perceptual Voice Quality Evaluation

Peer reviewed

Direct link

Yiu, Edwin M.-L.; Chan, Karen M. K.; Mok, Rosa S.-M. – Clinical Linguistics & Phonetics, 2007

One of the ways to improve the reliability in perceptual voice quality rating is to provide listeners with external anchors. A paired comparison matching paradigm using synthesized Cantonese voice stimuli that covered a range of rough and breathy qualities were used to investigate the rating reliability. Twenty-five speech pathology students rated…

Descriptors: Data Analysis, Measures (Individuals), Stimuli, Models

Reporting Gender, Race, Ethnicity, and Sociometric Status: Guidelines for Research and Professional Practice

Peer reviewed

Direct link

Hodge, Samuel R.; Kozub, Francis M.; Robinson, Leah E.; Hersman, Bethany L. – Adapted Physical Activity Quarterly, 2007

The purpose of this study was to determine what trends exist in the identification and description of participants used in data-based studies published in "Adapted Physical Activity Quarterly" and the "Journal of Teaching in Physical Education". Data were analyzed using frequency counts for journals and time periods from the 1980s to 2005 with…

Descriptors: Physical Education, Ethnicity, Socioeconomic Status, Physical Activities

Rating Scale Impact on EFL Essay Marking: A Mixed-Method Study

Peer reviewed

Direct link

Barkaoui, Khaled – Assessing Writing, 2007

Educators often have to choose among different types of rating scales to assess second-language (L2) writing performance. There is little research, however, on how different rating scales affect rater performance. This study employed a mixed-method approach to investigate the effects of two different rating scales on EFL essay scores, rating…

Descriptors: Writing Evaluation, Writing Tests, Rating Scales, Essays

Influences on and Limitations of Classical Test Theory Reliability Estimates.

Download full text

Arnold, Margery E. – 1996

It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…

Descriptors: Estimation (Mathematics), Generalizability Theory, Heuristics, Interrater Reliability

Of English Marks and American Reviewers.

Download full text

Spolsky, Bernard – 1990

A discussion of the differences between the Test of English as a Foreign Language (TOEFL), an American test battery, and the Cambridge English Examinations (Cambridge), a British battery, focuses on the different approaches to language test development embodied in the tests as the source of difficulty in translating between them for individual…

Descriptors: Comparative Analysis, Cultural Differences, English (Second Language), Foreign Countries

Another Look at Inter-Rater Agreement. Research Report.

Download full text

Zwick, Rebecca – 1986

Most currently used measures of inter-rater agreement for the nominal case incorporate a correction for "chance agreement." The definition of chance agreement is not the same for all coefficients, however. Three chance-corrected coefficients are Cohen's Kappa; Scott's Pi; and the S index of Bennett, Goldstein, and Alpert, which has…

Descriptors: Error of Measurement, Interrater Reliability, Mathematical Models, Measurement Techniques

Alternative Methods for Calculating Intercoder Reliability in Content Analysis: Kappa, Weighted Kappa and Agreement Charts Procedures.

Kang, Namjun – 1987

If content analysis is to satisfy the requirement of objectivity, measures and procedures must be reliable. Reliability is usually measured by the proportion of agreement of all categories identically coded by different coders. For such data to be empirically meaningful, a high degree of inter-coder reliability must be demonstrated. Researchers in…

Descriptors: Content Analysis, Interrater Reliability, Measurement Techniques, Media Research

How Useful Are Suicide Risk Ratings?

Stelmachers, Zigfrids T.; Sherman, Robert E. – 1988

The clinical usefulness of various empirically derived suicide potential rating scales has been questioned by several suicidologists. This study used actual case histories in an attempt to anchor suicide risk ratings. Thirty-three brief case histories of suicidal patients were given to 19 experienced crisis workers for seven-point ratings of…

Descriptors: Clinical Diagnosis, Evaluation Criteria, Evaluation Methods, High Risk Persons

Qualities of Judgmental Ratings by Four Rater Sources.

Download full text

Tsui, Anne S. – 1983

Quality of performance data yielded by subjective judgment is of major concern to researchers in performance appraisal. However, some confusion exists in the analysis of quality on ratings obtained from different rating scale formats and from different raters. To clarify this confusion, a study was conducted to assess the quality of judgmental…

Descriptors: Administrator Evaluation, Administrators, Error of Measurement, Evaluation Methods

The Clinical Validity of the MMPI-168.

Edinger, Jack D.; Vosk, Barbara N. – 1983

Of the many short forms of the Minnesota Multiphasic Personality Inventory (MMPI) that have been developed, the MMPI-168 is among the most promising. To determine whether clinical judgments based on the MMPI-168 are comparable to judgments based on the standard MMPI, 30 clinical psychologists participated in a randomized block, repeated treatment…

Descriptors: Comparative Testing, Diagnostic Tests, Interrater Reliability, Personality Measures

Proficiency Testing in the Less Commonly Taught Languages. ERIC Digest.

Download full text

Thompson, Richard T.; Johnson, Dora E. – 1988

Efforts to expand the generic language proficiency guidelines of the American Council on the Teaching of Foreign Languages (ACTFL) to the less commonly taught languages (LCTLs) began when developers realized that the ACTFL guidelines were too Eurocentric; the guidelines included grammatical categories specific to Western European languages and…

Descriptors: Cultural Context, Interrater Reliability, Language Proficiency, Language Tests

« Previous Page | Next Page »

Pages: 1 | ... | 145 | 146 | 147 | 148 | 149 | 150 | 151 | 152 | 153 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2553
Reports - Research	2241
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼