ERIC - Search Results

Publication Date

In 2026	0
Since 2025	13
Since 2022 (last 5 years)	48
Since 2017 (last 10 years)	151
Since 2007 (last 20 years)	301

Descriptor

Interrater Reliability	503
Test Reliability	503
Test Validity	260
Test Construction	106
Foreign Countries	103
Psychometrics	91
Evaluation Methods	90
Scores	67
Correlation	62
Scoring	61
Rating Scales	58
Measures (Individuals)	54
Student Evaluation	53
Children	49
Adults	40
Measurement Techniques	40
Generalizability Theory	39
Writing Evaluation	39
Higher Education	38
Elementary School Students	36
Test Items	35
Autism	34
Behavior Rating Scales	32
Construct Validity	32
Language Tests	32
More ▼

Publication Type

Journal Articles	378
Reports - Research	365
Reports - Evaluative	81
Speeches/Meeting Papers	59
Tests/Questionnaires	32
Reports - Descriptive	31
Dissertations/Theses -…	14
Information Analyses	11
Numerical/Quantitative Data	11
Guides - Non-Classroom	6
Opinion Papers	3
Book/Product Reviews	1
Books	1
Collected Works - Proceedings	1
Guides - General	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	65
Postsecondary Education	56
Elementary Education	42
Early Childhood Education	29
Secondary Education	21
Primary Education	16
Elementary Secondary Education	15
Middle Schools	14
Grade 1	13
Preschool Education	13
Grade 3	11
Junior High Schools	11
Kindergarten	9
Grade 2	7
Adult Education	6
High Schools	6
Grade 5	5
Grade 8	5
Intermediate Grades	5
Grade 4	4
Grade 6	4
Grade 7	4
Grade 9	4
Grade 10	1
More ▼

Audience

Researchers	41
Practitioners	8
Administrators	3
Teachers	3
Counselors	1

Location

Turkey	11
Canada	10
Australia	9
United Kingdom	9
Pennsylvania	7
Florida	6
Netherlands	6
Sweden	5
United Kingdom (England)	5
China	4
Illinois	4
Japan	4
North Carolina	4
Brazil	3
California	3
Georgia	3
Germany	3
Indiana	3
Israel	3
Italy	3
Jordan	3
Kansas	3
South Africa	3
United States	3
Belgium	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	1
Pell Grant Program	1

What Works Clearinghouse Rating

Test Reliability X

Showing 406 to 420 of 503 results Save | Export

Measurement of Social Communication Skills of Children with Autism Spectrum Disorders during Interactions with Typical Peers

Peer reviewed

Direct link

Murdock, Linda C.; Cost, Hollie C.; Tieso, Carol – Focus on Autism and Other Developmental Disabilities, 2007

The "Social-Communication Assessment Tool" (S-CAT) was created as a direct observation instrument to quantify specific social and communication deficits of children with autism spectrum disorders (ASD) within educational settings. In this pilot study, the instrument's content validity and interrater reliability were investigated to determine the…

Descriptors: Nonverbal Communication, Autism, Content Validity, Test Validity

Basic Concepts in Generalizability Theory: A More Powerful Approach to Evaluating Reliability.

Download full text

Naizer, Gilbert – 1992

A measurement approach called generalizability theory (G-theory) is an important alternative to the more familiar classical measurement theory that yields less useful coefficients such as alpha or the KR-20 coefficient. G-theory is a theory about the dependability of behavioral measurements that allows the simultaneous estimation of multiple…

Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Higher Education

Investigating Variability in Tasks and Rater Judgments in a Performance Test of Foreign Language Speaking.

Download full text

Bachman, Lyle F.; And Others – 1993

This paper outlines the development of a performance assessment measure of language speaking ability, the Language Ability Assessment System (LAAS), which is highly reliable and can be examined for reliability through modern measurement theories, such as generalizability theory (G-theory) and the many-facet Rasch theory. LAAS was developed to…

Descriptors: College Students, Higher Education, Interrater Reliability, Language Proficiency

A Study of the Comparability of Speaking Proficiency Interview Ratings across Three Government Language Training Agencies.

Download full text

Clark, John L. D. – 1986

A study of the reliability of the proficiency ratings scale and techniques used by three federal government agencies--the Central Intelligence Agency, the Defense Language Institute, and the Foreign Service Institute (FSI)--to test employees' oral language proficiency in French and German had two randomly selected two-person teams of testers from…

Descriptors: Comparative Analysis, Federal Government, French, German

Development of the Portuguese Speaking Test. Year One Project Report. Development of Semi-Direct Tests of Oral Proficiency in Hausa, Hebrew, Indonesian and Portuguese.

Download full text

Stansfield, Charles W.; Kenyon, Dorry Mann – 1988

The development and validation of a Portuguese oral language test are described. The test consisted of five item types: personal conversation, giving directions, description of picture sequences, topical discourse, and oral task completion based on printed instructions. Three preliminary forms of the test were administered to a group of language…

Descriptors: Interrater Reliability, Interviews, Language Tests, Oral Language

The Measurement of Developmental Variables: An Overview.

Santmire, Toni E. – 1984

The purpose of this paper is to discuss ways in which developmental psychology suffers from the lack of an appropriate technology of measurement and statistical analysis. The paper begins by noting that developmental psychology is the study of change; that individuals develop through a succession of "stages" which are separated by…

Descriptors: Data Analysis, Data Collection, Developmental Psychology, Developmental Stages

Rater Reliability of the ACTFL Oral Proficiency Interview.

Peer reviewed

Magnan, Sally Sieloff – Canadian Modern Language Review, 1987

Differences in procedures used by academic institutions and government agencies in administering the American Council on the Teaching of Foreign Languages' Oral Proficiency Interview test are examined, and results and implications of two studies of interrater reliability are discussed. (MSE)

Descriptors: Comparative Analysis, Correlation, Evaluation Methods, Evaluators

Generalizability of Written-Response Scores for the Alberta Education English 30 Diploma Examination.

Peer reviewed

Gierl, Mark J. – Alberta Journal of Educational Research, 1998

Examined the generalizability of written-response scores on the English 30 diploma examination administered to Alberta 12th-grade students. Student scores differed as a function of rater, but this variance component was small across two tasks and two administrations; score generalizability was high using a two-rater system; and scale variability…

Descriptors: Error of Measurement, Foreign Countries, Generalizability Theory, High School Seniors

The Reliability and Validity of Mathematics Performance Assessment.

Download full text

Brown, William L.; And Others – 1996

This study presents psychometric characteristics of the mathematics problem solving performance assessment used in the Minneapolis Public Schools, focusing on the interrater reliability, scoring reliability, and validity of the assessment. The Minneapolis Math Problem Solving Assessment (MPSA) was established in 1991. Students are asked to solve…

Descriptors: Elementary School Students, Grade 5, Intermediate Grades, Interrater Reliability

It Is Incorrect To Say "The Test Is Reliable": A Review of the Literature and Implications for Research Practice.

Download full text

Aycock, Tim – 1993

To determine trends in reporting test reliability, 88 articles addressing 188 instruments in 1980, 81 articles covering 205 instruments in 1985, and 67 articles assessing 195 instruments in 1990 in the "Journal of Counseling Psychology" were reviewed. Articles were examined for the way in which reliability was discussed and reported, and…

Descriptors: Educational Practices, Educational Research, Estimation (Mathematics), Interrater Reliability

Exploring Rater Behaviour with Rasch Techniques.

Download full text

McNamara, T. F.; Adams, R. J. – 1991

A preliminary study is reported of the use of new multifaceted Rasch measurement mechanisms for investigating rater characteristics in language testing. Ratings from four judges of scripts from 50 candidates taking the International English Language Testing System test, a test of English for Academic Purposes, are analyzed. The analysis…

Descriptors: Comparative Analysis, English (Second Language), Foreign Countries, Interrater Reliability

Reliability of the Conners Abbreviated Teacher Rating Scale across Raters and across Time: Use with Learning Disabled Students.

Peer reviewed

Epstein, Michael H.; Nieminen, Gayla S. – School Psychology Review, 1983

Teachers and classroom aides of learning disabled students completed the Conners Abbreviated Teacher Rating Scale (CATRS) on two separate occasions. The study investigated the inter-rater and intra-rater reliability of this instrument. CATRS appeared to have sufficient reliability to recommend its continued frequent use. (Author/DWH)

Descriptors: Behavior Rating Scales, Elementary Education, Elementary School Students, Hyperactivity

Pupil Talk in Biology Practical Work--A Preliminary Study.

Peer reviewed

Pugh, Malcolm; Lock, Roger – Research in Science and Technological Education, 1989

The development of a framework for analyzing pupil talk is described and the reliability of scoring transcribed conversions using the framework discussed. Definitions and examples of the terms used in the framework are appended. (Author/YP)

Descriptors: Biology, Foreign Countries, Group Discussion, Interrater Reliability

Parent and Teacher Assessment of Receptive and Expressive Language in Preschool Children with Developmental Disabilities.

Peer reviewed

Sigafoos, Jeff; Pennell, Donna – Education and Training in Mental Retardation and Developmental Disabilities, 1995

Comparison using paired t-tests of parent and teacher ratings for 16 preschool children on the Receptive-Expressive Emergent Language Scale found no significant differences between parent and teacher ratings of expressive language, but a significant difference on the receptive language subscale. However, interrater reliability was relatively low…

Descriptors: Developmental Disabilities, Expressive Language, Interrater Reliability, Language Skills

The Triple-Jump Examination as an Assessment Tool in the Problem-Based Medical Curriculum at the University of Hawaii.

Peer reviewed

Smith, Richard Merrill – Academic Medicine, 1993

A University of Hawaii study compared objective and subjective assessments of the three-step triple jump examination which tests medical students' clinical problem-solving processes. Subjects were 58 first-year students. Results found the subjective assessments were more consistent across problems of varying difficulty level than were objective…

Descriptors: Case Studies, Difficulty Level, Higher Education, Interrater Reliability

« Previous Page | Next Page »

Pages: 1 | ... | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34

Journal of Autism and…	25
Journal of Speech, Language,…	13
ProQuest LLC	13
Assessment for Effective…	12
Grantee Submission	8
International Journal of…	7
Measurement in Physical…	7
Educational and Psychological…	6
International Journal of…	6
Research in Developmental…	6
Assessment	5
Behavioral Disorders	5
Online Submission	5
Psychology in the Schools	5
Research in Developmental…	5
ETS Research Report Series	4
Journal of Positive Behavior…	4
Research Papers in Education	4
American Journal on Mental…	3
Autism: The International…	3
Center for Innovation in…	3
Developmental Medicine &…	3
Developmental Psychology	3
Education and Training in…	3
Gerontologist	3
More ▼

Epstein, Michael H.	7
Johnson, Evelyn S.	4
Matson, Johnny L.	4
Tasse, Marc J.	4
Aman, Michael G.	3
Canivez, Gary L.	3
Capie, William	3
Conroy, Maureen A.	3
Crawford, Angela R.	3
Lecavalier, Luc	3
McLeod, Bryce D.	3
Moylan, Laura A.	3
Unal, Zafer	3
Watkins, Marley W.	3
Zheng, Yuzhu	3
Aktas, Mehtap	2
Anna-Maria Fall	2
Atilgan, Hakan	2
Aydin, Selami	2
Benton, Stephen L.	2
Beula M. Magimairaj	2
Bodur, Yasar	2
Botting, Nicola	2
Breland, Hunter M.	2
More ▼

Strengths and Difficulties…	6
Test of English as a Foreign…	6
Autism Diagnostic Observation…	4
Child Behavior Checklist	4
Conners Teacher Rating Scale	4
Adjustment Scales for…	3
Adult Attachment Interview	3
Advanced Placement…	3
Behavioral and Emotional…	3
Childhood Autism Rating Scale	3
Graduate Record Examinations	3
Teacher Performance…	3
ACT Assessment	2
ACTFL Oral Proficiency…	2
Cognitive Abilities Test	2
Hamilton Rating Scale for…	2
Minnesota Multiphasic…	2
National Assessment of…	2
SAT (College Admission Test)	2
Teacher Rating Scale	2
Alabama High School…	1
Basic Reading Inventory	1
Battelle Developmental…	1
Bayley Scales of Infant…	1
Beck Anxiety Inventory	1
More ▼