ERIC - Search Results

Publication Date

In 2026	0
Since 2025	13
Since 2022 (last 5 years)	48
Since 2017 (last 10 years)	151
Since 2007 (last 20 years)	301

Descriptor

Interrater Reliability	503
Test Reliability	503
Test Validity	260
Test Construction	106
Foreign Countries	103
Psychometrics	91
Evaluation Methods	90
Scores	67
Correlation	62
Scoring	61
Rating Scales	58
Measures (Individuals)	54
Student Evaluation	53
Children	49
Adults	40
Measurement Techniques	40
Generalizability Theory	39
Writing Evaluation	39
Higher Education	38
Elementary School Students	36
Test Items	35
Autism	34
Behavior Rating Scales	32
Construct Validity	32
Language Tests	32
More ▼

Publication Type

Journal Articles	378
Reports - Research	365
Reports - Evaluative	81
Speeches/Meeting Papers	59
Tests/Questionnaires	32
Reports - Descriptive	31
Dissertations/Theses -…	14
Information Analyses	11
Numerical/Quantitative Data	11
Guides - Non-Classroom	6
Opinion Papers	3
Book/Product Reviews	1
Books	1
Collected Works - Proceedings	1
Guides - General	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	65
Postsecondary Education	56
Elementary Education	42
Early Childhood Education	29
Secondary Education	21
Primary Education	16
Elementary Secondary Education	15
Middle Schools	14
Grade 1	13
Preschool Education	13
Grade 3	11
Junior High Schools	11
Kindergarten	9
Grade 2	7
Adult Education	6
High Schools	6
Grade 5	5
Grade 8	5
Intermediate Grades	5
Grade 4	4
Grade 6	4
Grade 7	4
Grade 9	4
Grade 10	1
More ▼

Audience

Researchers	41
Practitioners	8
Administrators	3
Teachers	3
Counselors	1

Location

Turkey	11
Canada	10
Australia	9
United Kingdom	9
Pennsylvania	7
Florida	6
Netherlands	6
Sweden	5
United Kingdom (England)	5
China	4
Illinois	4
Japan	4
North Carolina	4
Brazil	3
California	3
Georgia	3
Germany	3
Indiana	3
Israel	3
Italy	3
Jordan	3
Kansas	3
South Africa	3
United States	3
Belgium	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	1
Pell Grant Program	1

What Works Clearinghouse Rating

Test Reliability X

Showing 481 to 495 of 503 results Save | Export

Agreement and Stability of Teacher Rating Scales for Assessing ADHD in Preschoolers.

Peer reviewed

Loughran, Sandra B. – Early Childhood Education Journal, 2003

Investigated the agreement and stability of three teacher rating scales used to assess attention deficit/hyperactivity disorder (ADHD) in preschoolers. Found that agreement among the rating scales and interrater agreement between teacher and assistant teacher ratings yielded noticeably stronger correlations at elementary school than at preschool 4…

Descriptors: At Risk Persons, Attention Deficit Disorders, Behavior Rating Scales, Early Childhood Education

Placing Texts, Placing Writers: Sources of Readers' Judgments in University Placement-Testing.

Download full text

Sullivan, Francis J. – 1986

A study examined how pragmatic form influences evaluation of student essays in university placement testing. Specifically, the study documented how patterns in students' use of information (assumed to be either old, inferable, or new for readers) affected the holistic scores for quality given to the essays. Subjects, 99 randomly selected entering…

Descriptors: College Freshmen, Essay Tests, Evaluation Criteria, Evaluation Methods

A Proposed Research Program for ESL Composition Evaluation.

Perkins, Kyle – 1986

Based on the premise that composition skills and their evaluation are crucial to the educational process, this paper presents a tentative research program for conducting future English as a second language (ESL) composition evaluation studies. The program developed in the paper covers the following topics as areas which merit further rigorous…

Descriptors: Elementary Secondary Education, English (Second Language), Error Analysis (Language), Evaluation Criteria

A Preliminary Study of Raters for the Test of Spoken English.

Download full text

Bejar, Isaac I. – 1985

The feasibility of reducing scoring costs for the Test of Spoken English (TSE) by using one rater was investigated. Currently, two raters are used. It was found that, because of the possibility of different standards used by potential raters, it does not appear feasible to use a single rater as the sole determiner of speaking proficiency under the…

Descriptors: Analysis of Covariance, Cost Effectiveness, English (Second Language), Evaluation Criteria

Assessing Speaking.

Peer reviewed

Turner, Jean – Annual Review of Applied Linguistics, 1998

This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…

Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews

The Development of Clinical Supervision Procedures for Extending the Personally and Professionally Inviting Behaviors of Middle Grades Teachers--A Pilot Study.

Strahan, David B.; Van Hoose, John – 1986

The Invitational Teaching Observation Instrument was developed to extend effective teaching through self-assessment and clinical supervision. Based on the theories of Invitational Education, this test analyzed both personal and professional dimensions of teaching. Items reflected research on effective teaching and were cross-validated with two…

Descriptors: Behavior Rating Scales, Classroom Observation Techniques, Elementary School Teachers, Evaluation Criteria

Issues in Portfolio Assessment: The Scorability of Narrative Collections. Project 3.1: Studies in Improving Classroom and Local Assessments.

Download full text

Gearhart, Maryl; Novak, John R.; Herman, Joan L. – 1994

Technical questions regarding the reliability and validity of large-scale portfolio assessment were studied which focused on: (1) whether raters can score collections of writing reliably with rubrics designed for single samples; (2) whether ratings derived from different frameworks differ in their capacities to support technically sound…

Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Essay Tests

A Brief Multi-Dimensional Children's Level-of-Functioning Tool.

Download full text

Srebnik, Debra – 1996

This paper discusses the results of a study that investigated the validity and reliability of the Ecology Rating Scale (ERS). The ERS is a brief, multi-dimensional level-of-functioning instrument that can be rated by parents or clinicians. The ERS is comprised of seven domains of youth functioning: family, school, emotional, legal/justice,…

Descriptors: Academic Achievement, Adolescents, Behavior Disorders, Child Health

Performance Testing Manual and Workbook for Vocational Education Administrators and Teachers.

Florida State Dept. of Education, Tallahassee. Div. of Vocational, Adult, and Community Education. – 1991

This packet contains a manual and a workbook for developing performance tests in vocational education. The manual gives an in-depth description of how to develop, score, and use performance tests. It includes the following sections: definitions of performance testing, steps in developing a performance test, selecting a performance development…

Descriptors: Interrater Reliability, Performance Tests, Postsecondary Education, Scoring

Sampling Variability of Performance Assessments. Report on the Status of Generalizability Performance: Generalizability and Transfer of Performance Assessments. Project 2.4: Design Theory and Psychometrics for Complex Performance Assessment in Science.

Download full text

Shavelson, Richard J.; And Others – 1993

In this paper, performance assessments are cast within a sampling framework. A performance assessment score is viewed as a sample of student performance drawn from a complex universe defined by a combination of all possible tasks, occasions, raters, and measurement methods. Using generalizability theory, the authors present evidence bearing on the…

Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Evaluators

North Carolina Alternate Assessment Portfolio Pilot Program, 1999-2000. Report of Student Performance.

Download full text

North Carolina State Dept. of Public Instruction, Raleigh. Div. of Accountability/Testing. – 2001

During 1999-2000 school year, the North Carolina Alternate Assessment Portfolio was administered to eligible students with serious cognitive deficits statewide as a pilot program. This report provides state, regional, and local education agency results of that pilot program. The purpose of the pilot was to review the feasibility, validity, and…

Descriptors: Academic Achievement, American Indians, Cultural Differences, Elementary Secondary Education

A Survey of Issues and Item Writing in Language Testing.

Download full text

Strong, Gregory – Thought Currents in English Literature, 1995

This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning…

Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries

Relationship of Admission Test Scores to Writing Performance of Native and Nonnative Speakers of English.

Download full text

Carlson, Sybil B.; And Others – 1985

Four writing samples were obtained from 638 foreign college applicants who represented three major foreign language groups (Arabic, Chinese, and Spanish), and from 60 native English speakers. All four were scored holistically, two were also scored for sentence-level and discourse-level skills, and some were scored by the Writer's Workbench…

Descriptors: Arabic, Chinese, College Entrance Examinations, Computer Software

The Definition and Measurement of Small Military Unit Team Functions. Final Report, July 1980-October 1981.

Download full text

Shiflett, Samuel; And Others – 1985

A study was undertaken to improve the measurement of small team performance within the Army. A provisional taxonomy of team-level performance functions was field-validated; criteria and measures of the functions were developed; and their reliability was examined. The provisional taxonomy, used for observing Army field training exercises, was used…

Descriptors: Behavior Rating Scales, Classification, Evaluation Criteria, Evaluators

Measures of Linguistic Accuracy in Second Language Writing Research.

Peer reviewed

Polio, Charlene G. – Language Learning, 1997

Investigates the reliability of measures of linguistic accuracy in second language writing. The study uses a holistic scale, error-free T-units, and an error classification system on the essays of English-as-a-Second-Language students and discusses why disagreements arise within a rater and between raters. (24 references) (Author/CK)

Descriptors: College Students, English (Second Language), Error Analysis (Language), Error of Measurement

« Previous Page | Next Page »

Pages: 1 | ... | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | 33 | 34

Journal of Autism and…	25
Journal of Speech, Language,…	13
ProQuest LLC	13
Assessment for Effective…	12
Grantee Submission	8
International Journal of…	7
Measurement in Physical…	7
Educational and Psychological…	6
International Journal of…	6
Research in Developmental…	6
Assessment	5
Behavioral Disorders	5
Online Submission	5
Psychology in the Schools	5
Research in Developmental…	5
ETS Research Report Series	4
Journal of Positive Behavior…	4
Research Papers in Education	4
American Journal on Mental…	3
Autism: The International…	3
Center for Innovation in…	3
Developmental Medicine &…	3
Developmental Psychology	3
Education and Training in…	3
Gerontologist	3
More ▼

Epstein, Michael H.	7
Johnson, Evelyn S.	4
Matson, Johnny L.	4
Tasse, Marc J.	4
Aman, Michael G.	3
Canivez, Gary L.	3
Capie, William	3
Conroy, Maureen A.	3
Crawford, Angela R.	3
Lecavalier, Luc	3
McLeod, Bryce D.	3
Moylan, Laura A.	3
Unal, Zafer	3
Watkins, Marley W.	3
Zheng, Yuzhu	3
Aktas, Mehtap	2
Anna-Maria Fall	2
Atilgan, Hakan	2
Aydin, Selami	2
Benton, Stephen L.	2
Beula M. Magimairaj	2
Bodur, Yasar	2
Botting, Nicola	2
Breland, Hunter M.	2
More ▼

Strengths and Difficulties…	6
Test of English as a Foreign…	6
Autism Diagnostic Observation…	4
Child Behavior Checklist	4
Conners Teacher Rating Scale	4
Adjustment Scales for…	3
Adult Attachment Interview	3
Advanced Placement…	3
Behavioral and Emotional…	3
Childhood Autism Rating Scale	3
Graduate Record Examinations	3
Teacher Performance…	3
ACT Assessment	2
ACTFL Oral Proficiency…	2
Cognitive Abilities Test	2
Hamilton Rating Scale for…	2
Minnesota Multiphasic…	2
National Assessment of…	2
SAT (College Admission Test)	2
Teacher Rating Scale	2
Alabama High School…	1
Basic Reading Inventory	1
Battelle Developmental…	1
Bayley Scales of Infant…	1
Beck Anxiety Inventory	1
More ▼