ERIC - Search Results

Publication Date

In 2026	0
Since 2025	56
Since 2022 (last 5 years)	282
Since 2017 (last 10 years)	778
Since 2007 (last 20 years)	2040

Descriptor

Interrater Reliability	3122
Foreign Countries	654
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	230
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	179
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	85
Preschool Education	72
Junior High Schools	65
Adult Education	58
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	24
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 316 to 330 of 3,122 results

Leveraging Telehealth to Evaluate Infants with Prodromal Autism Spectrum Disorder Characteristics Using the Telehealth Evaluation of Development for Infants

Peer reviewed

Talbott, Meagan R.; Dufek, Sarah; Young, Greg; Rogers, Sally J. – Autism: The International Journal of Research and Practice, 2022

This study investigated the feasibility of recruiting and assessing infants with prodromal autism characteristics in the first year of life via telehealth. Participants included 41 infants (mean age = 10.51 months, 51.2% female, 80.5% White) whose parents had concerns about social communication delays or autism. All infants met concerns criteria on a…

Descriptors: Infants, Autism Spectrum Disorders, At Risk Persons, Symptoms (Individual Disorders)

Comparison of Inter-Rater Reliability Techniques in Performance-Based Assessment

Peer reviewed
PDF on ERIC

Arslan Mancar, Sinem; Gulleroglu, H. Deniz – International Journal of Assessment Tools in Education, 2022

The aim of this study is to analyse the importance of the number of raters and compare the results obtained by techniques based on Classical Test Theory (CTT) and Generalizability (G) Theory. The Kappa and Krippendorff alpha techniques based on CTT were used to determine the inter-rater reliability. In this descriptive research data consists of…

Descriptors: Comparative Analysis, Interrater Reliability, Advanced Placement, Scoring Rubrics

Revolutionising Essay Evaluation: A Cutting-Edge Rubric for AI-Assisted Writing

Peer reviewed

Hassan Saleh Mahdi; Ahmed Alkhateeb – International Journal of Computer-Assisted Language Learning and Teaching, 2025

This study aims to develop a robust rubric for evaluating artificial intelligence (AI)-assisted essay writing in English as a Foreign Language (EFL) contexts. Employing a modified Delphi technique, we conducted a comprehensive literature review and administered Likert scale questionnaires. This process yielded nine key evaluation criteria,…

Descriptors: Scoring Rubrics, Essays, Writing Evaluation, Artificial Intelligence

The Mandarin Version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) and Its Reliability

Peer reviewed

Chen, Zhen; Fang, Rui; Zhang, Yi; Ge, Pingjiang; Zhuang, Peiyun; Chou, Adriana; Jiang, Jack – Journal of Speech, Language, and Hearing Research, 2018

Purpose: The purpose of this study is to develop the Mandarin version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) and evaluate its reliability compared with the Grade, Roughness, Breathiness, Asthenia, Strain (GRBAS). Method: The Mandarin version of the CAPE-V tool was translated from the validated English version with…

Descriptors: Voice Disorders, Diagnostic Tests, Mandarin Chinese, Test Reliability

The Generalizability of Running Record Accuracy and Self-Correction Scores

Peer reviewed

D'Agostino, Jerome V.; Rodgers, Emily; Winkler, Christa; Johnson, Tracy; Berenbon, Rebecca – Reading Psychology, 2021

Running Records provide a standardized method for recording and assessing students' oral reading behaviors and are excellent formative assessment tools to guide instructional decision-making. This study expands on prior Running Record reliability work by evaluating the extent to which external raters and teachers consistently assessed students'…

Descriptors: Accuracy, Oral Reading, Generalizability Theory, Error Correction

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

Methodologies for Investigating and Interpreting Student-Teacher Rating Incongruence in Noncognitive Assessment

Peer reviewed

Flake, Jessica Kay; Petway, Kevin Terrance, II – Educational Measurement: Issues and Practice, 2019

Numerous studies merely note divergence in students' and teachers' ratings of student noncognitive constructs. However, given the increased attention and use of these constructs in educational research and practice, an in-depth study focused on this issue was needed. Using a variety of quantitative methodologies, we thoroughly investigate…

Descriptors: Teachers, Students, Achievement Rating, Interrater Reliability

Reliability of Essay Ratings: A Study on Generalizability Theory

Peer reviewed
PDF on ERIC

Atilgan, Hakan – Eurasian Journal of Educational Research, 2019

Purpose: This study intended to examine the generalizability and reliability of essay ratings within the scope of the generalizability (G) theory. Specifically, the effect of raters on the generalizability and reliability of students' essay ratings was examined. Furthermore, variations of the generalizability and reliability coefficients with…

Descriptors: Foreign Countries, Essay Tests, Test Reliability, Interrater Reliability

Validation of an Assessment Centre Process for the Selection of School Leaders in Chile

Peer reviewed

Volante, Paulo; Valenzuela, Sergio; Díaz, Alejandro; Fernández, Magdalena; Mladinic, Antonio – School Leadership & Management, 2019

This study seeks to develop and validate an Assessment Centre (AC) tool for the evaluation and selection of school leaders, focusing on the identification of competencies that influence teaching and learning outcomes. International research supports the creation of Assessment Centres to select candidates for these roles, due to their superior…

Descriptors: Foreign Countries, Personnel Selection, School Administration, Assessment Centers (Personnel)

Assessing Movement Competence and Screening for Injury Risk in 8-12-Year-Old Children: Reliability of the Child-Focused Injury Risk Screening Tool (ChildFIRST)

Peer reviewed

Miller, Matthew B.; Jimenez-Garcia, John Alexander; Hong, Chang Ki; DeMont, Richard – Measurement in Physical Education and Exercise Science, 2020

The Child-Focused Injury Risk Screening Tool (ChildFIRST) is a process-based assessment including 10 movement skills with 4 associated evaluation criteria. The ChildFIRST has been validated by a group of experts to evaluate movement competence and injury risk in 8-12-year-olds. The purpose of this study is to evaluate the reliability of the…

Descriptors: Screening Tests, Risk Assessment, Injuries, Psychomotor Skills

A Rasch Analysis of Rater Behaviour in Speaking Assessment

Peer reviewed
PDF on ERIC

Polat, Murat – International Online Journal of Education and Teaching, 2020

The assessment of speaking skills in foreign language testing has always had some pros (testing learners' speaking skills doubles the validity of any language test) and cons (many test-relevant/irrelevant variables interfere) since it is a multi-dimensional process. In the meantime, exploring grader behaviours while scoring learners' speaking…

Descriptors: Item Response Theory, Interrater Reliability, Speech Skills, Second Language Learning

What You Don't Know about Measurement Error--And Why You Should Care

Lichtenstein, Robert – Communique, 2020

Appropriate interpretation of assessment data requires an appreciation that tools are subject to measurement error. School psychologists recognize, at least on an intellectual level, that measures are imperfect--that test scores and other quantitative measures (e.g., rating scales, systematic behavioral observations) are best estimates of…

Descriptors: Error of Measurement, Test Reliability, Pretests Posttests, Standardized Tests

Does Comparative Judgement of Scripts Provide an Effective Means of Maintaining Standards in Mathematics? Research Report

Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020

In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al. (2020) describe a method…

Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level

Developing a Tool for Measuring Student Orientations with Respect to Understanding in Mathematical Learning

Peer reviewed
PDF on ERIC

Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023

The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…

Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability

Spanish Validation of the Impact of Event Scale for People with Intellectual Disabilities, IES-ID

Peer reviewed

Nuñez-Polo, Mercedes H. – Journal of Mental Health Research in Intellectual Disabilities, 2022

Introduction: The aim of this study is to validate a Spanish version of the Impact of Event Scale on People with ID (IES-ID). Methods: IES-ID was administered to adults with ID (n = 120), analyzing internal consistency, inter-rater and test-retest reliability, criterion validity, construct validity and feasibility. Results: Good internal…

Descriptors: Spanish, Translation, Construct Validity, Factor Analysis

Source

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15

Author

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5

Publication Type

Journal Articles	2553
Reports - Research	2241
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1

Assessments and Surveys

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4