ERIC - Search Results

Publication Date

In 2026	0
Since 2025	5
Since 2022 (last 5 years)	11
Since 2017 (last 10 years)	19
Since 2007 (last 20 years)	51

Descriptor

Foreign Countries	54
Interrater Reliability	54
Reliability	54
Validity	24
Statistical Analysis	12
Correlation	11
Scores	11
Comparative Analysis	8
Rating Scales	8
Scoring Rubrics	8
Elementary School Students	7
Measures (Individuals)	7
Psychometrics	7
Student Evaluation	7
Children	6
College Students	6
Factor Analysis	6
Observation	6
Teaching Methods	6
Feedback (Response)	5
Likert Scales	5
Qualitative Research	5
Scoring	5
Second Language Learning	5
Student Attitudes	5
More ▼

Publication Type

Journal Articles	51
Reports - Research	46
Reports - Evaluative	4
Tests/Questionnaires	3
Dissertations/Theses -…	2
Information Analyses	2
Numerical/Quantitative Data	1
Reports - Descriptive	1

Audience

Researchers

Location

Canada	6
Turkey	6
Australia	5
Netherlands	4
Taiwan	4
United States	4
China	3
Italy	3
Norway	3
Spain	3
United Kingdom (England)	3
Belgium	2
Finland	2
Indonesia	2
Singapore	2
Thailand	2
United Kingdom	2
Argentina	1
Bahrain	1
Brazil	1
China (Beijing)	1
Guatemala	1
Iceland	1
India	1
Iran	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Environment…	2
Autism Diagnostic Observation…	1
Draw a Person Test	1
Neale Analysis of Reading…	1
Parenting Stress Index	1
Pediatric Evaluation of…	1
Test of Gross Motor…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 54 results Save | Export

How Consistent Are Humans When Grading Programming Assignments?

Peer reviewed

Direct link

Marcus Messer; Neil C. C. Brown; Michael Kölling; Miaojing Shi – ACM Transactions on Computing Education, 2025

Providing consistent summative assessment to students is important, as the grades they are awarded affect their progression through university and future career prospects. While small cohorts are typically assessed by a single assessor, such as the module/class leader, larger cohorts are often assessed by multiple assessors, typically teaching…

Descriptors: Foreign Countries, Grading, Interrater Reliability, Teaching Assistants

Trust the "Process"? When Fundamental Motor Skill Scores Are Reliably Unreliable

Peer reviewed

Direct link

Hulteen, Ryan M.; True, Larissa; Kroc, Edward – Measurement in Physical Education and Exercise Science, 2023

The typical process for assessing inter-rater reliability is facilitated by training raters within a research team. Lacking is an understanding if inter-rater reliability scores "between" research teams demonstrate adequate reliability. This study examined inter-rater reliability between 16 researchers who assessed fundamental motor…

Descriptors: Psychomotor Skills, Scores, Reliability, Interrater Reliability

Reporting and Measuring English School Qualifications: A Case Study of General Certificate of Secondary Education Results in Survey and Linked Administrative Data in the UK Millennium Cohort Study

Peer reviewed

Direct link

Sarah Stopforth; Roxanne Connelly; Vernon Gayle – Cambridge Journal of Education, 2025

Data on educational qualifications is essential in many research domains. The UK Millennium Cohort Study collected self-reported General Certificate of Secondary Education (GCSE) data in sweep 7 (cohort members aged 17). GCSE data from the National Pupil Database (NPD) has been linked to the MCS. This study investigates the consistency of these…

Descriptors: Foreign Countries, Adolescents, Case Studies, Secondary Education

The Scale of Sincerity Based on Kyai Haji Ahmad Dahlan's Version for Islamic Students: The Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Wahyu Nanda Eka Saputra; Trikinasih Handayani; Prima Suci Rohmadheny; Rohmatus Naini; Dody Hartanto; Hardi Santosa; Dewi Afra Khairunnisa; Risma Risansyah; Hanan Riati; Faturrahman – Journal of Education and Learning (EduLearn), 2025

The students are urged to do something without expecting anything in return and only in the name of God. Every islamic student becomes something ideal if they can internalize and implement sincerity. Many people are willing to do something because of an ulterior motive. The importance of sincerity in humans is the background for developing a…

Descriptors: Islam, Interrater Reliability, Prosocial Behavior, Muslims

Interdisciplinary Thinking among Seventh-Grade Students in Lower-Secondary Science Education

Peer reviewed
PDF on ERIC

Download full text

Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025

Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…

Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7

Validity and Reliability of Speech Data in the Norwegian Registry of Cleft Lip and Palate

Peer reviewed

Direct link

Øydis Hide; Dagrun Slettebø Daltveit; Åse Sivertsen; Anne Katherine Hvistendahl; Randi Lovise Kjerstad; Marit Berntsen Kvinnsland; Nina Helen Pedersen; Christina Sørensen – International Journal of Language & Communication Disorders, 2025

Background: Cleft lip and palate (CLP) treatment in Norway is centralized and multidisciplinary, with long-term follow-up from birth to adulthood. The Norwegian Registry of Cleft Lip and Palate was established to ensure high-quality care and enable systematic data collection. Speech data are a key component, assessed by speech--language therapists…

Descriptors: Foreign Countries, Validity, Reliability, Data Collection

Using Systematic Social Observations to Measure Crime Prevention through Environmental Design and Disorder: In-situ Observations, Photographs, and Google Street View Imagery

Peer reviewed

Direct link

Sas, Marlies; Snaphaan, Thom; Pauwels, Lieven J. R.; Ponnet, Koen; Hardyns, Wim – Field Methods, 2023

This study focuses on the use of systematic social observations (SSO) to measure crime prevention through environmental design (CPTED) and disorder. To improve knowledge about measurement issues in small area research, SSO is conducted by means of three different methods: in-situ, photographs, and Google Street View (GSV) imagery. By evaluating…

Descriptors: Crime Prevention, Measurement Techniques, Photography, Observation

Visualizing Agreement: Bland-Altman Plots as a Supplement to Inter-Rater Reliability Indices

Peer reviewed

Direct link

Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024

Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…

Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques

Intra- and Inter-Rater Reliability of the Behaviour Mapping Schedule: A Direct Observational Tool for Classifying Children's Play Behaviour

Peer reviewed

Direct link

Dankiw, Kylie A.; Baldock, Katherine L.; Kumar, Saravana; Tsiros, Margarita D. – Australasian Journal of Early Childhood, 2021

Identifying and describing children's play behaviours is an important component of evaluating child development. The Behaviour Mapping Schedule is a direct observational tool which aims to describe and quantify children's play behaviours but is yet to undergo reliability testing. This study aimed to determine the intra- and inter-rater reliability…

Descriptors: Interrater Reliability, Classification, Child Behavior, Play

Spanish Validation of the Impact of Event Scale for People with Intellectual Disabilities, IES-ID

Peer reviewed

Direct link

Nuñez-Polo, Mercedes H. – Journal of Mental Health Research in Intellectual Disabilities, 2022

Introduction: The aim of this study is to validate a Spanish version of the Impact of Event Scale on People with ID (IES-ID). Methods: IES-ID was administered to adults with ID (n = 120), analyzing internal consistency, inter-rater and test-retest reliability, criterion validity, construct validity and feasibility. Results: Good internal…

Descriptors: Spanish, Translation, Construct Validity, Factor Analysis

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

An Empirical Study of English Language Teachers' Methodology on the Career Growth of Saudi Students

Peer reviewed
PDF on ERIC

Download full text

Alkhanani, Badriah – International Journal of Language Education, 2022

The purpose of this study was to find the effect of English Language Teachers' Methodology (ELTM) on the Career Growth (CG) of the Saudi students. In order to provide a solid basis for this research study, a cross-sectional-descriptive research design was employed. For scale development and tool standardization, inter-class correlation…

Descriptors: Career Development, English (Second Language), Second Language Learning, Second Language Instruction

Investigating the Psychometric Properties of the ACEI Global Guidelines Assessment, Third Edition (GGA) in Nine Countries

Peer reviewed

Direct link

Hardin, Belinda J.; Bergen, Doris; Busio, Dionne Sills; Boone, William – Early Childhood Education Journal, 2017

The Third Edition of the ACEI Global Guidelines Assessment (GGA) was evaluated for its effectiveness as an international assessment tool for use by early childhood educators to develop, assess, and improve program quality worldwide. This expanded study was conducted in nine countries [People's Republic of China (2 sites), Guatemala, India, Italy,…

Descriptors: Foreign Countries, International Assessment, Early Childhood Education, Psychometrics

The Different Impact of a Structured Peer-Assessment Task in Relation to University Undergraduates' Initial Writing Skills

Peer reviewed

Direct link

Ramon-Casas, Marta; Nuño, Neus; Pons, Ferran; Cunillera, Toni – Assessment & Evaluation in Higher Education, 2019

This article presents an empirical evaluation of the validity and reliability of a peer-assessment activity to improve academic writing competences. Specifically, we explored a large group of psychology undergraduate students with different initial writing skills. Participants (n = 365) produced two different essays, which were evaluated by their…

Descriptors: Peer Evaluation, Validity, Reliability, Writing Skills

Brief Report: An Exploratory Study of the Diagnostic Reliability for Autism Spectrum Disorder

Peer reviewed

Direct link

Taylor, Lauren J.; Eapen, Valsamma; Maybery, Murray; Midford, Sue; Paynter, Jessica; Quarmby, Lyndsay; Smith, Timothy; Williams, Katrina; Whitehouse, Andrew J. – Journal of Autism and Developmental Disorders, 2017

Previous research shows inconsistency in clinician-assigned diagnoses of Autism Spectrum Disorder (ASD). We conducted an exploratory study that examined the concordance of diagnoses between a multidisciplinary assessment team and a range of independent clinicians throughout Australia. Nine video-taped Autism Diagnostic Observation Schedule (ADOS)…

Descriptors: Autism, Pervasive Developmental Disorders, Clinical Diagnosis, Foreign Countries

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Assessment & Evaluation in…	3
Educational Sciences: Theory…	2
International Journal of…	2
Journal of International…	2
Online Submission	2
ProQuest LLC	2
ACM Transactions on Computing…	1
Advances in Health Sciences…	1
Advances in Physiology…	1
Australasian Journal of Early…	1
Australian Journal of…	1
Cambridge Journal of Education	1
Chemistry Education Research…	1
Creativity Research Journal	1
Crime & Delinquency	1
Developmental Psychology	1
Early Child Development and…	1
Early Childhood Education…	1
Education and Treatment of…	1
Educational Psychology in…	1
Educational Research	1
Electronic Journal of…	1
Eurasian Journal of…	1
Field Methods	1
Grantee Submission	1
More ▼

Akalin, Selma	1
Aljunied, Mariam	1
Alkhanani, Badriah	1
Amanda Huee-Ping Wong	1
Andreou, Theresa E.	1
Anne Katherine Hvistendahl	1
Arbaiy, Nurieze	1
Bal, Aydin	1
Baldock, Katherine L.	1
Barone, Lavinia	1
Batdi, Veli	1
Berg, Marie	1
Bergen, Doris	1
Bettany-Saltikov, Josette	1
Bijlsma, Hannah J. E.	1
Bilginer, Hayriye	1
Boone, William	1
Born, Marise Ph.	1
Britton, Emily	1
Brogan L. Barr	1
Busio, Dionne Sills	1
Butterwick, Dale	1
Christina Sørensen	1
Chu, Szu-Yin	1
Chuang, Tsung-Yen	1
More ▼

Higher Education	17
Postsecondary Education	16
Elementary Education	10
Secondary Education	7
Early Childhood Education	5
High Schools	3
Preschool Education	3
Grade 1	2
Grade 2	2
Grade 4	2
Primary Education	2
Adult Education	1
Elementary Secondary Education	1
Grade 3	1
Grade 6	1
Grade 7	1
High School Equivalency…	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
More ▼