ERIC - Search Results

Publication Date

In 2025	13
Since 2024	37
Since 2021 (last 5 years)	98
Since 2016 (last 10 years)	270
Since 2006 (last 20 years)	550

Descriptor

Foreign Countries	645
Interrater Reliability	645
English (Second Language)	119
Correlation	110
Test Reliability	100
Second Language Learning	98
Test Validity	90
Comparative Analysis	84
Evaluation Methods	78
Student Evaluation	77
Scores	76
Statistical Analysis	74
Validity	69
Evaluators	68
Language Tests	67
Questionnaires	65
Second Language Instruction	65
College Students	64
Teaching Methods	58
Scoring	57
Writing Evaluation	56
Measures (Individuals)	54
Elementary School Students	53
Psychometrics	52
Rating Scales	52

Publication Type

Journal Articles	602
Reports - Research	528
Reports - Evaluative	77
Tests/Questionnaires	60
Speeches/Meeting Papers	21
Reports - Descriptive	19
Information Analyses	13
Dissertations/Theses -…	7
Opinion Papers	6
Numerical/Quantitative Data	3
Books	1
Collected Works - Proceedings	1
Guides - General	1
Reports - General	1

Education Level

Higher Education	214
Postsecondary Education	165
Elementary Education	71
Secondary Education	57
Early Childhood Education	35
Elementary Secondary Education	18
Preschool Education	17
Adult Education	16
High Schools	15
Middle Schools	12
Primary Education	12
Intermediate Grades	11
Grade 4	10
Kindergarten	10
Grade 1	8
Grade 6	8
Grade 3	6
Grade 5	6
Junior High Schools	6
Grade 11	5
Grade 2	5
Grade 10	3
Grade 7	3
Grade 8	3
Grade 9	2

Audience

Researchers	6
Practitioners	5
Teachers	3
Policymakers	1

Location

Australia	56
Turkey	51
United Kingdom	45
Canada	44
Netherlands	39
China	36
United Kingdom (England)	24
Taiwan	23
Japan	22
Sweden	21
Germany	20
United States	20
Iran	19
Hong Kong	17
South Korea	16
Israel	15
South Africa	14
New Zealand	13
Spain	12
Finland	11
India	11
Singapore	11
Belgium	10
Italy	9
Norway	9

Showing 1 to 15 of 645 results

Inconsistencies in Rater-Based Assessments Mainly Affect Borderline Candidates: But Using Simple Heuristics Might Improve Pass-Fail Decisions

Peer reviewed

Stefan K. Schauber; Anne O. Olsen; Erik L. Werner; Morten Magelssen – Advances in Health Sciences Education, 2024

Introduction: Research in various areas indicates that expert judgment can be highly inconsistent. However, expert judgment is indispensable in many contexts. In medical education, experts often function as examiners in rater-based assessments. Here, disagreement between examiners can have far-reaching consequences. The literature suggests that…

Descriptors: Medical Students, Performance Based Assessment, Expertise, Interrater Reliability

Test-Retest and Inter-Rater Reliability for Selected Outcomes from a Wearable 3D Inertial Sensor over Different Stable and Unstable Postural Conditions: A Validation Study

Peer reviewed

Samuel D'Emanuele; Francesca Nardello; Fabrizio Garau; Diego Campaci; Federico Schena; Cantor Tarperi – Measurement in Physical Education and Exercise Science, 2025

The agreement between a wearable inertial sensor (GYKO, G) and the force platform (P) was assessed by evaluating "test-retest" and "inter-rater reliability." Thirty-eight subjects were enrolled; the selected indices of balance were investigated over foot positions and (un)stable conditions. Intraclass correlation coefficient…

Descriptors: Human Posture, Measurement Equipment, Interrater Reliability, Measurement Techniques

Human versus Machine: The Effectiveness of ChatGPT in Automated Essay Scoring

Peer reviewed

Jennifer Manning; Jeffrey Baldwin; Natasha Powell – Innovations in Education and Teaching International, 2025

As ChatGPT continues to reshape student engagement and instructional design, it is crucial to examine its practical implications. This study aims to evaluate the effectiveness of ChatGPT3.5 and ChatGPT4 as potential automated essay scoring (AES) systems. Fifty authentic, student-written annotated bibliographies were evaluated by three human raters…

Descriptors: Foreign Countries, Essays, Writing Evaluation, Artificial Intelligence

Communal Factors in Rater Severity and Consistency over Time in High-Stakes Oral Assessment

Peer reviewed

Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…

Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory

Profiling Communication Ability in Dementia: Validation of a New Cognitive-Communication Assessment Tool

Peer reviewed

Suzanna Dooley; Tammy Hopper; Rachael Doyle; Orla Gilheaney; Margaret Walshe – International Journal of Language & Communication Disorders, 2025

Background: Individuals with dementia have communication limitations resulting from cognitive impairments that define the syndrome. Whereas there are numerous cognitive assessments for individuals with dementia, there are far fewer communication assessments. The Profiling Communication Ability in Dementia (P-CAD) was developed to address this gap.…

Descriptors: Communication Skills, Communication Problems, Dementia, Intellectual Disability

Validity and Reliability of Cognitive Constructivism-Oriented Teaching Conception Questionnaire

Peer reviewed

Duong Thi Ngoc Ngan; Maria Hercz – Asia-Pacific Education Researcher, 2024

As there is a paucity of instrument investigating a hybrid teaching conception, the current study is seen as part of attempt to fill this gap. The subjects in the study were 310 University participants--instructors in Socialist Republic of Viet Nam (Vietnam). The survey was implemented with the use of Cognitive Constructivism-oriented Teaching…

Descriptors: Blended Learning, Faculty, Teaching Methods, Foreign Countries

The Reliability of Expert Diagnosis of Childhood Apraxia of Speech

Peer reviewed

Elizabeth Murray; Shelley Velleman; Jonathan L. Preston; Robert Heard; Akhila Shibu; Patricia McCabe – Journal of Speech, Language, and Hearing Research, 2024

Purpose: The current standard for clinical diagnosis of childhood apraxia of speech (CAS) is expert clinician judgment. The psychometric properties of this standard are not well understood; however, they are important for improving clinical diagnosis. The purpose of this study is to determine the extent to which experts agree on the clinical…

Descriptors: Neurological Impairments, Speech Impairments, Preschool Children, Adolescents

Trust the "Process"? When Fundamental Motor Skill Scores Are Reliably Unreliable

Peer reviewed

Hulteen, Ryan M.; True, Larissa; Kroc, Edward – Measurement in Physical Education and Exercise Science, 2023

The typical process for assessing inter-rater reliability is facilitated by training raters within a research team. Lacking is an understanding if inter-rater reliability scores "between" research teams demonstrate adequate reliability. This study examined inter-rater reliability between 16 researchers who assessed fundamental motor…

Descriptors: Psychomotor Skills, Scores, Reliability, Interrater Reliability

Assessing Social Communication and Measuring Changes in Chinese Autistic Preschoolers: A Preliminary Study Using the Social Communication Scale

Peer reviewed

Li Wang; Xin Qi; Ziyan Meng; Meiyu Xiang; Zhuoqing Li; Sitong Zhang; Longyun Hu; Hoyee W. Hirai; Carol K. S. To; Patrick C. M. Wong – Journal of Speech, Language, and Hearing Research, 2025

Purpose: Assessing social communication and measuring its changes among young autistic children presents significant challenges, particularly when tracking intervention effects within short timeframes. Existing measures, mostly validated in Western contexts, may not be suitable for culturally diverse populations. Addressing this gap, the Social…

Descriptors: Autism Spectrum Disorders, Preschool Children, Interpersonal Communication, Communication Skills

Emotional Social Screening Tool for School Readiness -- Revised: IsiXhosa Adaptation

Peer reviewed
PDF on ERIC

Erica Munnik; Prenita Reddi; Mario R. Smith – South African Journal of Childhood Education, 2025

Background: The need for the adaptation of instruments to other native languages to promote culture-fairness framed this study. Aim: This article reports on the adaptation of the locally developed Emotional Social Screening Tool for School Readiness (E3SR-R) into isiXhosa. Setting: This adaptation study was conducted in South Africa. Methods: The…

Descriptors: Screening Tests, Emotional Development, Social Development, School Readiness

Inter-Evaluator Reliability of Sagittal and Rotational Spinal Measurements from 3D Ultrasound Imaging of Healthy Females in Standing with Varying Arm Positions

Peer reviewed

Aislinn Ganci; Miran Qazizada; Brianna Fehr; Ana Vucenovic; Edmond Lou; Eric Parent – Measurement in Physical Education and Exercise Science, 2024

Spinal alignment can be assessed without radiation using three-dimensional ultrasound imaging (3DUS). Reliable measurements could inform the ideal arm position for scoliosis radiographs. This study determined the inter-evaluator reliability of axial vertebral rotation (AVR) measurements and sagittal curve angles in healthy females from 3DUS spinal…

Descriptors: Foreign Countries, Young Adults, Adults, Adolescents

The Politics of Reading Textbooks: Intergenerational and International Reflections on China

Peer reviewed

Liz Jackson; Michael W. Apple; Fei Yan; Jason Cong Lin; Chenxi Jiang; Tongzhou Li; Edward Vickers – Educational Philosophy and Theory, 2024

In this collective essay the authors consider the nature and consequences of reading and researching across difference in an international and intergenerational team, whose core members are focused on understanding how curriculum operates and the nature of textbook representation of diversity in Mainland China, Hong Kong, Taiwan, and Macau.…

Descriptors: Foreign Countries, Textbooks, Reading Research, Educational Research

Reliability and Validity of the ARMIDILO-S in Sex Offenders with Intellectual Disabilities

Peer reviewed

Pouls, Claudia; Jeandarme, Inge – Journal of Mental Health Research in Intellectual Disabilities, 2023

Background: The ARMIDILO-S is advocated as a promising tool for assessing dynamic risk factors in sex offenders with intellectual disabilities (SOIDs). However, research remains scarce. The present study aimed to further validate this instrument in SOIDs. Method: The study prospectively followed 38 SOIDs for up to one year to test the accuracy of…

Descriptors: Test Reliability, Test Validity, Sexual Abuse, Criminals

Same Grade for Different Reasons, Different Grades for the Same Reason?

Peer reviewed

Ilona Rinne – Assessment & Evaluation in Higher Education, 2024

It is widely acknowledged in research that common criteria and aligned standards do not result in consistent assessment of such a complex performance as the final undergraduate thesis. Assessment is determined by examiners' understanding of rubrics and their views on thesis quality. There is still a gap in the research literature about how…

Descriptors: Foreign Countries, Undergraduate Students, Teacher Education Programs, Evaluation Criteria

The Reliability of Using ChatGPT in Rating EFL Writings

Peer reviewed
PDF on ERIC

Yang Yang – Shanlax International Journal of Education, 2024

This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…

Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods


Source

Assessment & Evaluation in…	17
Language Testing	14
Online Submission	13
Journal of Speech, Language,…	11
Journal of Autism and…	10
Language Assessment Quarterly	10
Advances in Health Sciences…	9
Early Child Development and…	9
International Journal of…	9
Measurement in Physical…	9
Assessment in Education:…	7
Educational Sciences: Theory…	7
English Language Teaching	7
Journal of Intellectual…	7
Research Papers in Education	7
Child Language Teaching and…	6
Journal of Intellectual &…	6
ProQuest LLC	6
Research in Developmental…	6
Studies in Higher Education	6
Autism: The International…	5
Creativity Research Journal	5
Eurasian Journal of…	5
International Education…	5
Journal of Baltic Science…	5

Author

Greatorex, Jackie	5
Baird, Jo-Anne	4
Coniam, David	4
Sata, Mehmet	4
Karakaya, Ismail	3
Ahmadi, Alireza	2
Aktas, Mehtap	2
Atilgan, Hakan	2
Aydin, Selami	2
Bahreini, Kiavash	2
Beijaard, Douwe	2
Bell, John F.	2
Bijani, Houman	2
Black, Beth	2
Bramley, Tom	2
Chan, Roger W.	2
Chen, Ching-I	2
Cheung, Wai Ming	2
De Maeyer, Sven	2
Dempster, Edith R.	2
Einarsdottir, Johanna	2
Erman Aslanoglu, Aslihan	2
Gillberg, Christopher	2
Gridley, Nicole	2

Assessments and Surveys

Test of English as a Foreign…	11
Strengths and Difficulties…	6
Autism Diagnostic Observation…	5
Raven Progressive Matrices	5
Child Behavior Checklist	4
International English…	4
Program for International…	4
Early Childhood Environment…	3
Mullen Scales of Early…	3
Trends in International…	3
Vineland Adaptive Behavior…	3
Wechsler Adult Intelligence…	3
Behavioral and Emotional…	2
Classroom Assessment Scoring…	2
Draw a Person Test	2
Obsessive Compulsive Scale	2
Peabody Picture Vocabulary…	2
Pediatric Evaluation of…	2
Preschool and Kindergarten…	2
Test of English for…	2
Adult Attachment Interview	1
Beck Anxiety Inventory	1
Beck Depression Inventory	1
Behavior Assessment System…	1
Behavior Problem Checklist	1