ERIC - Search Results

Publication Date

In 2025

Descriptor

Interrater Reliability	38
Foreign Countries	13
Evaluation Methods	10
Artificial Intelligence	9
Test Reliability	9
Test Validity	9
Evaluation Criteria	7
Scoring	7
Error of Measurement	6
Scores	6
Scoring Rubrics	6
Student Evaluation	6
Accuracy	5
Evaluators	5
Psychometrics	5
Automation	4
College Students	4
Comparative Testing	4
Undergraduate Students	4
Computer Assisted Testing	3
Correlation	3
Engineering Education	3
English (Second Language)	3
Essays	3
Measurement Techniques	3

Publication Type

Journal Articles	38
Reports - Research	36
Tests/Questionnaires	2
Information Analyses	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	13
Postsecondary Education	13
Secondary Education	8
High Schools	3
Junior High Schools	3
Middle Schools	3
Early Childhood Education	2
Elementary Education	2
Adult Education	1
Grade 1	1
Grade 2	1
Grade 3	1
Grade 7	1
Primary Education	1

Location

China	3
Belgium	1
Canada	1
Germany	1
Illinois (Urbana)	1
Indonesia	1
Ireland	1
Israel	1
Italy	1
North Carolina	1
Saudi Arabia	1
South Africa	1
South Korea	1
Texas	1
Thailand	1

Assessments and Surveys

Autism Diagnostic Observation…	1
Classroom Assessment Scoring…	1
Mullen Scales of Early…	1
Strengths and Difficulties…	1
Vineland Adaptive Behavior…	1

Showing 1 to 15 of 38 results

Automated Scoring in Learning Progression-Based Assessment: A Comparison of Researcher and Machine Interpretations

Peer reviewed

Hui Jin; Cynthia Lima; Limin Wang – Educational Measurement: Issues and Practice, 2025

Although AI transformer models have demonstrated notable capability in automated scoring, it is difficult to examine how and why these models fall short in scoring some responses. This study investigated how transformer models' language processing and quantification processes can be leveraged to enhance the accuracy of automated scoring. Automated…

Descriptors: Automation, Scoring, Artificial Intelligence, Accuracy

Technical Adequacy-Reliability

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

Reliability of a Frequency Method for Assessing Vegetable Intake Using Photos among College Students: A Smart Phone Approach

Peer reviewed

Heena Suthar; Krisha Thiagarajah; Ibraheem Karaye; Zayra Teresa Lopez-Ixta; Trishnee Bhurosy – Journal of American College Health, 2025

Objective: To measure the interrater reliability of assessing the frequency of vegetable intake using mobile photos and descriptions. Design: Repeated measures design. Setting: A Midwestern university. Participants: Undergraduate students (N = 165). Measurable Outcome/Analysis: Number of times each of these vegetable subgroups were consumed daily:…

Descriptors: Interrater Reliability, Incidence, Food, Eating Habits

Examining the Psychometric Impact of Targeted and Random Double-Scoring in Mixed-Format Assessments

Peer reviewed

Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025

Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…

Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods

Test-Retest and Inter-Rater Reliability for Selected Outcomes from a Wearable 3D Inertial Sensor over Different Stable and Unstable Postural Conditions: A Validation Study

Peer reviewed

Samuel D'Emanuele; Francesca Nardello; Fabrizio Garau; Diego Campaci; Federico Schena; Cantor Tarperi – Measurement in Physical Education and Exercise Science, 2025

The agreement between a wearable inertial sensor (GYKO, G) and the force platform (P) was assessed by evaluating "test-retest" and "inter-rater reliability." Thirty-eight subjects were enrolled; the selected indices of balance were investigated over foot positions and (un)stable conditions. Intraclass correlation coefficient…

Descriptors: Human Posture, Measurement Equipment, Interrater Reliability, Measurement Techniques

Development of a Categorical Scoring Codebook for Entrepreneurial Mindset (EM) Concept Maps

Peer reviewed

Alexandra Jackson; Cheryl Bodnar; Elise Barrella; Juan Cruz; Krista Kecskemety – Journal of STEM Education: Innovations and Research, 2025

Recent curricular interventions in engineering education have focused on encouraging students to develop an entrepreneurial mindset (EM) to equip them with the skills needed to generate innovative ideas and address complex global problems upon entering the workforce. Methods to evaluate these interventions have been inconsistent due to the lack of…

Descriptors: Engineering Education, Entrepreneurship, Concept Mapping, Student Evaluation

Developing an Automatic Pronunciation Scorer: Aligning Speech Evaluation Models and Applied Linguistics Constructs

Peer reviewed

Danwei Cai; Ben Naismith; Maria Kostromitina; Zhongwei Teng; Kevin P. Yancey; Geoffrey T. LaFlair – Language Learning, 2025

Globalization and increases in the numbers of English language learners have led to a growing demand for English proficiency assessments of spoken language. In this paper, we describe the development of an automatic pronunciation scorer built on state-of-the-art deep neural network models. The model is trained on a bespoke human-rated dataset that…

Descriptors: Automation, Scoring, Pronunciation, Speech Tests

Human versus Machine: The Effectiveness of ChatGPT in Automated Essay Scoring

Peer reviewed

Jennifer Manning; Jeffrey Baldwin; Natasha Powell – Innovations in Education and Teaching International, 2025

As ChatGPT continues to reshape student engagement and instructional design, it is crucial to examine its practical implications. This study aims to evaluate the effectiveness of ChatGPT3.5 and ChatGPT4 as potential automated essay scoring (AES) systems. Fifty authentic, student-written annotated bibliographies were evaluated by three human raters…

Descriptors: Foreign Countries, Essays, Writing Evaluation, Artificial Intelligence

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Profiling Communication Ability in Dementia: Validation of a New Cognitive-Communication Assessment Tool

Peer reviewed

Suzanna Dooley; Tammy Hopper; Rachael Doyle; Orla Gilheaney; Margaret Walshe – International Journal of Language & Communication Disorders, 2025

Background: Individuals with dementia have communication limitations resulting from cognitive impairments that define the syndrome. Whereas there are numerous cognitive assessments for individuals with dementia, there are far fewer communication assessments. The Profiling Communication Ability in Dementia (P-CAD) was developed to address this gap.…

Descriptors: Communication Skills, Communication Problems, Dementia, Intellectual Disability

Informant Discrepancies in Universal Screening as a Function of Student and Teacher Characteristics

Peer reviewed

Brittany N. Zakszeski; Heather E. Ormiston; Malena A. Nygaard; Kane Carlock – School Psychology Review, 2025

Despite the widespread use of school-based universal screening systems for social, emotional, and behavioral risk, limited research has examined discrepancies in ratings provided by teachers and their secondary students. Using the Social, Academic, and Emotional Behavior Risk Screener (SAEBRS; teacher report) and mySAEBRS (student report) scores…

Descriptors: Middle School Students, Middle School Teachers, Screening Tests, Affective Behavior

Assessing Social Communication and Measuring Changes in Chinese Autistic Preschoolers: A Preliminary Study Using the Social Communication Scale

Peer reviewed

Li Wang; Xin Qi; Ziyan Meng; Meiyu Xiang; Zhuoqing Li; Sitong Zhang; Longyun Hu; Hoyee W. Hirai; Carol K. S. To; Patrick C. M. Wong – Journal of Speech, Language, and Hearing Research, 2025

Purpose: Assessing social communication and measuring its changes among young autistic children presents significant challenges, particularly when tracking intervention effects within short timeframes. Existing measures, mostly validated in Western contexts, may not be suitable for culturally diverse populations. Addressing this gap, the Social…

Descriptors: Autism Spectrum Disorders, Preschool Children, Interpersonal Communication, Communication Skills

Emotional Social Screening Tool for School Readiness -- Revised: IsiXhosa Adaptation

Peer reviewed

Erica Munnik; Prenita Reddi; Mario R. Smith – South African Journal of Childhood Education, 2025

Background: The need for the adaptation of instruments to other native languages to promote culture-fairness framed this study. Aim: This article reports on the adaptation of the locally developed Emotional Social Screening Tool for School Readiness (E3SR-R) into isiXhosa. Setting: This adaptation study was conducted in South Africa. Methods: The…

Descriptors: Screening Tests, Emotional Development, Social Development, School Readiness

The Impact of Online Peer Assessment on Academic Performance in Higher Education: A Meta-Analytic Review

Peer reviewed

Kübra Karakaya Özyer – Journal of Educators Online, 2025

This meta-analytic study investigates the impact of online peer assessment on academic achievement in higher education. By synthesizing 20 effect sizes, we provide a comprehensive understanding of how online peer assessment influences student learning outcomes. The findings reveal a statistically significant positive effect (Hedges's g = 0.672),…

Descriptors: Electronic Learning, Peer Evaluation, Higher Education, Meta Analysis

Analyzing Inter-Rater Variation: Exploring Consistency in Mathematics Teachers' Scoring of Exam Papers

Peer reviewed

Hosseinali Gholami – Mathematics Teaching Research Journal, 2025

Scoring mathematics exam papers accurately is vital for fostering students' engagement and interest in the subject. Incorrect scoring practices can erode motivation and lead to the development of false self-confidence. Therefore, the implementation of appropriate scoring methods is essential for the success of mathematics education. This study…

Descriptors: Interrater Reliability, Mathematics Teachers, Scoring, Mathematics Tests


