ERIC - Search Results

Publication Date

In 2026	1
Since 2025	144
Since 2022 (last 5 years)	857
Since 2017 (last 10 years)	1953
Since 2007 (last 20 years)	4728

Descriptor

Reliability	6225
Validity	2514
Foreign Countries	2452
Measures (Individuals)	1434
Correlation	1178
Factor Analysis	1145
Statistical Analysis	953
Student Attitudes	889
Questionnaires	864
Scores	753
Psychometrics	677
College Students	660
Comparative Analysis	622
Higher Education	546
Teacher Attitudes	505
Evaluation Methods	479
Teaching Methods	473
Undergraduate Students	455
Gender Differences	435
Academic Achievement	421
Construct Validity	415
Factor Structure	414
Models	411
Elementary School Students	403
Likert Scales	398
More ▼

Education Level

Higher Education	1645
Postsecondary Education	1363
Secondary Education	741
Elementary Education	663
High Schools	340
Middle Schools	319
Early Childhood Education	246
Junior High Schools	225
Elementary Secondary Education	173
Primary Education	123
Preschool Education	111
Intermediate Grades	101
Grade 8	96
Grade 4	92
Grade 5	92
Grade 7	86
Grade 3	78
Grade 6	78
Kindergarten	68
Grade 9	55
Grade 1	53
Adult Education	52
Grade 2	50
Grade 10	49
Grade 11	42
More ▼

Audience

Researchers	111
Practitioners	31
Teachers	17
Administrators	14
Policymakers	9
Counselors	5
Students	2

Location

Turkey	440
Australia	117
Canada	112
China	112
Taiwan	95
Nigeria	91
Indonesia	85
United States	81
Netherlands	80
Spain	68
Hong Kong	63
Malaysia	63
United Kingdom	62
United Kingdom (England)	61
Iran	59
Germany	57
California	52
Jordan	52
South Korea	50
India	44
Florida	42
Greece	40
Thailand	40
Pennsylvania	38
Finland	36
More ▼

What Works Clearinghouse Rating

Meets WWC Standards with or without Reservations	1
Does not meet standards	2

Showing 1 to 15 of 6,225 results Save | Export

Brief Research Report: Effects of Sampling Error and Categorization on Estimation of Measure of Sampling Adequacy

Peer reviewed

Direct link

Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024

The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…

Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias

How Consistent Are Humans When Grading Programming Assignments?

Peer reviewed

Direct link

Marcus Messer; Neil C. C. Brown; Michael Kölling; Miaojing Shi – ACM Transactions on Computing Education, 2025

Providing consistent summative assessment to students is important, as the grades they are awarded affect their progression through university and future career prospects. While small cohorts are typically assessed by a single assessor, such as the module/class leader, larger cohorts are often assessed by multiple assessors, typically teaching…

Descriptors: Foreign Countries, Grading, Interrater Reliability, Teaching Assistants

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Reporting and Measuring English School Qualifications: A Case Study of General Certificate of Secondary Education Results in Survey and Linked Administrative Data in the UK Millennium Cohort Study

Peer reviewed

Direct link

Sarah Stopforth; Roxanne Connelly; Vernon Gayle – Cambridge Journal of Education, 2025

Data on educational qualifications is essential in many research domains. The UK Millennium Cohort Study collected self-reported General Certificate of Secondary Education (GCSE) data in sweep 7 (cohort members aged 17). GCSE data from the National Pupil Database (NPD) has been linked to the MCS. This study investigates the consistency of these…

Descriptors: Foreign Countries, Adolescents, Case Studies, Secondary Education

Reliability of Ratings of an English Language Arts Curriculum with the Curriculum Evaluation Guidelines

Peer reviewed

Direct link

Matthew K. Burns; Heba Z. Abdelnaby; Jonie B. Welland; Katherine A. Graves; Kari Kurto – Assessment for Effective Intervention, 2024

The current study examined the reliability of The Reading League Curriculum-Evaluation Guidelines (CEGs), which were developed to help school-based teams rate the presence of red flags when considering adopting specific literacy curricula. Coders (n = 30) independently used the CEGs to evaluate a free online English language arts curriculum. The…

Descriptors: English Curriculum, English Instruction, Language Arts, Curriculum Evaluation

Validity and Intrarater Reliability of the Fysiometer--Measuring Eccentric Knee Flexor Force during the Nordic Hamstring Exercise

Peer reviewed

Direct link

Morten Pallisgaard Støve; Mathias Kringelholt Kristensen; Jonas Nielsen; Lea Dyhrberg Madsen – Measurement in Physical Education and Exercise Science, 2025

Between limb strength, asymmetry is a leading risk factor for hamstring strain re-injury. However, few accurate testing methodologies are available in clinical settings. This study examined the validity and reliability of eccentric knee flexor torque measured with a novel Nordic Hamstring Device. Twenty-seven healthy participants were assessed in…

Descriptors: Validity, Reliability, Human Body, Foreign Countries

Trust the "Process"? When Fundamental Motor Skill Scores Are Reliably Unreliable

Peer reviewed

Direct link

Hulteen, Ryan M.; True, Larissa; Kroc, Edward – Measurement in Physical Education and Exercise Science, 2023

The typical process for assessing inter-rater reliability is facilitated by training raters within a research team. Lacking is an understanding if inter-rater reliability scores "between" research teams demonstrate adequate reliability. This study examined inter-rater reliability between 16 researchers who assessed fundamental motor…

Descriptors: Psychomotor Skills, Scores, Reliability, Interrater Reliability

Treatment Fidelity in a Feasibility Trial of the Aphasia Intervention, Virtual Elaborated Semantic Feature Analysis

Peer reviewed

Direct link

Niamh Devane; Sofia Mazzoleni; Nicholas Behn; Jane Marshall; Stephanie Wilson; Katerina Hilari – International Journal of Language & Communication Disorders, 2025

Background and Aims: The reliability and validity of an intervention can be improved by checking treatment fidelity (TF). TF methods identify core components of an intervention, check their presence (or absence) and identify threats to fidelity. The Virtual Elaborated Semantic Feature Analysis (VESFA) intervention comprised individual sessions of…

Descriptors: Aphasia, Intervention, Fidelity, Feasibility Studies

Mixed Model Generalizability Theory: A Case Study and Tutorial

Peer reviewed
PDF on ERIC

Download full text

Alan Huebner; Gustaf B. Skar; Mengchen Huang – Practical Assessment, Research & Evaluation, 2025

Generalizability theory is a modern and powerful framework for conducting reliability analyses. It is flexible to accommodate both random and fixed facets. However, there has been a relative scarcity in the practical literature on how to handle the fixed facet case. This article aims to provide practitioners a conceptual understanding and…

Descriptors: Generalizability Theory, Multivariate Analysis, Statistical Analysis, Writing Evaluation

The Scale of Sincerity Based on Kyai Haji Ahmad Dahlan's Version for Islamic Students: The Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Wahyu Nanda Eka Saputra; Trikinasih Handayani; Prima Suci Rohmadheny; Rohmatus Naini; Dody Hartanto; Hardi Santosa; Dewi Afra Khairunnisa; Risma Risansyah; Hanan Riati; Faturrahman – Journal of Education and Learning (EduLearn), 2025

The students are urged to do something without expecting anything in return and only in the name of God. Every islamic student becomes something ideal if they can internalize and implement sincerity. Many people are willing to do something because of an ulterior motive. The importance of sincerity in humans is the background for developing a…

Descriptors: Islam, Interrater Reliability, Prosocial Behavior, Muslims

Exploring Ranking Consistency of Generative AI in MOOC Platform Evaluation: A Non-Parametric Approach

Peer reviewed
PDF on ERIC

Download full text

Victor K. Y. Chan – International Association for Development of the Information Society, 2025

This paper extends a prior study on the consistency of generative Artificial Intelligence (AI) models in evaluating Massive Open Online Course (MOOC) platforms. While the original work focused on the consistency of direct numerical scores, this research investigates the consistency of the rankings derived from those scores. When evaluating…

Descriptors: Artificial Intelligence, MOOCs, Reliability, Evaluation Methods

Toward Sufficient Statistical Power in Algorithmic Bias Assessment: A Test for ABROCA

Peer reviewed
PDF on ERIC

Download full text

Conrad Borchers – International Educational Data Mining Society, 2025

Algorithmic bias is a pressing concern in educational data mining (EDM), as it risks amplifying inequities in learning outcomes. The Area Between ROC Curves (ABROCA) metric is frequently used to measure discrepancies in model performance across demographic groups to quantify overall model fairness. However, its skewed distribution--especially when…

Descriptors: Algorithms, Bias, Statistics, Simulation

Classification Consistency and Accuracy Indices for Simple Structure MIRT Model

Peer reviewed

Direct link

Huan Liu; Won-Chan Lee – Journal of Educational Measurement, 2025

This study investigates the estimation of classification consistency and accuracy indices for composite summed and theta scores within the SS-MIRT framework, using five popular approaches, including the Lee, Rudner, Guo, Bayesian EAP, and Bayesian MCMC approaches. The procedures are illustrated through analysis of two real datasets and further…

Descriptors: Classification, Reliability, Accuracy, Item Response Theory

IRT Scoring and Recursion for Estimating Reliability and Other Accuracy Indices

Peer reviewed

Direct link

Tim Moses; YoungKoung Kim – Journal of Educational Measurement, 2025

This study considers the estimation of marginal reliability and conditional accuracy measures using a generalized recursion procedure with several IRT-based ability and score estimators. The estimators include MLE, TCC, and EAP abilities, and corresponding test scores obtained with different weightings of the item scores. We consider reliability…

Descriptors: Item Response Theory, Scoring, Reliability, Accuracy

Validity and Reliability of Child-Friendly School Policy Evaluation Instruments in Primary Schools: Confirmatory Factor Analysis

Peer reviewed
PDF on ERIC

Download full text

Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024

Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to proving the reliability and validity of CFS policy evaluation instruments in elementary schools with different locations. This investigation uses the Context Input Process Product (CIPP)…

Descriptors: Validity, Reliability, School Policy, Program Evaluation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 415

Educational and Psychological…	125
Online Submission	106
Educational Research and…	80
Journal of Education and…	72
Measurement in Physical…	62
Journal of Psychoeducational…	59
Educational Sciences: Theory…	55
International Education…	54
Journal of Speech, Language,…	53
Psychological Assessment	48
Research on Social Work…	47
Psychology in the Schools	46
Measurement and Evaluation in…	44
Grantee Submission	42
Journal of Autism and…	42
Assessment & Evaluation in…	39
Child Development	38
Eurasian Journal of…	35
International Journal of…	35
Developmental Psychology	34
ETS Research Report Series	33
International Journal of…	32
Applied Measurement in…	31
Journal of Education and…	29
Advances in Health Sciences…	28
More ▼

Price, Gary G.	14
Briesch, Amy M.	13
Fraser, Barry J.	12
Riley-Tillman, T. Chris	11
Thompson, Bruce	10
Chafouleas, Sandra M.	9
Christ, Theodore J.	9
Gill, Brian	9
Haberman, Shelby J.	9
Tsai, Chin-Chung	9
Attali, Yigal	7
Francis, Leslie J.	7
Lane, Kathleen Lynne	7
Lipscomb, Stephen	7
Marsh, Herbert W.	7
Matson, Johnny L.	7
Chiang, Hanley	6
Kulinna, Pamela Hodges	6
Lee, Yong-Won	6
Liang, Chaoyun	6
Onslow, Mark	6
Onwuegbuzie, Anthony J.	6
Petscher, Yaacov	6
Polikoff, Morgan S.	6
More ▼

Reports - Research	6225
Journal Articles	5515
Tests/Questionnaires	366
Speeches/Meeting Papers	319
Information Analyses	141
Numerical/Quantitative Data	42
Opinion Papers	17
Guides - Non-Classroom	13
Reports - Evaluative	13
Reports - Descriptive	10
Books	8
Non-Print Media	7
Multilingual/Bilingual…	4
Reports - General	4
Guides - Classroom - Teacher	2
Historical Materials	2
Legal/Legislative/Regulatory…	2
Reports -…	2
Book/Product Reviews	1
Collected Works - General	1
Collected Works - Proceedings	1
Collected Works - Serial	1
Dissertations/Theses	1
Dissertations/Theses -…	1
Dissertations/Theses -…	1
More ▼

No Child Left Behind Act 2001	17
Individuals with Disabilities…	12
Race to the Top	6
Elementary and Secondary…	5
Individuals with Disabilities…	4
Americans with Disabilities…	3
Every Student Succeeds Act…	3
Comprehensive Employment and…	2
Elementary and Secondary…	2
Head Start	2
Rehabilitation Act 1973…	2
Adoption Assistance and Child…	1
Adoption and Safe Families…	1
Child Abuse Prevention and…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Family Educational Rights and…	1
National School Lunch Act 1970	1
Universal Declaration of…	1
More ▼

Wechsler Intelligence Scale…	23
Child Behavior Checklist	19
Motivated Strategies for…	19
Program for International…	17
Peabody Picture Vocabulary…	16
SAT (College Admission Test)	15
Strengths and Difficulties…	15
Beck Depression Inventory	13
Behavior Assessment System…	12
Test of English as a Foreign…	12
Academic Motivation Scale	11
Teacher Efficacy Scale	11
ACT Assessment	10
Early Childhood Environment…	10
Iowa Tests of Basic Skills	10
MacArthur Communicative…	10
Marlowe Crowne Social…	10
Stanford Achievement Tests	10
Learning Style Inventory	9
Social Skills Rating System	9
Autism Diagnostic Observation…	8
Dynamic Indicators of Basic…	8
Learning and Study Strategies…	8
Maslach Burnout Inventory	8
Self Directed Search	8
More ▼