ERIC - Search Results

Publication Date

In 2026	0
Since 2025	58
Since 2022 (last 5 years)	264
Since 2017 (last 10 years)	511
Since 2007 (last 20 years)	1261

Descriptor

Evaluation Methods	2746
Test Reliability	1410
Test Validity	992
Reliability	965
Student Evaluation	567
Validity	515
Interrater Reliability	502
Foreign Countries	445
Test Construction	366
Higher Education	360
Measurement Techniques	306
Psychometrics	276
Evaluation Criteria	252
Elementary Secondary Education	239
Measures (Individuals)	228
Correlation	220
Scores	215
Comparative Analysis	212
Statistical Analysis	194
Models	193
Rating Scales	192
Questionnaires	186
College Students	185
Program Evaluation	166
Factor Analysis	162
More ▼

Education Level

Higher Education	401
Postsecondary Education	276
Elementary Education	160
Elementary Secondary Education	148
Secondary Education	114
Early Childhood Education	75
Middle Schools	65
High Schools	60
Junior High Schools	37
Adult Education	34
Primary Education	33
Preschool Education	32
Grade 4	23
Grade 5	22
Grade 6	21
Kindergarten	21
Grade 3	20
Intermediate Grades	20
Grade 1	19
Grade 8	18
Grade 7	16
Grade 2	14
Grade 10	10
Grade 12	7
Grade 11	5
More ▼

Audience

Researchers	137
Practitioners	99
Teachers	41
Administrators	32
Policymakers	17
Students	13
Counselors	5
Support Staff	3
Community	1
Media Staff	1
Parents	1
More ▼

Location

Australia	45
United Kingdom	41
Canada	31
United Kingdom (England)	29
China	28
United States	28
Turkey	27
California	22
Florida	21
Netherlands	19
Israel	16
Texas	15
North Carolina	12
Pennsylvania	12
Spain	12
Taiwan	12
Germany	11
Indonesia	11
New York	9
South Korea	9
Illinois	8
Iran	8
Malaysia	8
South Africa	8
Tennessee	8
More ▼

Laws, Policies, & Programs

Every Student Succeeds Act…	11
No Child Left Behind Act 2001	11
Individuals with Disabilities…	8
Elementary and Secondary…	5
Race to the Top	5
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
American Recovery and…	1
Americans with Disabilities…	1
Comprehensive Employment and…	1
Education Amendments 1974	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Womens Educational Equity Act	1
More ▼

What Works Clearinghouse Rating

Does not meet standards

Evaluation Methods X

Showing 61 to 75 of 2,746 results Save | Export

Assessing the Quality of Student-Generated Content at Scale: A Comparative Analysis of Peer-Review Models

Peer reviewed

Direct link

Darvishi, Ali; Khosravi, Hassan; Rahimi, Afshin; Sadiq, Shazia; Gasevic, Dragan – IEEE Transactions on Learning Technologies, 2023

Engaging students in creating learning resources has demonstrated pedagogical benefits. However, to effectively utilize a repository of student-generated content (SGC), a selection process is needed to separate high- from low-quality resources as some of the resources created by students can be ineffective, inappropriate, or incorrect. A common…

Descriptors: Student Developed Materials, Educational Assessment, Peer Evaluation, Evaluation Methods

Semi-Automatic Assessment of Vocalization Quality for Children With and Without Angelman Syndrome

Peer reviewed

Direct link

Lisa R. Hamrick; Amanda Seidl; Bridgette L. Kelleher – American Journal on Intellectual and Developmental Disabilities, 2023

Automated methods for processing of daylong audio recordings are efficient and may be an effective way of assessing developmental stage for typically developing children; however, their utility for children with developmental disabilities may be limited by constraints of algorithms and the scope of variables produced. Here, we present a novel…

Descriptors: Genetic Disorders, Developmental Disabilities, Child Development, Verbal Communication

How Reliable and Valid Are the Evaluations of Digital Competence in Higher Education: A Systematic Mapping Study

Peer reviewed

Direct link

Saltos-Rivas, Rafael; Novoa-Hernández, Pavel; Serrano Rodríguez, Rocío – SAGE Open, 2022

Evaluating digital competencies has become a topic of growing interest in recent years. Although several reviews and studies have summarized the main elements of progress and shortcomings in this area, some issues are yet to be explored. Very little information is available about the ways of ensuring the validity and reliability of the instrument…

Descriptors: Test Reliability, Test Validity, Evaluation Methods, Technological Literacy

Which Scale Short Form Development Method Is Better? A Comparison of ACO, TS, and SCOFA

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2022

The purpose of this study is to identify which scale short-form development method produces better findings in different factor structures. A simulation study was designed based on this purpose. Three different factor structures and three simulation conditions were selected. As the findings of this simulation study, the model-data fit and…

Descriptors: Test Construction, Measures (Individuals), Factor Structure, Test Reliability

Simple Techniques to Bypass GenAI Text Detectors: Implications for Inclusive Education

Peer reviewed

Direct link

Mike Perkins; Jasper Roe; Binh H. Vu; Darius Postma; Don Hickerson; James McGaughran; Huy Q. Khuat – International Journal of Educational Technology in Higher Education, 2024

This study investigates the efficacy of six major Generative AI (GenAI) text detectors when confronted with machine-generated content modified to evade detection (n = 805). We compare these detectors to assess their reliability in identifying AI-generated text in educational settings, where they are increasingly used to address academic integrity…

Descriptors: Artificial Intelligence, Inclusion, Computer Software, Word Processing

Operationalizing a Weighted Performance Scoring Model for Sustainable e-Learning in Medical Education: Insights from Expert Judgement

Peer reviewed
PDF on ERIC

Download full text

Deborah Oluwadele; Yashik Singh; Timothy Adeliyi – Electronic Journal of e-Learning, 2024

Validation is needed for any newly developed model or framework because it requires several real-life applications. The investment made into e-learning in medical education is daunting, as is the expectation for a positive return on investment. The medical education domain requires data-wise implementation of e-learning as the debate continues…

Descriptors: Electronic Learning, Evaluation Methods, Medical Education, Sustainability

Studying Score Stability with a Harmonic Regression Family: A Comparison of Three Approaches to Adjustment of Examinee-Specific Demographic Data

Peer reviewed

Direct link

Lee, Yi-Hsuan; Haberman, Shelby J. – Journal of Educational Measurement, 2021

For assessments that use different forms in different administrations, equating methods are applied to ensure comparability of scores over time. Ideally, a score scale is well maintained throughout the life of a testing program. In reality, instability of a score scale can result from a variety of causes, some are expected while others may be…

Descriptors: Scores, Regression (Statistics), Demography, Data

Using Machine Learning to Score Multi-Dimensional Assessments of Chemistry and Physics

Peer reviewed

Direct link

Maestrales, Sarah; Zhai, Xiaoming; Touitou, Israel; Baker, Quinton; Schneider, Barbara; Krajcik, Joseph – Journal of Science Education and Technology, 2021

In response to the call for promoting three-dimensional science learning (NRC, 2012), researchers argue for developing assessment items that go beyond rote memorization tasks to ones that require deeper understanding and the use of reasoning that can improve science literacy. Such assessment items are usually performance-based constructed…

Descriptors: Artificial Intelligence, Scoring, Evaluation Methods, Chemistry

Analyzing Inter-Rater Variation: Exploring Consistency in Mathematics Teachers' Scoring of Exam Papers

Peer reviewed
PDF on ERIC

Download full text

Hosseinali Gholami – Mathematics Teaching Research Journal, 2025

Scoring mathematics exam papers accurately is vital for fostering students' engagement and interest in the subject. Incorrect scoring practices can erode motivation and lead to the development of false self-confidence. Therefore, the implementation of appropriate scoring methods is essential for the success of mathematics education. This study…

Descriptors: Interrater Reliability, Mathematics Teachers, Scoring, Mathematics Tests

A Systematic Review of Early Writing Assessment Tools

Peer reviewed

Direct link

Katherine L. Buchanan; Milena Keller-Margulis; Amanda Hut; Weihua Fan; Sarah S. Mire; G. Thomas Schanding Jr. – Early Childhood Education Journal, 2025

There is considerable research regarding measures of early reading but much less in early writing. Nevertheless, writing is a critical skill for success in school and early difficulties in writing are likely to persist without intervention. A necessary step toward identifying those students who need additional support is the use of screening…

Descriptors: Writing Evaluation, Evaluation Methods, Emergent Literacy, Beginning Writing

Utilizing Deep Learning AI to Analyze Scientific Models: Overcoming Challenges

Peer reviewed

Direct link

Tingting Li; Kevin Haudek; Joseph Krajcik – Journal of Science Education and Technology, 2025

Scientific modeling is a vital educational practice that helps students apply scientific knowledge to real-world phenomena. Despite advances in AI, challenges in accurately assessing such models persist, primarily due to the complexity of cognitive constructs and data imbalances in educational settings. This study addresses these challenges by…

Descriptors: Artificial Intelligence, Scientific Concepts, Models, Automation

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed
PDF on ERIC

Download full text

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Same Grade for Different Reasons, Different Grades for the Same Reason?

Peer reviewed

Direct link

Ilona Rinne – Assessment & Evaluation in Higher Education, 2024

It is widely acknowledged in research that common criteria and aligned standards do not result in consistent assessment of such a complex performance as the final undergraduate thesis. Assessment is determined by examiners' understanding of rubrics and their views on thesis quality. There is still a gap in the research literature about how…

Descriptors: Foreign Countries, Undergraduate Students, Teacher Education Programs, Evaluation Criteria

Reimagining Balanced Assessment Systems

Download full text

Scott F. Marion, Editor; James W. Pellegrino, Editor; Amy I. Berman, Editor – National Academy of Education, 2024

High-quality assessments are crucial to many aspects of the educational process. They can help policymakers monitor long-term educational trends, assist state educational agencies (SEAs) and local educational agencies (LEAs) in allocating resources and professional development opportunities, provide insights to teachers about how well students…

Descriptors: Educational Assessment, Educational Policy, Equal Education, Test Validity

The Reliability of Using ChatGPT in Rating EFL Writings

Peer reviewed
PDF on ERIC

Download full text

Yang Yang – Shanlax International Journal of Education, 2024

This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…

Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 184

Raykov, Tenko	9
Epstein, Michael H.	7
Jaeger, Richard M.	7
Matson, Johnny L.	7
Amrein-Beardsley, Audrey	6
Follman, John	6
Gill, Brian	6
Gresham, Frank M.	6
Thompson, Bruce	6
Fink, Arlene	5
Marcoulides, George A.	5
Bastick, Tony	4
Cason, Carolyn L.	4
Cason, Gerald J.	4
Deno, Stanley L.	4
Elliott, Stephen N.	4
Erford, Bradley T.	4
Eva, Kevin W.	4
Fitz-Gibbon, Carol Taylor	4
Hambleton, Ronald K.	4
Herman, Joan L.	4
Horner, Robert H.	4
Koretz, Daniel	4
Lembke, Erica S.	4
More ▼

Journal Articles	1867
Reports - Research	1427
Reports - Evaluative	534
Reports - Descriptive	293
Speeches/Meeting Papers	247
Information Analyses	142
Tests/Questionnaires	129
Opinion Papers	113
Guides - Non-Classroom	74
Dissertations/Theses -…	67
Books	35
Guides - Classroom - Teacher	25
Numerical/Quantitative Data	25
ERIC Publications	15
Reference Materials -…	14
Collected Works - Proceedings	10
Reports - General	10
Collected Works - General	9
Guides - General	9
Collected Works - Serials	7
ERIC Digests in Full Text	7
Guides - Classroom - Learner	3
Book/Product Reviews	2
Legal/Legislative/Regulatory…	2
Non-Print Media	2
More ▼

National Assessment of…	11
Wechsler Intelligence Scale…	10
Child Behavior Checklist	8
Praxis Series	7
Aberrant Behavior Checklist	6
Program for International…	6
Teacher Performance…	6
Woodcock Johnson Tests of…	6
ACT Assessment	5
Bayley Scales of Infant…	5
Dynamic Indicators of Basic…	5
Minnesota Multiphasic…	5
SAT (College Admission Test)	5
Stanford Achievement Tests	5
Behavior Assessment System…	4
MacArthur Communicative…	4
Peabody Picture Vocabulary…	4
Social Skills Rating System	4
Trends in International…	4
Advanced Placement…	3
Autism Diagnostic Observation…	3
Beck Anxiety Inventory	3
Behavioral and Emotional…	3
Clinical Evaluation of…	3
Graduate Management Admission…	3
More ▼

ProQuest LLC	66
Educational and Psychological…	46
Assessment & Evaluation in…	28
Journal of Autism and…	28
Online Submission	25
Advances in Health Sciences…	19
Grantee Submission	19
Journal of Psychoeducational…	19
Psychology in the Schools	19
Psychological Assessment	18
School Psychology Review	17
Measurement and Evaluation in…	16
Journal of Educational…	15
American Journal on Mental…	14
Assessment for Effective…	14
Assessment and Evaluation in…	13
Journal of Speech and Hearing…	13
Personnel Psychology	13
Applied Measurement in…	12
Research in Developmental…	12
Research in Developmental…	12
Applied Psychological…	11
Child Abuse & Neglect: The…	11
Diagnostique	11
Journal of Speech, Language,…	11
More ▼