ERIC - Search Results

Publication Date

In 2026	0
Since 2025	56
Since 2022 (last 5 years)	282
Since 2017 (last 10 years)	778
Since 2007 (last 20 years)	2040

Descriptor

Interrater Reliability	3122
Foreign Countries	654
Test Reliability	503
Evaluation Methods	502
Test Validity	410
Correlation	401
Scoring	347
Comparative Analysis	327
Scores	324
Validity	310
Student Evaluation	308
Measures (Individuals)	298
Evaluators	295
Rating Scales	282
Statistical Analysis	268
Higher Education	264
Psychometrics	241
Reliability	230
Observation	229
Scoring Rubrics	216
Test Construction	212
English (Second Language)	211
Teaching Methods	208
Writing Evaluation	206
Intervention	200
More ▼

Education Level

Higher Education	574
Postsecondary Education	420
Elementary Education	282
Secondary Education	179
Early Childhood Education	145
Elementary Secondary Education	120
Middle Schools	109
High Schools	85
Preschool Education	72
Junior High Schools	65
Adult Education	58
Primary Education	57
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	37
Grade 6	35
Grade 8	32
Grade 3	31
Grade 2	27
Grade 7	27
Grade 10	13
Grade 9	11
Two Year Colleges	8
More ▼

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	53
United Kingdom	46
Canada	45
Netherlands	40
China	38
California	37
United States	30
United Kingdom (England)	24
Taiwan	23
Germany	22
Japan	22
Pennsylvania	22
Florida	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
South Korea	17
Texas	17
Georgia	16
Israel	15
New Zealand	14
South Africa	14
Washington	14
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Elementary and Secondary…	3
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Showing 406 to 420 of 3,122 results Save | Export

Comparing Machine and Human Reviewers to Evaluate the Risk of Bias in Randomized Controlled Trials

Peer reviewed

Direct link

Armijo-Olivo, Susan; Craig, Rodger; Campbell, Sandy – Research Synthesis Methods, 2020

Background: Evidence from new health technologies is growing, along with demands for evidence to inform policy decisions, creating challenges in completing health technology assessments (HTAs)/systematic reviews (SRs) in a timely manner. Software can decrease the time and burden by automating the process, but evidence validating such software is…

Descriptors: Comparative Analysis, Computer Software, Decision Making, Randomized Controlled Trials

Factors Considered in the Assessment of Computer Science Engineering Capstone Projects and Their Influence on Discrepancies between Assessors

Peer reviewed

Direct link

Domínguez, César; Jaime, Arturo; García-Izquierdo, Francisco José; Olarte, Juan José – ACM Transactions on Computing Education, 2020

A capstone project is an extensive learning experience traditionally developed during a student's final academic year. Assessing such a complex assignment involves several challenges and is usually based upon the evaluations of at least two different people: the capstone project advisor, and one or more other assessors. Quantitative studies…

Descriptors: Computer Science Education, Capstone Experiences, Student Evaluation, Student Projects

A Two-Stage Method for Classroom Assessments of Essay Writing

Peer reviewed

Direct link

Humphry, Stephen Mark; Heldsinger, Sandy – Journal of Educational Measurement, 2019

To capitalize on professional expertise in educational assessment, it is desirable to develop and test methods of rater-mediated assessment that enable classroom teachers to make reliable and informative judgments. Accordingly, this article investigates the reliability of a two-stage method used by classroom teachers to assess primary school…

Descriptors: Essays, Elementary School Students, Writing (Composition), Writing Evaluation

Examining the Reliability of Scores from a Performance Assessment of Practice-Based Competencies

Peer reviewed

Direct link

Roduta Roberts, Mary; Alves, Cecilia Brito; Werther, Karin; Bahry, Louise M. – Journal of Psychoeducational Assessment, 2019

The purpose of this study was to examine the reliability and sources of score variation from a performance assessment of practice competencies within an occupational therapy program. Data from 99 students who participated in a practical exam were examined. A generalizability analysis of analytic, total, and overall holistic scores was completed…

Descriptors: Performance Based Assessment, Test Reliability, Scores, Occupational Therapy

Reliability of Informant-Report Measures of Executive Functioning in Children with Down Syndrome

Peer reviewed

Direct link

Esbensen, Anna J.; Hoffman, Emily K.; Shaffer, Rebecca; Chen, Elizabeth; Patel, Lina; Jacola, Lisa – American Journal on Intellectual and Developmental Disabilities, 2019

The current study evaluates the psychometric properties of the Behavior Rating Inventory of Executive Function (BRIEF) with children with Down syndrome. Caregivers of 84 children with Down syndrome rated their child's behavior with the BRIEF. Teacher ratings were obtained for 57 children. About 40% of children with Down syndrome were reported by…

Descriptors: Executive Function, Children, Down Syndrome, Behavior Rating Scales

Validity of Comparative Judgement to Assess Academic Writing: Examining Implications of Its Holistic Character and Building on a Shared Consensus

Peer reviewed

Direct link

van Daal, Tine; Lesterhuis, Marije; Coertjens, Liesje; Donche, Vincent; De Maeyer, Sven – Assessment in Education: Principles, Policy & Practice, 2019

Recently, comparative judgement has been introduced as an alternative method for scoring essays. Although this method is promising in terms of obtaining reliable scores, empirical evidence concerning its validity is lacking. The current study examines implications resulting from two critical assumptions underpinning the use of comparative…

Descriptors: Academic Discourse, Validity, Writing Evaluation, Value Judgment

Objective Laryngoscopic Measures from Older Patients with Voice Complaints and Signs of Aging

Peer reviewed

Direct link

Stager, Sheila V.; Gupta, Simran; Amdur, Richard; Bielamowicz, Steven A. – Journal of Speech, Language, and Hearing Research, 2021

Purpose: The purpose of this study was to use objective measures of glottal gap, bowing, and supraglottic compression from selected images of laryngoscopic examinations from adults over 60 years of age with voice complaints and signs of aging to test current hypotheses on whether degree of severity impacts treatment recommendations and potential…

Descriptors: Older Adults, Patients, Aging (Individuals), Voice Disorders

Investigating the Consistency between Students' and Teachers' Ratings for the Assessment of Problem-Solving Skills with Many-Facet Rasch Measurement Model

Peer reviewed
PDF on ERIC

Download full text

Saritas Akyol, Seyhan; Karakaya, Ismail – Eurasian Journal of Educational Research, 2021

Purpose: To assess students' problem-solving skills, this study aims to investigate the consistency between self- and peer-ratings in consideration of the teachers' ratings in the process. Method: This study was a descriptive study which examines the mathematical problem-solving skills with the MFRM model concerning self-, peer- and teachers'…

Descriptors: Problem Solving, Item Response Theory, Self Evaluation (Individuals), Peer Evaluation

Fairness in Oral Language Assessment: Training Raters and Considering Examinees' Expectations

Peer reviewed
PDF on ERIC

Download full text

Doosti, Mehdi; Ahmadi Safa, Mohammad – International Journal of Language Testing, 2021

This study examined the effect of rater training on promoting inter-rater reliability in oral language assessment. It also investigated whether rater training and the consideration of the examinees' expectations by the examiners have any effect on test-takers' perceptions of being fairly evaluated. To this end, four raters scored 31 Iranian…

Descriptors: Oral Language, Language Tests, Interrater Reliability, Training

Psychometrics of the Pragmatic Rating Scale for School-Age Children with a Range of Linguistic and Social Communication Skills

Peer reviewed

Direct link

Dillon, Emily; Holingue, Calliope; Herman, Dana; Landa, Rebecca J. – Journal of Speech, Language, and Hearing Research, 2021

Purpose: Social communication or pragmatic skills are continuously distributed in the general population. Impairment in these skills is associated with two clinical disorders, autism spectrum disorder (ASD) and social (pragmatic) communication disorder. Such impairment can impact a child's peer acceptance, school performance, and current and later…

Descriptors: Psychometrics, Pragmatics, Rating Scales, Elementary School Students

The Longitudinal Stability of Rating Characteristics in an EFL Examination: Methodological and Substantive Considerations

Peer reviewed

Direct link

Lamprianou, Iasonas; Tsagari, Dina; Kyriakou, Nansia – Language Testing, 2021

This longitudinal study (2002-2014) investigates the stability of rating characteristics of a large group of raters over time in the context of the writing paper of a national high-stakes examination. The study uses one measure of rater severity and two measures of rater consistency. The results suggest that the rating characteristics of…

Descriptors: Longitudinal Studies, Evaluators, High Stakes Tests, Writing Evaluation

Rubric for Assessing Thinking Skills in Free-Response Exam Problems

Peer reviewed

Direct link

Al-Salmani, Fatema; Thacker, Beth – Physical Review Physics Education Research, 2021

We designed a rubric to assess free-response exam problems in order to compare thinking skills evidenced in exams in classes taught by different pedagogies. The rubric was designed based on Bloom's taxonomy and then used to code exam problems. We have analyzed historical and recent exam problems in both algebra-based and calculus-based exams. In…

Descriptors: Inquiry, Thinking Skills, Scoring Rubrics, Algebra

The Development of a Test to Explore the Students' Mental Models and External Representation Patterns of Hanging Objects

Peer reviewed
PDF on ERIC

Download full text

Kaharu, Sarintan N.; Mansyur, Jusman – Pegem Journal of Education and Instruction, 2021

This study aims to develop a test that can be used to explore mental models and representation patterns of objects in liquid fluid. The test developed by adapting the Reeves's Development Model was carried out in several stages, namely: determining the orientation and test segments; initial survey; preparation of the initial draft; try out;…

Descriptors: Test Construction, Schemata (Cognition), Scientific Concepts, Water

Identifying the Core Vocabulary for Adults with Complex Communication Needs from the British National Corpus by Analyzing Grouped Frequency Distributions

Peer reviewed

Direct link

Shin, Sangeun; Park, HyunJu; Hill, Katya – Journal of Speech, Language, and Hearing Research, 2021

Purpose: This study is aimed to identify the high-frequency vocabulary (HFV), otherwise termed "core vocabulary" for adults with complex communication needs. Method: Three major characteristics of the HFV--a relatively small number of different words (NDW), a relatively high word frequency, and a high word commonality across…

Descriptors: Word Frequency, Vocabulary Skills, Adults, Age Differences

Monitoring the Performance of Human and Automated Scores for Spoken Responses

Peer reviewed

Direct link

Wang, Zhen; Zechner, Klaus; Sun, Yu – Language Testing, 2018

As automated scoring systems for spoken responses are increasingly used in language assessments, testing organizations need to analyze their performance, as compared to human raters, across several dimensions, for example, on individual items or based on subgroups of test takers. In addition, there is a need in testing organizations to establish…

Descriptors: Automation, Scoring, Speech Tests, Language Tests

« Previous Page | Next Page »

Pages: 1 | ... | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | ... | 209

ProQuest LLC	86
Journal of Speech, Language,…	62
Educational and Psychological…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	39
Online Submission	35
International Journal of…	34
Assessment & Evaluation in…	33
Research in Developmental…	31
Applied Measurement in…	28
Advances in Health Sciences…	26
Assessment for Effective…	26
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	23
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15
More ▼

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5
More ▼

Journal Articles	2553
Reports - Research	2241
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	130
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	30
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	11
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
SAT (College Admission Test)	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
International English…	7
Teacher Performance…	6
ACT Assessment	5
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Classroom Assessment Scoring…	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACTFL Oral Proficiency…	4
More ▼