ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	7
Since 2017 (last 10 years)	16
Since 2007 (last 20 years)	70

Descriptor

Evaluation Problems	70
Test Reliability	41
Evaluation Methods	39
Test Validity	28
Interrater Reliability	19
Student Evaluation	18
Reliability	15
Educational Assessment	14
Evaluation Research	14
Measurement	12
Educational Policy	11
Foreign Countries	11
Psychometrics	11
Research Methodology	11
Validity	11
Educational Practices	10
Evaluation Criteria	10
Evidence	10
Error of Measurement	9
Program Effectiveness	9
Robustness (Statistics)	9
Teacher Effectiveness	9
Educational Research	8
Measurement Techniques	8
Models	8
More ▼

Publication Type

Journal Articles	60
Reports - Research	29
Reports - Evaluative	23
Reports - Descriptive	11
Opinion Papers	5
Dissertations/Theses -…	3
Information Analyses	2
Books	1
Collected Works - General	1
Non-Print Media	1
Speeches/Meeting Papers	1
More ▼

Education Level

Elementary Secondary Education	22
Higher Education	21
Postsecondary Education	16
Secondary Education	6
Elementary Education	4
Adult Education	3
Early Childhood Education	3
High Schools	3
Preschool Education	3
Grade 3	2
Junior High Schools	2
Middle Schools	2
Grade 4	1
Grade 5	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Administrators	2
Policymakers	1
Practitioners	1
Teachers	1

Location

Canada	4
Florida	2
United Kingdom (England)	2
Arizona	1
Australia	1
California	1
Finland	1
Hong Kong	1
Maryland	1
North Carolina	1
Portugal	1
Tennessee	1
Turkey	1
Washington	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Race to the Top	2
Individuals with Disabilities…	1

Assessments and Surveys

Program for International…	2
Stanford Achievement Tests	2
Classroom Assessment Scoring…	1
Florida Comprehensive…	1
National Assessment of…	1
Pediatric Evaluation of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 70 results Save | Export

Different Methods for Assessing Preservice Teachers' Instruction: Why Measures Matter

Peer reviewed

Direct link

Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024

Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…

Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills

Monitoring Rater Quality in Observational Systems: Issues Due to Unreliable Estimates of Rater Quality

Peer reviewed

Direct link

Mark White; Matt Ronfeldt – Educational Assessment, 2024

Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…

Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns

Detecting Test Flakiness without Rerunning Tests

Direct link

Abdulrahman Alshammari – ProQuest LLC, 2024

A critical component of modern software development practices, particularly continuous integration (CI), is the halt of development activities in response to test failures which requires further investigation and debugging. As software changes, regression testing becomes vital to verify that new code does not affect existing functionality.…

Descriptors: Computer Software, Programming, Coding, Test Reliability

Grading in Chemistry: Variations in Instructors' Evaluation of Student Written Responses

Direct link

Michelle Herridge – ProQuest LLC, 2021

Evaluation of student written work during summative assessments is an important and critical task for instructors at all educational levels. Nevertheless, few research studies exist that provide insights into how different instructors approach this task. Chemistry faculty (FIs) and graduate student instructors (GSIs) regularly engage in the…

Descriptors: Science Instruction, Chemistry, College Faculty, Teaching Assistants

Reimagining Balanced Assessment Systems

Download full text

Scott F. Marion, Editor; James W. Pellegrino, Editor; Amy I. Berman, Editor – National Academy of Education, 2024

High-quality assessments are crucial to many aspects of the educational process. They can help policymakers monitor long-term educational trends, assist state educational agencies (SEAs) and local educational agencies (LEAs) in allocating resources and professional development opportunities, provide insights to teachers about how well students…

Descriptors: Educational Assessment, Educational Policy, Equal Education, Test Validity

Analysis of Evaluation Policies in the Philanthropic Sector

Peer reviewed

Direct link

Kinarsky, Alana R.; Christie, Christina A. – American Journal of Evaluation, 2022

Since 2007, two taxonomies have been proposed to identify the components of evaluation practice that may be specified in an evaluation policy. Little is known, however, about how these taxonomies align with evaluation policies developed by philanthropic foundations. Through thematic analysis, this article first compares 12 foundation evaluation…

Descriptors: Taxonomy, Evaluation Methods, Philanthropic Foundations, Educational Policy

Online Data Collection via Questionnaires: Challenges and Solutions. Sage Research Methods: Doing Research Online

Direct link

Mojgan Rashtchi; SeyyedeFateme Ghazi Mir Saeed – Sage Research Methods Cases, 2023

The reason for conducting the present case study was the problems the researchers encountered during data collection for another research project (Primary Study) entitled "The effects of virtual versus traditional flipped classes on EFL learners' grammar knowledge, self-regulation, and autonomy." Two online questionnaires were…

Descriptors: Data Collection, Questionnaires, Barriers, Research Methodology

Rethinking Online Assessment Quality from Pre-Service Teachers Perspectives

Peer reviewed
PDF on ERIC

Download full text

Mücahit Öztürk – Open Praxis, 2024

This study examined the problems that pre-service teachers face in the online assessment process and their suggestions for solutions to these problems. The participants were 136 pre-service teachers who have been experiencing online assessment for a long time and who took the Foundations of Open and Distance Learning course. This research is a…

Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Distance Education

Inter-Rater Reliability of Washington State's Kindergarten Entry Assessment

Peer reviewed

Direct link

Joseph, Gail; Soderberg, Janet S.; Stull, Sara; Cummings, Kevin; McCutchen, Deborah; Han, Rachel J. – Early Education and Development, 2020

Research Findings: This study explores the inter-rater reliability of WaKIDS, Washington State's kindergarten entry assessment (KEA). Specifically, we analyze (1) the extent to which teachers' assessments are in agreement with a master code, (2) how often inaccurate assessment decisions lead to misidentification of school readiness, and (3)…

Descriptors: Interrater Reliability, School Readiness, Kindergarten, Evaluation Problems

The Miscalculation of Interrater Reliability: A Case Study Involving the AAC&U VALUE Rubrics

Peer reviewed
PDF on ERIC

Download full text

Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017

Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…

Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives

Let's Disagree (to Agree): Queering the Rhetoric of Agreement in Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Walker, Paul – Composition Forum, 2017

This article describes and theorizes a failed writing program assessment study to question the influence of "the rhetoric of agreement," or reliability, on writing assessment practice and its prevalence in validating institutional mandated assessments. Offering the phrase "dwelling in disagreement" as a queer perspective, the…

Descriptors: Rhetoric, Writing Tests, Test Reliability, Program Validation

Definitions Matter: Investigating and Comparing Different Operalionalizations of Academic Undermatching

Peer reviewed
PDF on ERIC

Download full text

Gansemer-Topf, Ann M.; Downey, Jillian; Genschel, Ulrike – Research & Practice in Assessment, 2017

Effective assessment practice requires clearly defining and operationalizing terminology. We illustrate the importance of this practice by focusing on academic "undermatching"--when students enroll in colleges that are less academically selective than those for which they are academically prepared. Undermatching has been viewed as a…

Descriptors: Differences, Definitions, Vocabulary, Comparative Analysis

To Be or Not to Be: Understanding University Academic English Teachers' Perceptions of Assessing Self-Directed Learning

Peer reviewed

Direct link

Lau, Ken – Innovations in Education and Teaching International, 2018

Self-directed learning, despite its growing popularity in education, has challenged conventional assessment practice which often foregrounds the presentation of identical conditions to ensure reliability. This article discusses the results of a case study of university academic English teachers' perceptions and reported practices of assessing…

Descriptors: Independent Study, Teacher Attitudes, Case Studies, Educational Practices

How Can We Help Our Students Be More Critical? Examining the Details in Questionnaire Studies

Peer reviewed
PDF on ERIC

Download full text

Direct link

Hartley, James – Psychology Teaching Review, 2017

In this article, Hartley notes the difficulties of using questionnaires to assess the efficiency of new instructional methods and highlights nine issues that researchers must consider. Hartley continues the discussion about the use of questionnaires and suggests that psychology teachers can help improve the teaching of psychology by drawing…

Descriptors: Questionnaires, Instructional Innovation, Instructional Effectiveness, Teaching Methods

Reliability of Multi-Category Rating Scales

Peer reviewed

Direct link

Parker, Richard I.; Vannest, Kimberly J.; Davis, John L. – Journal of School Psychology, 2013

The use of multi-category scales is increasing for the monitoring of IEP goals, classroom and school rules, and Behavior Improvement Plans (BIPs). Although they require greater inference than traditional data counting, little is known about the inter-rater reliability of these scales. This simulation study examined the performance of nine…

Descriptors: Rating Scales, Scaling, Interrater Reliability, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

School Psychology Review	4
Teachers College Record	4
Educational Research and…	3
ProQuest LLC	3
American Journal of Evaluation	2
Assessing Writing	2
NHSA Dialog	2
Practical Assessment,…	2
Psychological Methods	2
Research & Practice in…	2
American Educational Research…	1
Applied Measurement in…	1
Arts Education Policy Review	1
Assessment & Evaluation in…	1
Assessment in Education:…	1
British Educational Research…	1
Canadian Journal of Education	1
Carnegie Foundation for the…	1
Collected Essays on Learning…	1
Composition Forum	1
Creativity Research Journal	1
Early Education and…	1
Economics of Education Review	1
Education Policy Analysis…	1
Educational Assessment	1
More ▼

Bagnato, Stephen J.	2
Berliner, David C.	2
Macy, Marisa	2
Abdulrahman Alshammari	1
Amy I. Berman, Editor	1
Anderson, Andrew	1
Arielle Boguslav	1
Avery, Marybell	1
Baker, Beverly A.	1
Balcão Reis, Ana	1
Ballou, Dale	1
Bates, Simon P.	1
Beck, Audrey	1
Booker, Kevin	1
Bordelon, Suzanne	1
Bowman, Nicholas A.	1
Burrows, Vanessa	1
Camilli, Gregory	1
Cheng, Britte H.	1
Christie, Christina A.	1
Clarkeburn, Henriikka	1
Colker, Alexis M.	1
Conner, Jerusha Osber	1
Cooksy, Leslie J.	1
Cramer, Kenneth M.	1
More ▼