ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	15

Descriptor

Evaluation Problems	42
Reliability	42
Validity	28
Evaluation Methods	24
Educational Assessment	12
Elementary Secondary Education	11
Evaluation Criteria	10
Measurement Techniques	9
Student Evaluation	9
Research Methodology	8
Measurement	7
Models	7
Program Effectiveness	7
Educational Policy	6
Program Evaluation	6
Psychometrics	6
Research Problems	6
Definitions	5
Educational Research	5
Evaluation Research	5
Evidence	5
Higher Education	5
Intervention	5
Data Collection	4
Evaluation Needs	4
More ▼

Publication Type

Journal Articles	29
Reports - Research	13
Opinion Papers	10
Reports - Evaluative	8
Information Analyses	7
Reports - Descriptive	7
Speeches/Meeting Papers	4
ERIC Digests in Full Text	2
ERIC Publications	2
Reports - General	2
Books	1
Non-Print Media	1
More ▼

Education Level

Elementary Secondary Education	8
Higher Education	5
Postsecondary Education	4

Audience

Researchers	4
Policymakers	2
Administrators	1
Practitioners	1

Location

United Kingdom (England)	3
Georgia	1
United Kingdom (Wales)	1
United States	1
Utah	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 42 results Save | Export

Analysis of Evaluation Policies in the Philanthropic Sector

Peer reviewed

Direct link

Kinarsky, Alana R.; Christie, Christina A. – American Journal of Evaluation, 2022

Since 2007, two taxonomies have been proposed to identify the components of evaluation practice that may be specified in an evaluation policy. Little is known, however, about how these taxonomies align with evaluation policies developed by philanthropic foundations. Through thematic analysis, this article first compares 12 foundation evaluation…

Descriptors: Taxonomy, Evaluation Methods, Philanthropic Foundations, Educational Policy

Online Data Collection via Questionnaires: Challenges and Solutions. Sage Research Methods: Doing Research Online

Direct link

Mojgan Rashtchi; SeyyedeFateme Ghazi Mir Saeed – Sage Research Methods Cases, 2023

The reason for conducting the present case study was the problems the researchers encountered during data collection for another research project (Primary Study) entitled "The effects of virtual versus traditional flipped classes on EFL learners' grammar knowledge, self-regulation, and autonomy." Two online questionnaires were…

Descriptors: Data Collection, Questionnaires, Barriers, Research Methodology

Definitions Matter: Investigating and Comparing Different Operalionalizations of Academic Undermatching

Peer reviewed
PDF on ERIC

Download full text

Gansemer-Topf, Ann M.; Downey, Jillian; Genschel, Ulrike – Research & Practice in Assessment, 2017

Effective assessment practice requires clearly defining and operationalizing terminology. We illustrate the importance of this practice by focusing on academic "undermatching"--when students enroll in colleges that are less academically selective than those for which they are academically prepared. Undermatching has been viewed as a…

Descriptors: Differences, Definitions, Vocabulary, Comparative Analysis

Exogenous Variables and Value-Added Assessments: A Fatal Flaw

Peer reviewed

Direct link

Berliner, David C. – Teachers College Record, 2014

Background: There has been rapid growth in value-added assessment of teachers to meet the widely supported policy goal of identifying the most effective and the most ineffective teachers in a school system. The former group is to be rewarded while the latter group is to be helped or fired for their poor performance. But, value-added approaches to…

Descriptors: Teacher Effectiveness, Academic Achievement, Teacher Evaluation, Scores

English Value-Added Measures: Examining the Limitations of School Performance Measurement

Peer reviewed

Direct link

Perry, Thomas – British Educational Research Journal, 2016

Value-added "Progress" measures are to be introduced for all English schools in 2016 as "headline" measures of school performance. This move comes despite research highlighting high levels of instability in value-added measures and concerns about the omission of contextual variables in the planned measure. This article studies…

Descriptors: Foreign Countries, Value Added Models, School Effectiveness, Performance Based Assessment

The Elusive Nature of Reliability: Problems and Pitfalls in Scoring Clinical Practice Action Research Projects

Download full text

Moffett, David W.; Reid, Barbara K. – Online Submission, 2010

The Investigators studied scoring reliability of Candidates' ten day unit plans of instruction through prescribed action research projects, across three academic years. Scoring of the projects in year one provided opportunities for further refinement of the action research evaluation methods in year two. Across three terms in years one and two…

Descriptors: Research Projects, Action Research, Student Evaluation, Mastery Learning

Reliability and Validity of Rubrics for Assessment through Writing

Peer reviewed

Direct link

Rezaei, Ali Reza; Lovorn, Michael – Assessing Writing, 2010

This experimental project investigated the reliability and validity of rubrics in assessment of students' written responses to a social science "writing prompt". The participants were asked to grade one of the two samples of writing assuming it was written by a graduate student. In fact both samples were prepared by the authors. The…

Descriptors: Spelling, Sentence Structure, Punctuation, Social Sciences

Killeen's Probability of Replication and Predictive Probabilities: How to Compute, Use, and Interpret Them

Peer reviewed

Direct link

Lecoutre, Bruno; Lecoutre, Marie-Paule; Poitevineau, Jacques – Psychological Methods, 2010

P. R. Killeen's (2005a) probability of replication ("p[subscript rep]") of an experimental result is the fiducial Bayesian predictive probability of finding a same-sign effect in a replication of an experiment. "p[subscript rep]" is now routinely reported in "Psychological Science" and has also begun to appear in…

Descriptors: Research Methodology, Guidelines, Probability, Computation

A Model-Averaging Approach to Replication : The Case of "p[subscript rep]"

Peer reviewed

Direct link

Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010

The purpose of the recently proposed "p[subscript rep]" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p[subscript rep]" and recommends its use…

Descriptors: Effect Size, Evaluation Methods, Probability, Experiments

Identifying the Onset and Offset of Stuttering Events.

Peer reviewed

Ingham, Roger J.; And Others – Journal of Speech and Hearing Research, 1995

Four experienced stuttering researchers viewed videodisks of spontaneous speech from chronic stutterers and attempted to locate the precise onset and offset of individual stuttering events. Results showed interjudge disagreements that challenge the reliability and validity of onset and offset judgments. Highly agreed stuttering events were…

Descriptors: Adults, Clinical Diagnosis, Evaluation Problems, Interrater Reliability

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

The Assessment of Lesson Plans in Teacher Education: A Case Study in Assessment Validity and Reliability

Peer reviewed

Direct link

Tummons, Jonathan – Assessment & Evaluation in Higher Education, 2010

This paper forms part of an exploration of assessment on one part-time higher education (HE) course: an in-service, professional qualification for teachers and trainers in the learning and skills sector which is delivered on a franchise basis across a network of further education colleges in the north of England. This paper proposes that the…

Descriptors: Foreign Countries, Portfolios (Background Materials), Portfolio Assessment, Validity

Combining Grades from Different Assessments: How Reliable Is the Result?

Peer reviewed

Cresswell, M. J. – Educational Review, 1988

The author suggests combining grades from component assessments to provide an overall student assessment. He explores the concept of reliability and concludes that the overall assessment will be reliable only if the number of grades used to report component achievements equals or exceeds the number used to report overall achievement. (Author/CH)

Descriptors: Evaluation Problems, Grades (Scholastic), Holistic Evaluation, Reliability

A Consideration of the Validity and Reliability of Suicide Mortality Data.

Peer reviewed

O'Carroll, Patrick W. – Suicide and Life-Threatening Behavior, 1989

Briefly outlines problems associated with definition and official certification of suicide and reviews literature pertaining to validity and reliability of suicide statistics. Considers process of suicide certification as a test, estimating its sensitivity, specificity, and predictive value, using data from studies reviewed. (NB)

Descriptors: Attrition (Research Studies), Death, Evaluation Problems, Reliability

Toward Developing a Science of Treatment Integrity: Introduction to the Special Series

Peer reviewed

Direct link

Hagermoser Sanetti, Lisa M.; Kratochwill, Thomas R. – School Psychology Review, 2009

Treatment integrity (also referred to as "treatment fidelity," "intervention integrity," and "procedural reliability") is an important methodological concerning both research and practice because treatment integrity data are essential to making valid conclusions regarding treatment outcomes. Despite its relationship to validity, treatment…

Descriptors: Intervention, Research Methodology, Models, Validity

Previous Page | Next Page »

Pages: 1 | 2 | 3

School Psychology Review	4
Psychological Methods	2
American Journal of Distance…	1
American Journal of Evaluation	1
Assessing Writing	1
Assessment & Evaluation in…	1
British Educational Research…	1
Carnegie Foundation for the…	1
Child Study Journal	1
Education for Information	1
Educational Evaluation and…	1
Educational Psychology: An…	1
Educational Review	1
Educational Studies	1
Electronic Journal of…	1
Florida Journal of…	1
Journal of Educational…	1
Journal of Research on…	1
Journal of Speech and Hearing…	1
Measurement:…	1
Online Submission	1
Research & Practice in…	1
Sage Research Methods Cases	1
Simulation/Games for Learning	1
Social Studies	1
More ▼

Berliner, David C.	1
Bond, Lloyd	1
Buckle, C. F.	1
Buser, Karen P.	1
Christie, Christina A.	1
Coburn, Louisa	1
Cresswell, M. J.	1
Dowell, David A.	1
Downey, Jillian	1
Dudley-Marling, Curt	1
Easton, Julia E.	1
Fajman, Nancy	1
Ferguson, Lon	1
Follman, John	1
Fourie, Ina	1
Gansemer-Topf, Ann M.	1
Genschel, Ulrike	1
Gresham, Frank M.	1
Haberman, Shelby J.	1
Hagermoser Sanetti, Lisa M.	1
Haskell, Robert E.	1
Hatch, Jill A.	1
Hayes, John R.	1
Hedge, Jerry W.	1
More ▼