Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 15 |
Descriptor
| Evaluation Problems | 42 |
| Reliability | 42 |
| Validity | 28 |
| Evaluation Methods | 24 |
| Educational Assessment | 12 |
| Elementary Secondary Education | 11 |
| Evaluation Criteria | 10 |
| Measurement Techniques | 9 |
| Student Evaluation | 9 |
| Research Methodology | 8 |
| Measurement | 7 |
Author
| Berliner, David C. | 1 |
| Bond, Lloyd | 1 |
| Buckle, C. F. | 1 |
| Buser, Karen P. | 1 |
| Christie, Christina A. | 1 |
| Coburn, Louisa | 1 |
| Cresswell, M. J. | 1 |
| Dowell, David A. | 1 |
| Downey, Jillian | 1 |
| Dudley-Marling, Curt | 1 |
| Easton, Julia E. | 1 |
Education Level
| Elementary Secondary Education | 8 |
| Higher Education | 5 |
| Postsecondary Education | 4 |
Kinarsky, Alana R.; Christie, Christina A. – American Journal of Evaluation, 2022
Since 2007, two taxonomies have been proposed to identify the components of evaluation practice that may be specified in an evaluation policy. Little is known, however, about how these taxonomies align with evaluation policies developed by philanthropic foundations. Through thematic analysis, this article first compares 12 foundation evaluation…
Descriptors: Taxonomy, Evaluation Methods, Philanthropic Foundations, Educational Policy
Mojgan Rashtchi; SeyyedeFateme Ghazi Mir Saeed – Sage Research Methods Cases, 2023
The reason for conducting the present case study was the problems the researchers encountered during data collection for another research project (Primary Study) entitled "The effects of virtual versus traditional flipped classes on EFL learners' grammar knowledge, self-regulation, and autonomy." Two online questionnaires were…
Descriptors: Data Collection, Questionnaires, Barriers, Research Methodology
Gansemer-Topf, Ann M.; Downey, Jillian; Genschel, Ulrike – Research & Practice in Assessment, 2017
Effective assessment practice requires clearly defining and operationalizing terminology. We illustrate the importance of this practice by focusing on academic "undermatching"--when students enroll in colleges that are less academically selective than those for which they are academically prepared. Undermatching has been viewed as a…
Descriptors: Differences, Definitions, Vocabulary, Comparative Analysis
Berliner, David C. – Teachers College Record, 2014
Background: There has been rapid growth in value-added assessment of teachers to meet the widely supported policy goal of identifying the most effective and the most ineffective teachers in a school system. The former group is to be rewarded while the latter group is to be helped or fired for their poor performance. But, value-added approaches to…
Descriptors: Teacher Effectiveness, Academic Achievement, Teacher Evaluation, Scores
Perry, Thomas – British Educational Research Journal, 2016
Value-added "Progress" measures are to be introduced for all English schools in 2016 as "headline" measures of school performance. This move comes despite research highlighting high levels of instability in value-added measures and concerns about the omission of contextual variables in the planned measure. This article studies…
Descriptors: Foreign Countries, Value Added Models, School Effectiveness, Performance Based Assessment
Moffett, David W.; Reid, Barbara K. – Online Submission, 2010
The investigators studied the scoring reliability of candidates' ten-day unit plans of instruction through prescribed action research projects, across three academic years. Scoring of the projects in year one provided opportunities for further refinement of the action research evaluation methods in year two. Across three terms in years one and two…
Descriptors: Research Projects, Action Research, Student Evaluation, Mastery Learning
Rezaei, Ali Reza; Lovorn, Michael – Assessing Writing, 2010
This experimental project investigated the reliability and validity of rubrics in assessment of students' written responses to a social science "writing prompt". The participants were asked to grade one of the two samples of writing assuming it was written by a graduate student. In fact both samples were prepared by the authors. The…
Descriptors: Spelling, Sentence Structure, Punctuation, Social Sciences
Lecoutre, Bruno; Lecoutre, Marie-Paule; Poitevineau, Jacques – Psychological Methods, 2010
P. R. Killeen's (2005a) probability of replication ("p[subscript rep]") of an experimental result is the fiducial Bayesian predictive probability of finding a same-sign effect in a replication of an experiment. "p[subscript rep]" is now routinely reported in "Psychological Science" and has also begun to appear in…
Descriptors: Research Methodology, Guidelines, Probability, Computation
Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010
The purpose of the recently proposed "p[subscript rep]" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p[subscript rep]" and recommends its use…
Descriptors: Effect Size, Evaluation Methods, Probability, Experiments
Peer reviewed: Ingham, Roger J.; And Others – Journal of Speech and Hearing Research, 1995
Four experienced stuttering researchers viewed videodisks of spontaneous speech from chronic stutterers and attempted to locate the precise onset and offset of individual stuttering events. Results showed interjudge disagreements that challenge the reliability and validity of onset and offset judgments. Highly agreed stuttering events were…
Descriptors: Adults, Clinical Diagnosis, Evaluation Problems, Interrater Reliability
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors' experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Tummons, Jonathan – Assessment & Evaluation in Higher Education, 2010
This paper forms part of an exploration of assessment on one part-time higher education (HE) course: an in-service, professional qualification for teachers and trainers in the learning and skills sector which is delivered on a franchise basis across a network of further education colleges in the north of England. This paper proposes that the…
Descriptors: Foreign Countries, Portfolios (Background Materials), Portfolio Assessment, Validity
Peer reviewed: Cresswell, M. J. – Educational Review, 1988
The author suggests combining grades from component assessments to provide an overall student assessment. He explores the concept of reliability and concludes that the overall assessment will be reliable only if the number of grades used to report component achievements equals or exceeds the number used to report overall achievement. (Author/CH)
Descriptors: Evaluation Problems, Grades (Scholastic), Holistic Evaluation, Reliability
Peer reviewed: O'Carroll, Patrick W. – Suicide and Life-Threatening Behavior, 1989
Briefly outlines problems associated with definition and official certification of suicide and reviews literature pertaining to validity and reliability of suicide statistics. Considers process of suicide certification as a test, estimating its sensitivity, specificity, and predictive value, using data from studies reviewed. (NB)
Descriptors: Attrition (Research Studies), Death, Evaluation Problems, Reliability
Hagermoser Sanetti, Lisa M.; Kratochwill, Thomas R. – School Psychology Review, 2009
Treatment integrity (also referred to as "treatment fidelity," "intervention integrity," and "procedural reliability") is an important methodological concern in both research and practice because treatment integrity data are essential to making valid conclusions regarding treatment outcomes. Despite its relationship to validity, treatment…
Descriptors: Intervention, Research Methodology, Models, Validity

