Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 11 |
| Since 2007 (last 20 years) | 41 |
Descriptor
| Evaluation Problems | 62 |
| Test Reliability | 62 |
| Test Validity | 42 |
| Evaluation Methods | 32 |
| Evaluation Research | 13 |
| Student Evaluation | 13 |
| Foreign Countries | 10 |
| Testing Problems | 9 |
| Elementary Secondary Education | 8 |
| Evaluation Criteria | 8 |
| Item Analysis | 8 |
| More ▼ | |
Source
Author
| Bagnato, Stephen J. | 2 |
| Macy, Marisa | 2 |
| Abdulrahman Alshammari | 1 |
| Amy I. Berman, Editor | 1 |
| Anderson, Andrew | 1 |
| Arielle Boguslav | 1 |
| Athanasou, James A. | 1 |
| Bachor, Dan G. | 1 |
| Balcão Reis, Ana | 1 |
| Ballou, Dale | 1 |
| Bates, Simon P. | 1 |
| More ▼ | |
Publication Type
Education Level
Location
| Canada | 3 |
| Florida | 2 |
| Arizona | 1 |
| Australia | 1 |
| Austria | 1 |
| California | 1 |
| Hong Kong | 1 |
| Maryland | 1 |
| North Carolina | 1 |
| Portugal | 1 |
| Turkey | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Abdulrahman Alshammari – ProQuest LLC, 2024
A critical component of modern software development practices, particularly continuous integration (CI), is the halt of development activities in response to test failures which requires further investigation and debugging. As software changes, regression testing becomes vital to verify that new code does not affect existing functionality.…
Descriptors: Computer Software, Programming, Coding, Test Reliability
Scott F. Marion, Editor; James W. Pellegrino, Editor; Amy I. Berman, Editor – National Academy of Education, 2024
High-quality assessments are crucial to many aspects of the educational process. They can help policymakers monitor long-term educational trends, assist state educational agencies (SEAs) and local educational agencies (LEAs) in allocating resources and professional development opportunities, provide insights to teachers about how well students…
Descriptors: Educational Assessment, Educational Policy, Equal Education, Test Validity
Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024
Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…
Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills
Mücahit Öztürk – Open Praxis, 2024
This study examined the problems that pre-service teachers face in the online assessment process and their suggestions for solutions to these problems. The participants were 136 pre-service teachers who have been experiencing online assessment for a long time and who took the Foundations of Open and Distance Learning course. This research is a…
Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Distance Education
Michelle Herridge – ProQuest LLC, 2021
Evaluation of student written work during summative assessments is an important and critical task for instructors at all educational levels. Nevertheless, few research studies exist that provide insights into how different instructors approach this task. Chemistry faculty (FIs) and graduate student instructors (GSIs) regularly engage in the…
Descriptors: Science Instruction, Chemistry, College Faculty, Teaching Assistants
Walker, Paul – Composition Forum, 2017
This article describes and theorizes a failed writing program assessment study to question the influence of "the rhetoric of agreement," or reliability, on writing assessment practice and its prevalence in validating institutional mandated assessments. Offering the phrase "dwelling in disagreement" as a queer perspective, the…
Descriptors: Rhetoric, Writing Tests, Test Reliability, Program Validation
Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017
Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…
Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives
Lau, Ken – Innovations in Education and Teaching International, 2018
Self-directed learning, despite its growing popularity in education, has challenged conventional assessment practice which often foregrounds the presentation of identical conditions to ensure reliability. This article discusses the results of a case study of university academic English teachers' perceptions and reported practices of assessing…
Descriptors: Independent Study, Teacher Attitudes, Case Studies, Educational Practices
Hartley, James – Psychology Teaching Review, 2017
In this article, Hartley notes the difficulties of using questionnaires to assess the efficiency of new instructional methods and highlights nine issues that researchers must consider. Hartley continues the discussion about the use of questionnaires and suggests that psychology teachers can help improve the teaching of psychology by drawing…
Descriptors: Questionnaires, Instructional Innovation, Instructional Effectiveness, Teaching Methods
Berliner, David C. – Education Policy Analysis Archives, 2018
The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…
Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests
Wodka, Ericka L.; Puts, Nicolaas A. J.; Mahone, E. Mark; Edden, Richard A. E.; Tommerdahl, Mark; Mostofsky, Stewart H. – Journal of Autism and Developmental Disorders, 2016
Sensory processing abnormalities in autism have largely been described by parent report. This study used a multi-method (parent-report and measurement), multi-trait (tactile sensitivity and attention) design to evaluate somatosensory processing in ASD. Results showed multiple significant within-method (e.g., parent report of different…
Descriptors: Attention, Multitrait Multimethod Techniques, Autism, Pervasive Developmental Disorders
Phillipo, Kate; Conner, Jerusha Osber; Davidson, Shannon; Pope, Denise – Teachers College Record, 2017
Background: A large body of survey-based research asserts that the quality and strength of student-teacher relationships (STRs) predict a host of academic and nonacademic outcomes; however, advances in survey design research have led some to question existing survey instruments' psychometric soundness. Concurrently, qualitative research on STRs…
Descriptors: Literature Reviews, Self Disclosure (Individuals), Evaluation Methods, Teacher Student Relationship
Cramer, Kenneth M.; Page, Stewart; Burrows, Vanessa; Lamoureux, Chastine; Mackay, Sarah; Pedri, Victoria; Pschibul, Rebecca – Collected Essays on Learning and Teaching, 2016
Based on analyses of Maclean's ranking data pertaining to Canadian universities published over the last 24 years, we present a summary of statistical findings of annual ranking exercises, as well as discussion about their current status and the effects upon student welfare. Some illustrative tables are also presented. Using correlational and…
Descriptors: Foreign Countries, Universities, Classification, Institutional Advancement
Rutkowski, Leslie; Rutkowski, David – Educational Researcher, 2016
In the current article, we consider the influential position of the Programme for International Student Assessment (PISA) and discuss several methodological areas that demonstrate the need for caution when using and interpreting PISA results. We motivate our argument by briefly describing the program's increased influence in educational policy…
Descriptors: International Assessment, Outcome Measures, Data Interpretation, Research Reports
Huang, Xiaoping; Hu, Zhongfeng – Higher Education Studies, 2015
The main problem of the educational evaluation validity is that it just copies the conceptual framework system of validity from educational measurement to its own conceptual system. The validity conceptual system that fits the need of theory and practice of educational evaluation has not been established yet. According to the inherent attributive…
Descriptors: Test Validity, Educational Assessment, Evaluation Problems, Theory Practice Relationship

Direct link
Peer reviewed
