Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 19 |
| Since 2017 (last 10 years) | 64 |
| Since 2007 (last 20 years) | 142 |
Descriptor
| Program Evaluation | 258 |
| Test Reliability | 116 |
| Reliability | 109 |
| Evaluation Methods | 69 |
| Test Validity | 66 |
| Program Effectiveness | 58 |
| Validity | 55 |
| Test Construction | 50 |
| Foreign Countries | 48 |
| Interrater Reliability | 48 |
| Statistical Analysis | 36 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 11 |
| Practitioners | 4 |
| Administrators | 2 |
| Counselors | 1 |
| Teachers | 1 |
Location
| Florida | 7 |
| California | 6 |
| Tennessee | 6 |
| Turkey | 6 |
| Canada | 4 |
| North Carolina | 4 |
| Pennsylvania | 4 |
| Sweden | 4 |
| Illinois | 3 |
| Missouri | 3 |
| Netherlands | 3 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024
Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to proving the reliability and validity of CFS policy evaluation instruments in elementary schools with different locations. This investigation uses the Context Input Process Product (CIPP)…
Descriptors: Validity, Reliability, School Policy, Program Evaluation
Yesildag Hasancebi, Funda; Yuksel, Busra Tuncay; Mesci, Gunkut – International Journal of Assessment Tools in Education, 2022
The purpose of this study was to develop a reliable and valid rating scale for the use of the assessment and evaluation of lesson plans and teaching practices that are based on argumentation-based inquiry (ABI). The study covered two academic years (four academic semesters). Qualitative and quantitative methods were utilized throughout the…
Descriptors: Foreign Countries, Rating Scales, Test Construction, Test Validity
Stella Y. Kim; Carl Westine; Tong Wu; Derek Maher – Journal of College Student Retention: Research, Theory & Practice, 2024
The primary purpose of this study is to validate a student engagement measure for its use in evaluation of a learning assistant (LA) program. A series of psychometric evaluations were made for both the original scale of Higher Education Student Engagement Scale (HESES) and its adapted version designed to be used in gauging the effectiveness of…
Descriptors: Learner Engagement, Teaching Assistants, Test Validity, Test Reliability
Deborah Oluwadele; Yashik Singh; Timothy Adeliyi – Electronic Journal of e-Learning, 2024
Validation is needed for any newly developed model or framework because it requires several real-life applications. The investment made into e-learning in medical education is daunting, as is the expectation for a positive return on investment. The medical education domain requires data-wise implementation of e-learning as the debate continues…
Descriptors: Electronic Learning, Evaluation Methods, Medical Education, Sustainability
Kelly L. Simonton; Mengyi Wei; Pamela Kulinna; Allison Poulos – Journal of Youth Development, 2025
Afterschool settings have continued to be identified as an ideal outlet for supporting youth development in many arenas, including physical activity (PA) behaviors and social emotional learning (SEL) skills. In fact, PA centered programs are ideal settings for youth to 8 develop and practice SEL skills in authentic ways. However, there a several…
Descriptors: Social Emotional Learning, Questionnaires, Elementary School Students, After School Programs
Jennifer Sdunzik; Ann M. Bessenbacher; Wilella D. Burgess; Asia M. Mohamud; Abdirisak Dalmar – American Journal of Evaluation, 2025
The success of development projects and evaluations hinges on having access to research protocols and methodologies that consider the needs and characteristics of stakeholders, subjects, and context while remaining rigorous and culturally sound. These efforts are often complicated by a dearth of tools that have been tested for validity and…
Descriptors: Foreign Countries, Program Evaluation, International Programs, Data Collection
Koriakin, Taylor A.; McKee, Sarah L.; Schwartz, Marlene B.; Chafouleas, Sandra M. – Journal of School Health, 2020
Background: Stakeholders increasingly recognize the role of policy in implementing Whole School, Whole Community, Whole Child (WSCC) frameworks in schools; however, few tools are currently available to assess alignment between district policies and WSCC concepts. The purpose of this study was to expand the Wellness School Assessment Tool (WellSAT)…
Descriptors: School Policy, Health Services, Health Promotion, Wellness
Fritz, Ronda; Harn, Beth; Biancarosa, Gina; Lucero, Audrey; Flannery, K. Brigid – Assessment for Effective Intervention, 2019
This study investigated the use of brief observations to measure implementation of small group interventions using the Quality of Intervention Delivery and Receipt (QIDR) tool. Videos of 10-min segments representing the beginning, middle, and end of each 30-min intervention lesson were coded for implementation. Results indicated that (a)…
Descriptors: Intervention, Program Implementation, Efficiency, Observation
Didion, Lisa; Filderman, Marissa J.; Roberts, Greg; Benz, Sarah A.; Olmstead, Cassandra L. – Assessment for Effective Intervention, 2023
Rubric-based observations of pre- and inservice teachers are common practice in schools. Popular observation tools often result in minimal variation in ratings between teachers, require extensive training and time demands for raters, and provide minimal feedback for professional development. Alternatively, direct observation methods are evidenced…
Descriptors: Audio Equipment, Reliability, Classroom Observation Techniques, Teacher Behavior
Catharine Lory; Emily Gregori – Behavioral Disorders, 2024
Systematic reviews of single-case experimental research (SCER) in special education often use the What Works Clearinghouse (WWC) Standards to assess the methodological rigor of studies within a given literature base. While significant changes were made between the two most recent versions of the WWC standards, no research to date has evaluated the…
Descriptors: Program Effectiveness, Standards, Evidence, Case Studies
Gokce, Hasan; Eroglu, Seyide; Karaca, Melek; Bektas, Oktay – Journal of Science Learning, 2022
In STEMNET's report, 76% of 500 teachers interviewed stated that joining the STEM Club increased students' ability to solve real-world problems. This study aims to develop a valid and reliable measurement tool for evaluating STEM clubs. The research sample consisting of 149 teachers who carry out STEM club activities in schools in Turkey was…
Descriptors: STEM Education, Clubs, Program Evaluation, Test Validity
Turkish Preschool Children's Representations of Friendship: Story Completion Method Adaptation Study
Nur, Imray; Aktas Arnas, Yasare – International Journal of Assessment Tools in Education, 2022
In the preschool period, children's friendships considered a crucial developmental task. Hence, it is critical to evaluate children's mental representations of friendships during this period. This study aims to evaluate the validity and reliability of the story completion protocol designed to evaluate preschool children's mental representations of…
Descriptors: Foreign Countries, Preschool Children, Friendship, Visualization
Reddy, Pritika; Chaudhary, Kaylash; Sharma, Bibhya; Chand, Darren – Education and Information Technologies, 2021
In the digital age, advocating and improving digital literacy is a global challenge. There have been scales developed to measure individuals' digital literacy competencies; however, intervention programs have been only a few. This research paper articulates design details, validity, reliability and effectiveness of a new online modulated digital…
Descriptors: Game Based Learning, Intervention, Technological Literacy, Pacific Islanders
Bahr, Michael W.; Edwin, Mary; Long, Kara A. – Assessment for Effective Intervention, 2023
This study focused on the development of the Multi-Tiered Systems of Support--Sustainability Scale (MTSS-SS). Review of the literature identified factors associated with sustainability for multi-tiered systems of support (MTSS), and it indicated few sustainability measures currently exist for practitioners and researchers to incorporate into MTSS…
Descriptors: Test Construction, Measurement Techniques, Multi Tiered Systems of Support, Sustainability
Nordengren, Chase R. – Professional Development in Education, 2023
Professional learning evaluators struggle to balance the pursuit of rigorous outcomes with the time and money that some effective measures entail. While many evaluations rely on surveys for their ease of administration and application, little research has explored how surveys can measure multiple constructs beyond participant satisfaction as part…
Descriptors: Faculty Development, Teacher Surveys, Outcomes of Education, Test Construction

Peer reviewed
Direct link
