Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 316 |
| Since 2017 (last 10 years) | 615 |
| Since 2007 (last 20 years) | 1736 |
Descriptor
| Evaluation Methods | 3975 |
| Test Validity | 2083 |
| Validity | 1473 |
| Test Reliability | 995 |
| Student Evaluation | 803 |
| Foreign Countries | 637 |
| Test Construction | 560 |
| Reliability | 527 |
| Higher Education | 452 |
| Elementary Secondary Education | 418 |
| Measurement Techniques | 418 |
Author
| Fuchs, Lynn S. | 12 |
| Baker, Eva L. | 11 |
| Cronin, John | 11 |
| Marsh, Herbert W. | 11 |
| Amrein-Beardsley, Audrey | 9 |
| Linn, Robert L. | 9 |
| Sireci, Stephen G. | 9 |
| Raykov, Tenko | 8 |
| Deno, Stanley L. | 7 |
| Epstein, Michael H. | 7 |
| Matson, Johnny L. | 7 |
Audience
| Researchers | 193 |
| Practitioners | 121 |
| Teachers | 47 |
| Administrators | 31 |
| Policymakers | 27 |
| Students | 16 |
| Counselors | 7 |
| Media Staff | 4 |
| Community | 3 |
| Support Staff | 3 |
| Parents | 2 |
Location
| Australia | 66 |
| United Kingdom | 56 |
| Canada | 47 |
| California | 32 |
| Netherlands | 30 |
| United States | 30 |
| United Kingdom (England) | 26 |
| Germany | 23 |
| Turkey | 22 |
| China | 21 |
| Taiwan | 21 |
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Kylie L. Anglin – Annenberg Institute for School Reform at Brown University, 2025
Since 2018, institutions of higher education have been aware of the "enrollment cliff" which refers to expected declines in future enrollment. This paper attempts to describe how prepared institutions in Ohio are for this future by looking at trends leading up to the anticipated decline. Using IPEDS data from 2012-2022, we analyze trends…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Bangun Sartono; Widha Sunarno; Baskoro Adi Prayitno; Nurma Yunita Indriyanti – Educational Process: International Journal, 2025
Background/purpose: Critical thinking (CT) is fundamental in science education, but instruments to measure CT in specific domains, such as physics, are still limited. The present study aims to develop and validate the Critical Thinking Test in Temperature and Heat (CTTH), an instrument designed to measure critical thinking skills in the topic of…
Descriptors: Test Construction, Test Validity, Critical Thinking, Evaluation Methods
Wendy Chan – Asia Pacific Education Review, 2024
As evidence from evaluation and experimental studies continues to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…
Descriptors: Probability, Scores, Causal Models, Statistical Inference
Nan Xie; Zhengxu Li; Haipeng Lu; Wei Pang; Jiayin Song; Beier Lu – IEEE Transactions on Learning Technologies, 2025
Classroom engagement is a critical factor for evaluating students' learning outcomes and teachers' instructional strategies. Traditional methods for detecting classroom engagement, such as coding and questionnaires, are often limited by delays, subjectivity, and external interference. While some neural network models have been proposed to detect…
Descriptors: Learner Engagement, Artificial Intelligence, Technology Uses in Education, Educational Technology
Melissa Raspa; Angela Gwaltney; Carla Bann; Jana von Hehn; Timothy A. Benke; Eric D. Marsh; Sarika U. Peters; Amitha Ananth; Alan K. Percy; Jeffrey L. Neul – Journal of Autism and Developmental Disorders, 2025
Rett syndrome is a severe neurodevelopmental disorder that affects about 1 in 10,000 females. Clinical trials of disease modifying therapies are on the rise, but there are few psychometrically sound caregiver-reported outcome measures available to assess treatment benefit. We report on a new caregiver-reported outcome measure, the Rett Caregiver…
Descriptors: Neurodevelopmental Disorders, Genetic Disorders, Females, Test Validity
John A. Jimenez-Garcia; Chanelle Montpetit; Richard DeMont – Measurement in Physical Education and Exercise Science, 2024
The Child Focused Injury Risk Screening Tool (ChildFIRST) aims to measure movement competence and lower-limb-injury risk in 8-12-year-old children. Although the ChildFIRST has face and content validity evidence, stronger validity evidence is warranted. We tested the concurrent validity of the ChildFIRST using motion analysis, and the convergent…
Descriptors: Preadolescents, Kinesiology, Risk Assessment, Human Body
Alexandra Jackson; Elise Barrella; Cheryl Bodnar – Journal of Engineering Education, 2024
Background: Concept maps are a valid assessment tool to explore student understanding of diverse topics. Many types of academic programs have integrated concept mapping into their courses, resulting in various activities and scoring methods to understand student perceptions. Purpose: Few prior reviews of concept mapping have addressed their use…
Descriptors: Engineering Education, Concept Mapping, Scoring Rubrics, Evaluation Methods
Abede Mack; Katelynn Carter-Rogers; Priscilla Bahaw; Ayanna Stephens – Discover Education, 2024
Appetite for entrepreneurship education (EE) among vocational students has surged dramatically, driven by persistent challenges of unemployment. As a result, vocational institutions are increasingly focused on how much entrepreneurship exposure students receive, particularly how frequently instructors impart core business knowledge and skills to…
Descriptors: Entrepreneurship, Technical Education, Student Attitudes, Knowledge Level
Chitra Sabapathy – Shanlax International Journal of Education, 2024
Background: Mid-semester evaluations are gaining traction as a means to gather evaluation data for formative purposes. However, it is not clear if course coordinators who conduct these evaluations are adequately equipped with evaluative knowledge and skills to guide them through their evaluative processes. Objectives: This study is a…
Descriptors: Evaluation Methods, Instructor Coordinators, Tutors, College Students
Divya Varier; Marvin G. Powell; Stephanie Dodman; Samantha T. Ives; Elizabeth DeMulder; Jenice L. View – Educational Assessment, 2024
Considerable literature is devoted to teachers' assessment use to support teaching and learning. The study examined the factor structure of a measure of teachers' assessment use along the assessments "of", "for", and "as" learning purpose dimensions. The study also examined the factor structure of teachers' perceived…
Descriptors: Assessment Literacy, Elementary Secondary Education, Teacher Evaluation, Teacher Attitudes
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Naumann, Sandra; Byrne, Michelle L.; de la Fuente, Alethia; Harrewijn, Anita; Nugiel, Tehila; Rosen, Maya; van Atteveldt, Nienke; Matusz, Pawel J. – Mind, Brain, and Education, 2022
In cognitive neurosciences, fundamental principles of mental processes and functional brain organization have been established with highly controlled tasks and testing environments. Recent technical advances allowed the investigation of these functions and their brain mechanisms in naturalistic settings. The diversity in those approaches has been…
Descriptors: Evaluation Methods, Neurosciences, Educational Research, Validity
Siti Suprihatiningsih; Masriyah; Rooselyna Ekawati – Journal of Education and Learning (EduLearn), 2025
The knowledge of the materials to be taught to the students is the basic knowledge that preservice mathematics teachers should possess, as they need to prepare themselves for teaching. In order to research preservice teachers' understanding of the subject matter and teaching skills, valid and reliable test instruments are required. Knowledge of…
Descriptors: Preservice Teachers, Pedagogical Content Knowledge, Preservice Teacher Education, Mathematics Teachers
Seungbak Lee; Minsoo Kang; Jae-Hyeon Park; Hyo-Jun Yun – Measurement in Physical Education and Exercise Science, 2025
The PageRank model has been applied in sport ranking systems; however, prior implementations exhibited limitations and failed to produce valid rankings. This study analyzed 1,466 National Collegiate Athletic Association (NCAA) Division 1 football games and developed a novel, modified PageRank model. We also proposed an artificial…
Descriptors: Algorithms, Evaluation Methods, Team Sports, College Athletics
Shereen El Mallah – Journal of Adolescent Research, 2024
Racially and ethnically diverse populations from minoritized backgrounds are often exposed to research methodologies that amplify structural racism and negate their sociocultural reality. Although cross-cultural validation of measures is considered a requisite step to multigroup comparisons, researchers apply measures validated and standardized in…
Descriptors: Minority Groups, Youth, Participatory Research, Validity

