Publication Date
| In 2026 | 1 |
| Since 2025 | 878 |
| Since 2022 (last 5 years) | 4487 |
| Since 2017 (last 10 years) | 10420 |
| Since 2007 (last 20 years) | 21883 |
Descriptor
| Test Validity | 21728 |
| Validity | 13774 |
| Test Reliability | 10826 |
| Foreign Countries | 9848 |
| Test Construction | 6867 |
| Factor Analysis | 5755 |
| Measures (Individuals) | 5617 |
| Predictive Validity | 5018 |
| Psychometrics | 4800 |
| Reliability | 4632 |
| Correlation | 4370 |
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
Location
| Turkey | 1387 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Öz, Serap; Özdemir, Ali – International Journal of Contemporary Educational Research, 2022
The purpose of this study is to develop a valid and reliable Likert-type scale that can be used to measure the data literacy skills of educators. During scale development, a pool of 130 items was drafted after a review of the relevant literature and presented to experts for their opinions. After the evaluation of experts, the…
Descriptors: Likert Scales, Test Construction, Construct Validity, Test Reliability
D. Princiotta; K. Caspary – SRI Education, a Division of SRI International, 2022
YouthTruth is a national survey project that harnesses student and stakeholder feedback to help guide decision-making by school leaders and education funders. With a grant from the Fund for Shared Insight, YouthTruth partnered with SRI Education to examine the relationship between key student experience scales and school-level academic and…
Descriptors: Elementary Secondary Education, Student Surveys, Student Experience, Outcomes of Education
Agustina, Eka Nurmala Sari; Widadah, Soffil; Nisa, Putri Afinanun – Mathematics Teaching Research Journal, 2021
Education currently prioritizes only mastery of scientific aspects and students' intelligence. Math problems are still related to fictitious general knowledge. For this reason, local wisdom-based learning is needed, packaged using objects, events, and various things that are close to students' lives to raise the local potential of…
Descriptors: Mathematics Instruction, Problem Solving, Indigenous Knowledge, Values Education
Herman, Keith C.; Reinke, Wendy M.; Huang, Francis L.; Thompson, Aaron M.; Doyle-Barker, Levi – School Psychology, 2021
Early adolescence represents a critical developmental period for the identification, prevention, and early intervention of mental health concerns. The Early Identification System--Student Report (EIS-SR) was developed as a user-friendly, accessible, and cost-efficient method for identifying youth at risk for mental health concerns. The present…
Descriptors: Psychometrics, Identification, Screening Tests, Middle School Students
Kittelman, Angus; Mercer, Sterett H.; McIntosh, Kent; Nese, Rhonda N. T. – Grantee Submission, 2021
To identify the most effective strategies for implementing and sustaining Tier 2 and 3 behavior support systems, a measure of general and tier-specific factors hypothesized to predict sustained implementation is needed. To address this need, we conducted two studies examining the construct validity of the "Advanced Level Tier Interventions…
Descriptors: Positive Behavior Supports, Measures (Individuals), Test Construction, Construct Validity
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
André G. Bateman; Nicholas D. Myers; Deborah Feltz; Karin A. Pfeiffer; Kimberly Kelly; Alan L. Smith; Seungmin Lee; Adam McMahon; Isaac Prilleltensky; Ora Prilleltensky; Ahnalee M. Brincks – Measurement in Physical Education and Exercise Science, 2024
The purpose of this study was to explore the validity of evidence for self-efficacy to regulate physical activity scale (SERPA) measurement using an exploratory latent variable approach. The objectives were to explore the dimensionality, temporal invariance, and external validity of scores produced by the SERPA, a modified version of the barriers…
Descriptors: Adults, Obesity, Physical Activity Level, Self Management
Katie J. Robinson; David R. Lubans; Myrto F. Mavilidi; Francisco B. Ortega; Nicholas Riley – Measurement in Physical Education and Exercise Science, 2024
The purpose of our study was to assess the test-retest reliability and concurrent validity of the 30-sec Sit to Stand test in a sample of adolescents. We recruited 30 male (58%) and 22 female (42%) participants (mean age = 15.77 years ± 0.46). Participants completed the 30-sec Sit to Stand and standing long jump tests on two occasions separated by…
Descriptors: Test Validity, Adolescents, Psychomotor Skills, Test Reliability
Mirian Agus; Giovanni Bonaiuti; Arianna Marras – Journal of Science Education and Technology, 2024
In recent years, numerous research studies have highlighted how teachers' perceptions of educational robotics (ER) and their sense of self-efficacy can influence the learning process. Although different instruments exist to investigate teachers' perspectives on ER, the Robotics Interest Questionnaire (RIQ) scale, developed within the Portuguese…
Descriptors: Teacher Attitudes, Self Efficacy, Test Validity, Test Reliability
Scott J. Peters; Matthew C. Makel; Lindsay Ellis Lee; Tamra Stambaugh; Matthew T. McBee; D. Betsy McCoach; Kiana R. Johnson – Gifted Child Today, 2024
Universal screening is one of the most-common topics and well-accepted best practices within the field of gifted and talented education. There appears to be little disagreement that universally screening all students as part of a gifted and talented identification process results in fewer missed students. But surprisingly, there is little guidance…
Descriptors: Academically Gifted, Talent Identification, Screening Tests, Test Validity
Tíscar Rodríguez-Jiménez; Verónica Vidal-Arenas; Raquel Falcó; Beatriz Moreno-Amador; Juan C. Marzo; José A. Piqueras – Child & Youth Care Forum, 2024
Background: The Social Emotional Distress Scale-Secondary (SEDS-S) is a short measure designed for comprehensive school-based mental health screening, particularly for using very brief self-reported measures of well-being and distress. Whereas prior studies have shown validity and reliability evidence for the English version, there is a lack of…
Descriptors: Measures (Individuals), Psychometrics, Spanish, Well Being
Seunghan Lee; Amar Sadanand Shetty; Lora A. Cavuoto – IEEE Transactions on Learning Technologies, 2024
Recent usage of virtual reality (VR) technology in surgical training has emerged because of its cost-effectiveness, time savings, and cognition-based feedback generation. However, the quantitative evaluation of its effectiveness in training is still not thoroughly studied. This article demonstrates the effectiveness of a VR-based surgical training…
Descriptors: Markov Processes, Computer Simulation, Teaching Methods, Surgery
Turhan, Abdullah; Roest, Jesse J.; Delforterie, Monique J.; Van der Helm, G. H. Peer; Neimeijer, Elien G.; Didden, Robert – Journal of Applied Research in Intellectual Disabilities, 2024
Background: The Group Climate Inventory (GCI) was tested for measurement invariance across 332 adults with and 225 adults without mild intellectual disabilities in Dutch forensic treatment, and for latent mean differences on its "Support", "Growth", "Repression", and "Atmosphere" subscales. Method:…
Descriptors: Adults, Mild Intellectual Disability, Crime, Institutionalized Persons
Lauren N. Currie; Robinder P. Bedi; Anita M. Hubley – Measurement and Evaluation in Counseling and Development, 2024
This study evaluated the psychometric properties of the Hope-Action Inventory (HAI) scores with a problematic substance use population (N = 783). The hierarchical seven-factor structure of the HAI fit the data well. Further, the HAI scores had satisfactory internal consistency reliability and good convergent evidence for validity.
Descriptors: Psychometrics, Substance Abuse, Test Validity, Test Reliability
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention

