Publication Date
| In 2026 | 8 |
| Since 2025 | 2276 |
| Since 2022 (last 5 years) | 12791 |
| Since 2017 (last 10 years) | 33916 |
| Since 2007 (last 20 years) | 68407 |
Descriptor
| Foreign Countries | 30560 |
| Test Validity | 21743 |
| Scores | 18256 |
| Academic Achievement | 16928 |
| Test Construction | 16756 |
| Test Reliability | 15028 |
| Achievement Tests | 14859 |
| Standardized Tests | 14720 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13042 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3393 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 978 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2822 |
| Australia | 2426 |
| Canada | 2270 |
| California | 1854 |
| United States | 1726 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1122 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Bruno D. Zumbo – International Journal of Assessment Tools in Education, 2023
In line with the journal volume's theme, this essay considers lessons from the past and visions for the future of test validity. In the first part of the essay, a description of historical trends in test validity since the early 1900s leads to the natural question of whether the discipline has progressed in its definition and description of test…
Descriptors: Test Theory, Test Validity, True Scores, Definitions
Krystal Thomas; Todd A. Grindal; Daisy Wise Rutstein; Gullnar Syed; Sarah Nixon Gerard; Shari Golan; Sheryl Cababa; Amanda Di Dio; Behnosh Najafi; Kat Ward – SRI Education, a Division of SRI International, 2023
Instructional coaching, informed by observation tools that measure teachers' practices, has been effective in improving teaching quality in early learning programs. However, existing measurement tools limit teachers' abilities to implement this type of instructional coaching at scale. To address this challenge, a team at SRI Education, along with…
Descriptors: Preschool Education, Kindergarten, Coaching (Performance), Observation
Laura Sparaci; Valentina Fantasia; Chiara Bonsignori; Cecilia Provenzale; Domenico Formica; Fabrizio Taffoni – Reading and Writing: An Interdisciplinary Journal, 2025
A growing number of primary school students experience difficulties with grapho-motor skills involved in handwriting, which impact both form and content of their texts. Therefore, it is important to assess and monitor handwriting skills in primary school via standardized tests and detect specific grapho-motor parameters (GMPs) which impact…
Descriptors: Handwriting, Writing Instruction, Writing Tests, Standardized Tests
Mariana Barragán Torres; Meg Bates; Sarah Cashdollar – Illinois Workforce and Education Research Collaborative, Discovery Partners Institute, 2025
Aggregate national test score data have shown that student learning declined from SY19 to SY21, and that recovery began occurring from SY21 to SY22. To further the understanding of how districts in Illinois' performances have changed since the onset of the pandemic, the authors explored variation in districts' change in standardized test scores…
Descriptors: School Districts, Test Score Decline, Student Improvement, Achievement Gains
Benedict C. O. F. Fehringer; Meike Bonefeld; Fabian Schunk – Social Psychology of Education: An International Journal, 2025
Bias Awareness is understood as individual differences in people's sensitivity to and concerns about their expressions of subtle bias. In 2015, Perry et al. developed a scale to measure the awareness and concern of one's own subtle bias (Bias Awareness Scale, BAS). The present research aims to test the validity of a German adaptation of the Bias…
Descriptors: Foreign Countries, Bias, Teachers, Teacher Attitudes
José Hernando Ávila-Toscano; Laura Isabel Rambal-Rivaldo; David Javier Fortich Pérez; Leonardo Vargas-Delgado – International Journal of Education in Mathematics, Science and Technology, 2025
Technological mediation has gained relevance in teaching mathematics. Its usefulness and impact depend, to a great extent, on how students approach the learning of the discipline. Two independent instrumental studies were conducted to analyze the psychometric properties of the Spanish version of the Mathematics and Technology Attitudes Scale…
Descriptors: Mathematics Instruction, Educational Technology, Technology Uses in Education, Psychometrics
Ali Özcan; Fatma Tezel Sahin – International Journal of Psychology and Educational Studies, 2025
The present study aims to adapt the Technoference in Parent-Child Relationships Scale (TPCRS) to Turkish culture by conducting validity and reliability analyses. The study group consists of the parents of 445 children between the ages of 3 and 6 attending preschool in the Denizli province. Expert opinions were consulted for the language validity…
Descriptors: Foreign Countries, Parent Child Relationship, Test Validity, Test Reliability
Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
Omar Saleh Bani Yassin; Aiman Mohammad Freihat; Sabri Hassan Al-Tarawneh – Educational Process: International Journal, 2025
Background/purpose: This study aimed to investigate the differences among the equations used in estimating the reliability coefficient using the half-split method. These equations demonstrate Spearman-Brown's, Rulon's, Guttman's, Mosier's, Flanagan's, and Horst's. Materials/methods: The study instrument was a 43-item scale for evaluating the…
Descriptors: Foreign Countries, Equations (Mathematics), Mathematics Instruction, Grade 10
Melissa H. Black; Karl Lundin Remnélius; Lovisa Alehagen; Thomas Bourgeron; Sven Bölte – Journal of Autism and Developmental Disorders, 2025
Purpose: A considerable number of screening and diagnostic tools for autism exist, but variability in these measures presents challenges to data harmonization and the comparability and generalizability of findings. At the same time, there is a movement away from autism symptomatology to stances that capture heterogeneity and appreciate diversity.…
Descriptors: Symptoms (Individual Disorders), Classification, Measures (Individuals), Autism Spectrum Disorders
Mohd Norlizam Mohd Razali; Aida Hanim A. Hamid; Bity Salwana Alias; Azlin Norhaini Mansor – Journal of Education and Learning (EduLearn), 2025
A teacher competency instrument was developed to determine the level of teacher competency in small schools in Peninsular Malaysia. This study was conducted in Perak and Negeri Sembilan to determine the instrument's reliability and validity. Exploratory factor analysis (EFA) and item reliability analysis were used to determine the questionnaire's…
Descriptors: Foreign Countries, Elementary Secondary Education, Small Schools, Rural Schools
Victoria Crisp; Sylvia Vitello; Abdullah Ali Khan; Heather Mahy; Sarah Hughes – Research Matters, 2025
This research set out to enhance our understanding of the exam techniques and types of written annotations or markings that learners may wish to use to support their thinking when taking digital multiple-choice exams. Additionally, we aimed to further explore issues around the factors that contribute to learners writing less rough work and…
Descriptors: Computer Assisted Testing, Test Format, Multiple Choice Tests, Notetaking
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Matthew T. Mahar; Hoyong Sung – Measurement in Physical Education and Exercise Science, 2025
Field-based tests of aerobic fitness that can be administered quickly and do not require maximal effort are desirable. The purpose was to develop and validate quarter-mile walk tests for 10--13-year-olds. Participants (N = 59) walked one mile on two different days. Walk times, heart rates, body mass, physical activity, and aerobic fitness were…
Descriptors: Physical Fitness, Test Construction, Exercise, Early Adolescents
Ali M. Alodat; Qais Al-Meqdad; Maha Al-Hendawi; Nawaf Al-Zyoud; Osamah Bataineh – Journal of Advanced Academics, 2025
This study uses a rigorous research process to explore the psychometric properties of the Gifted Rating Scale-School Form (GRS-S) within the Qatari educational context. We employed stratified cluster sampling of 326 students (aged 6-13 years, M = 10.9) from 25 public schools in Doha. Data was collected in the second semester of the 2023-2024…
Descriptors: Academically Gifted, Rating Scales, Psychometrics, Foreign Countries

Peer reviewed
Direct link
