Publication Date
| In 2026 | 6 |
| Since 2025 | 2195 |
| Since 2022 (last 5 years) | 12710 |
| Since 2017 (last 10 years) | 33835 |
| Since 2007 (last 20 years) | 68326 |
Descriptor
| Foreign Countries | 30532 |
| Test Validity | 21728 |
| Scores | 18248 |
| Academic Achievement | 16912 |
| Test Construction | 16738 |
| Test Reliability | 15015 |
| Achievement Tests | 14839 |
| Standardized Tests | 14712 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13038 |
| Language Tests | 12549 |
Audience
| Practitioners | 5034 |
| Teachers | 3391 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
Location
| Turkey | 2815 |
| Australia | 2426 |
| Canada | 2269 |
| California | 1853 |
| United States | 1725 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1121 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Chan Zhang; Shuaiying Cao; Minglei Wang; Jiangyan Wang; Lirui He – Field Methods, 2025
Previous research on grid questions has mostly focused on their comparability with the item-by-item method and the use of shading to help respondents navigate through a grid. This study extends prior work by examining whether lexical similarity among grid items affects how respondents answer the questions in an experiment where we manipulated…
Descriptors: Foreign Countries, Surveys, Test Construction, Design
Valeria Damiani; Julian Fraillon – Large-scale Assessments in Education, 2025
Globalization and its impact on contemporary societies have gained new impetus with the notions of global citizenship education (GCED) and education for sustainable development (ESD), considered, together with civic and citizenship education (CCE), as a means for promoting students' engagement in global/local issues and providing them with the…
Descriptors: Civics, Citizenship Education, Global Approach, Sustainable Development
Juliana Reyes-Martin; David Simó-Pinatella; Ana Andrés – Journal of Applied Research in Intellectual Disabilities, 2025
Background: Behavioural problems in individuals with intellectual disabilities have a negative impact on them. Limited assessment measures exist in Spain. This study aimed to validate the Behavior Problems Inventory--Short Form (BPI-S) in the Spanish population by examining its psychometric properties and factorial structures. Method: This study…
Descriptors: Foreign Countries, Behavior Problems, Students with Disabilities, Intellectual Disability
Onur Dönmez; Yavuz Akbulut; Gözde Zabzun; Berrin Köseoglu – Applied Cognitive Psychology, 2025
This study investigates the effect of survey order in measuring self-reported cognitive load. Understanding how survey order influences responses is crucial, but it has been largely overlooked in the context of cognitive load. Using a 2 × 2 experimental design with 319 high school students, the study manipulated intrinsic cognitive load (ICL)…
Descriptors: Surveys, Test Construction, Measurement, Cognitive Processes
Camilla M. McMahon; Maryellen Brunson McClain; Savannah Wells; Sophia Thompson; Jeffrey D. Shahidullah – Journal of Autism and Developmental Disorders, 2025
Purpose: The goal of the current study was to conduct a substantive validity review of four autism knowledge assessments with prior psychometric support (Gillespie-Lynch in J Autism and Dev Disord 45(8):2553-2566, 2015; Harrison in J Autism and Dev Disord 47(10):3281-3295, 2017; McClain in J Autism and Dev Disord 50(3):998-1006, 2020; McMahon…
Descriptors: Measures (Individuals), Psychometrics, Test Items, Accuracy
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
Abdullah Faruk Kiliç; Meltem Acar Güvendir; Gül Güler; Tugay Kaçak – Measurement: Interdisciplinary Research and Perspectives, 2025
In this study, the extent to which wording effects impact structure and factor loadings, internal consistency and measurement invariance was outlined. The modified form, which includes items that are semantically reversed, explains 21.5% more variance than the original form. Also, reversed items' factor loadings are higher. As a result of CFA, indexes…
Descriptors: Test Items, Factor Structure, Test Reliability, Semantics
Lee Wolff; Haydn Till; Bruce Watt – Journal of Psychoeducational Assessment, 2025
Fetal Alcohol Spectrum Disorder (FASD) is a significant public health concern arising from prenatal alcohol exposure. This study examines the clinical utility of Wechsler intelligence tests in assessing cognition in 108 children with confirmed prenatal alcohol exposure. Data were analysed using multidimensional scaling and Guttman's Structural…
Descriptors: Fetal Alcohol Syndrome, Intelligence Tests, Children, Multidimensional Scaling
Christoph Ableitinger; Christian Dorner – International Journal of Mathematical Education in Science and Technology, 2025
The number of complaints university lecturers make about a lack of knowledge, especially first-year students' procedural knowledge, has increased recently. Due to missing adequate empirical evidence, a survey of procedural knowledge among students of Austrian high schools in their final year was conducted. For this purpose, test items for…
Descriptors: Knowledge Level, Cognitive Processes, High School Seniors, Foreign Countries
Betul Aydin; Suleyman Sadi Seferoglu – Turkish Online Journal of Distance Education, 2025
This study aims to conduct validity and reliability analyses of a measurement tool developed to determine university students' levels of digital risk taking. A total of 646 undergraduate students from 8 different universities voluntarily participated in the study. Exploratory and confirmatory factor analyses were conducted to reveal the factor structure of the…
Descriptors: Undergraduate Students, Measures (Individuals), Risk, Test Validity
Ina Mielke; Simon M. Breil; Johanna Hissbach; Maren Ehrhardt; Mirjana Knorr – Advances in Health Sciences Education, 2025
Situational Judgement Tests (SJTs) are popular to screen for social skills during undergraduate medical admission as they have been shown to predict relevant study outcomes. Two different types of SJTs can be distinguished: Traditional SJTs, which measure general effective behavior, and construct-driven SJTs which are designed to measure specific…
Descriptors: Undergraduate Students, Situational Tests, Medical Students, Foreign Countries
Justin Jihao Hong; Victor Lei; Xuan Li – Annenberg Institute for School Reform at Brown University, 2025
High-stakes exams are often administered at designated test centers, requiring many students to test in unfamiliar environments. We investigate whether such arrangements impact students' test performance and, by extension, access to educational opportunities. Using unique administrative data from China's national college entrance examination…
Descriptors: Test Wiseness, Testing, High Stakes Tests, College Entrance Examinations
Mary Witt; Anna J. Esbensen; Ayesha Harisinghani; Nicolas M. Oreskovic; Michelle Palumbo; Stephanie L. Santoro – Journal of Applied Research in Intellectual Disabilities, 2025
Introduction: The Anxiety, Depression and Mood Scale (ADAMS), a mental health screening tool developed for individuals with intellectual disabilities, has yet to be evaluated in adults with Down syndrome. We included the ADAMS in a Dementia Protocol. Method: We reviewed the charts of 71 adults with Down syndrome seen in a specialty clinic and…
Descriptors: Anxiety, Depression (Psychology), Screening Tests, Down Syndrome
Danwei Cai; Ben Naismith; Maria Kostromitina; Zhongwei Teng; Kevin P. Yancey; Geoffrey T. LaFlair – Language Learning, 2025
Globalization and increases in the numbers of English language learners have led to a growing demand for English proficiency assessments of spoken language. In this paper, we describe the development of an automatic pronunciation scorer built on state-of-the-art deep neural network models. The model is trained on a bespoke human-rated dataset that…
Descriptors: Automation, Scoring, Pronunciation, Speech Tests
Berna Kiliç; Mahmut Selvi – International Journal of Assessment Tools in Education, 2025
It is important to determine the level of pedagogical content knowledge of teachers regarding skills. The aim of this study is to establish the theoretical framework of skill-specific pedagogical content knowledge and to develop a reliable and valid scale to measure teachers' entrepreneurship pedagogical content knowledge. The draft scale form was…
Descriptors: Entrepreneurship, Pedagogical Content Knowledge, Teacher Competency Testing, Test Reliability