Publication Date
| In 2026 | 0 |
| Since 2025 | 6 |
| Since 2022 (last 5 years) | 14 |
| Since 2017 (last 10 years) | 39 |
| Since 2007 (last 20 years) | 73 |
Descriptor
| Test Reliability | 252 |
| Tests | 252 |
| Test Validity | 238 |
| Test Construction | 75 |
| Testing | 41 |
| Foreign Countries | 32 |
| Evaluation Methods | 31 |
| Scoring | 26 |
| Test Interpretation | 26 |
| Student Evaluation | 24 |
| Item Analysis | 23 |
Author
| Skoczylas, Rudolph V. | 3 |
| Guthrie, P. D. | 2 |
| Göçer, Ali | 2 |
| Hoepfner, Ralph | 2 |
| Koos, Eugenia M. | 2 |
| Linn, Robert L. | 2 |
| Miles, David T. | 2 |
| Petrosko, Joseph M. | 2 |
| Weiss, David J. | 2 |
| Abdullah Uysal | 1 |
| Adkins, Dorothy C. | 1 |
Education Level
| Higher Education | 21 |
| Postsecondary Education | 19 |
| Elementary Education | 17 |
| Secondary Education | 14 |
| Junior High Schools | 7 |
| Middle Schools | 7 |
| Elementary Secondary Education | 5 |
| Grade 8 | 5 |
| Grade 3 | 4 |
| Grade 5 | 4 |
| Grade 7 | 4 |
Location
| New York | 6 |
| Turkey | 6 |
| Australia | 3 |
| Canada | 3 |
| Germany | 3 |
| United Kingdom | 3 |
| United Kingdom (England) | 3 |
| Jordan | 2 |
| New Jersey | 2 |
| Taiwan | 2 |
| United Kingdom (Great Britain) | 2 |
Clara González-Sanguino; Alba Ayuso-Lanchares; Sara Castrillo-San Mamés; Jairo Rodríguez-Medina – Journal of Applied Research in Intellectual Disabilities, 2025
Background: Mental health (MH) problems are more common in people with intellectual disabilities (ID), yet under-diagnosis persists, which may be partly due to a lack of appropriate assessment tools. This study presents a systematic review of instruments used to assess MH problems in Spanish-speaking adults with ID. Method: Following PRISMA…
Descriptors: Mental Health, Adults, Intellectual Disability, Spanish Speaking
Daniel González-Devesa; José Carlos Diz-Gómez; Miguel Adriano Sanchez-Lastra; Aroa Otero Rodríguez; Carlos Ayán-Pérez – Measurement in Physical Education and Exercise Science, 2025
The aim of this study is to examine the available scientific evidence on the reliability and criterion validity of the 6-minute run-walk field-based test when administered to children and adolescents. Systematic searches were performed in three electronic databases (MEDLINE/PubMed, SPORTDiscus, and Scopus) from their inception until February 2024,…
Descriptors: Child Health, Health Related Fitness, Literature Reviews, Meta Analysis
Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025
Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…
Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques
Yücel Makaraci; Kazim Nas; Kerem Gündüz; Abdullah Uysal; Samuel T. Orange; Juan D. Ruiz-Cárdenas – Measurement in Physical Education and Exercise Science, 2024
The aim was to determine the validity and test-retest reliability of the Sit to Stand App variables (rising time, vertical velocity, and power) for measuring single-leg sit-to-stand (STS) test compared to those derived from ground reaction force data. Twenty-seven female athletes performed the single-leg STS test over three consecutive sessions…
Descriptors: Computer Simulation, Measurement Techniques, Athletics, Physical Fitness
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Anne Wicks; Robin Berkley – George W. Bush Institute, 2025
Assessments are one of the most important--and often misunderstood--elements of education. In most cases, tests are administered by the state as well as by districts and schools. Assessments at each of these levels have distinct purposes, yield different information, and are part of a powerful, coordinated approach to improving student outcomes.…
Descriptors: Student Evaluation, Testing, Tests, Standardized Tests
de Ruiter, Laura E.; Bers, Marina U. – Computer Science Education, 2022
Background and Context: Despite the increasing implementation of coding in early curricula, there are few valid and reliable assessments of coding abilities for young children. This impedes studying learning outcomes and the development and evaluation of curricula. Objective: Developing and validating a new instrument for assessing young…
Descriptors: Programming Languages, Computer Software, Coding, Computer Science Education
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Miranda, Constanza; Goñi, Julian; Pickenpack, Astrid; Sotomayor, Trinidad – International Journal of Technology and Design Education, 2022
K-12 Engineering Education has placed a lot of attention on students' attitudes or predispositions towards science and technology. However, most assessment methods are focused on STEM as a whole or only on technology. In this article, we will discuss the instrument called Technology and Engineering Attitude Scale (TEAS) which focuses on attitudes…
Descriptors: Elementary Secondary Education, Engineering Education, Test Validity, Foreign Countries
Liu, Xiaolu; Keating, Xiaofen D. – European Physical Education Review, 2021
Pre-service physical education teachers (PPETs) may be implementing health-related fitness testing (HRFT) in schools in the future. Thus, exploring their attitudes toward HRFT would help us understand physical education (PE) teachers' attitudes toward HRFT. This study investigated PPET attitudes toward HRFT in the USA and the effects of teacher…
Descriptors: Preservice Teachers, Physical Education Teachers, Student Attitudes, Physical Fitness
Alkis Küçükaydin, Mensure; Akkanat, Çigdem – Problems of Education in the 21st Century, 2022
Computational thinking is recognized as a vital skill related to problem-solving in technological and non-technological fields. The existence of different sub-domains related to this skill has been pointed out. Therefore, there is a need for tools that measure these different sub-domains. Because of its structure that includes different skills,…
Descriptors: Elementary School Students, Thinking Skills, Computation, Tests
Schmitz, Boris; Pfeifer, Carina; Thorwesten, Lothar; Krüger, Michael; Klose, Andreas; Brand, Stefan-Martin – Research Quarterly for Exercise and Sport, 2020
Purpose: This study analyzed the physiological response during Yo-Yo Intermittent Recovery Level 1 (YYIR1) test and re-test by in-field ergospirometry and time-series analyses of respiratory parameters. Methods: Ten moderately trained males (23.4 ± 2.01 years, VO[subscript 2peak]= 56.81 ± 10.75 mL·kg[superscript -1]·min[superscript -1]) completed…
Descriptors: Exercise Physiology, Males, Physical Activities, Test Validity
Jelicic, Mario; Ivancev, Vladimir; Cular, Dražen; Covic, Nedim; Stojanovic, Emilija; Scanlan, Aaron T.; Milanovic, Zoran – Research Quarterly for Exercise and Sport, 2020
Purpose: The purpose of this study was to determine the reliability, validity, and usefulness of the 30-15 Intermittent Fitness Test (30-15[subscript IFT]) in female basketball players. Methods: Nineteen female basketball players (17.82 ± 1.94 yr, 175.4 ± 7.3 cm, 67.9 ± 7.7 kg) competing in the National Croatian League performed one trial of a…
Descriptors: Physical Fitness, Females, Athletes, Team Sports

