Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Nurussaniah Nurussaniah; Punaji Setyosari; Dedi Kuswandi; Saida Ulfa – Journal of Baltic Science Education, 2025
The accurate assessment of analytical thinking in physics, particularly in magnetism, poses substantial challenges due to the limitations of conventional tools in measuring higher-order cognitive skills. This study aimed to validate an analytical skills test in physics, based on Bloom's Revised Taxonomy, with an emphasis on the dimensions of…
Descriptors: Physics, Science Tests, Science Instruction, Thinking Skills
Sparks, Jesse R.; van Rijn, Peter W.; Deane, Paul – Educational Assessment, 2021
Effectively evaluating the credibility and accuracy of multiple sources is critical for college readiness. We developed 24 source evaluation tasks spanning four predicted difficulty levels of a hypothesized learning progression (LP) and piloted these tasks to evaluate the utility of an LP-based approach to designing formative literacy assessments.…
Descriptors: Middle School Students, Information Sources, Grade 6, Grade 7
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
DeCandia, Carmela J.; Unick, George J.; Volk, Katherine T. – Journal of Psychoeducational Assessment, 2021
The Neurodevelopmental Ecological Screening Tool (NEST) is a new instrument to screen children for developmental challenges. This article describes the validation of the NEST neurodevelopmental domain. Data were collected from a nationwide purposely restricted sample of caregivers of children aged 3-5 years (n = 231) living in poverty and…
Descriptors: Screening Tests, Preschool Children, Child Development, Poverty
Stephanie M. Werner; Ying Chen; Mike Stieff – Grantee Submission, 2021
The Chemistry Self-Concept Inventory (CSCI) is a widely used instrument within chemistry education research. Yet, agreement on its overall reliability and validity is lacking, and psychometric analyses of the instrument remain outstanding. This study examined the psychometric properties of the subscale and item function of the CSCI on 1,140 high…
Descriptors: Self Concept Measures, Chemistry, Psychometrics, Item Response Theory
Villarreal, Victor – Journal of Psychoeducational Assessment, 2019
The "Rating Scale of Impairment" (RSI; Goldstein & Naglieri, 2016b) is a norm-referenced measure of functional impairment. The RSI measures impairment in six domains, as well as overall impairment, based in part on the International Classification of Functioning, Disability, and Health. Functional impairment, as defined by the ICF…
Descriptors: Rating Scales, Norm Referenced Tests, Disabilities, Test Construction
Rigney, Alexander M. – Journal of Psychoeducational Assessment, 2019
The "Detroit Tests of Learning Aptitude" has been in use for more than three quarters of a century (Baker & Leland, 1935). Its longevity in the field speaks to its popularity as a broad measure of cognitive abilities. Its most recent iteration, in the form of the "Detroit Tests of Learning Abilities--Fifth Edition" (DTLA-5;…
Descriptors: Aptitude Tests, Cognitive Ability, Test Construction, Test Items
Mandracchia, Nina R.; Sims, Wesley A. – Computers in the Schools, 2020
As technology use continues to rapidly increase, so too does consumer use of web-based resources. While important, accessibility is often overemphasized by users when consuming and evaluating web resources. This prioritization may have particularly negative consequences for the selection of supports or interventions in educational settings. This…
Descriptors: Internet, Resources, Selection, Rating Scales
Strait, Julia Englund; Dawson, Peg; Walther, Christine A. P.; Strait, Gerald Gill; Barton, Amy K.; Brunson McClain, Maryellen – Contemporary School Psychology, 2020
Executive functioning (EF) skills are vital for academic success. Along with the recent explosion of interventions targeting these skills comes the need for affordable, efficient, and ecologically valid measures for planning and tailoring interventions and monitoring outcomes. The current study describes the refinement and initial psychometric…
Descriptors: Executive Function, Questionnaires, Rating Scales, Test Items
Rao, Dhawaleswar; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2020
Automatic multiple choice question (MCQ) generation from a text is a popular research area. MCQs are widely accepted for large-scale assessment in various domains and applications. However, manual generation of MCQs is expensive and time-consuming. Therefore, researchers have been attracted toward automatic MCQ generation since the late 90's.…
Descriptors: Multiple Choice Tests, Test Construction, Automation, Computer Software
Tutar, Miyase; Karamustafaoglu, Orhan – International Journal of Curriculum and Instruction, 2020
This study is aimed to develop an adaptation test to determine the adaptation level of the 6th grade students about the concepts and topics to their daily life included in the unit ''Systems and Health in Our Body''. The test items were prepared by considering the reasons of the problems that they can encounter in their daily life about the…
Descriptors: Test Construction, Grade 6, Human Body, Health Education
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023
Background: School feeding programs' evaluation requires the measurement of food insecurity, a more objective indicator, within school in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…
Descriptors: Hunger, Food, Program Effectiveness, Psychometrics
Caroline G. Hodgson; Wes Bonifay; Wenxi Yang; Keith C. Herman – Grantee Submission, 2023
Background: Technically sound measures are necessary for accurately identifying youth at risk for depression, but many studies rely on classical test theory metrics or adult samples to evaluate measures. This study examined the use of the PHQ-8, a common and freely available pediatric depression screener, in an adolescent sample using item…
Descriptors: Depression (Psychology), Measurement, Screening Tests, Adolescents
Astuti, Budi; Purwanta, Edi; Ayriza, Yulia; Bhakti, Caraka Putra; Lestari, Rizqi; Herwin, Herwin – Cypriot Journal of Educational Sciences, 2022
This study aims to produce a school connectedness instrument for high school students in Yogyakarta. This research is quantitative and descriptive. The sample in this study was 526 high school students in Yogyakarta who had implemented online learning. The sampling technique used is simple random sampling. The data collection of this research was…
Descriptors: Foreign Countries, High School Students, Student School Relationship, COVID-19

Peer reviewed
Direct link
