Publication Date
| In 2026 | 0 |
| Since 2025 | 41 |
| Since 2022 (last 5 years) | 151 |
| Since 2017 (last 10 years) | 357 |
| Since 2007 (last 20 years) | 531 |
Descriptor
| Test Items | 755 |
| Test Reliability | 755 |
| Test Validity | 684 |
| Test Construction | 427 |
| Foreign Countries | 262 |
| Psychometrics | 164 |
| Item Analysis | 156 |
| Difficulty Level | 154 |
| Factor Analysis | 143 |
| Item Response Theory | 127 |
| Multiple Choice Tests | 94 |
Author
| Schoen, Robert C. | 10 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 4 |
| Stansfield, Charles W. | 4 |
| Bauduin, Charity | 3 |
| Farina, Kristy | 3 |
| Haladyna, Thomas M. | 3 |
| Paek, Insu | 3 |
| Petscher, Yaacov | 3 |
| Roid, Gale | 3 |
| Sachin Nedungadi | 3 |
Audience
| Practitioners | 29 |
| Teachers | 18 |
| Researchers | 16 |
| Administrators | 12 |
| Support Staff | 3 |
| Students | 2 |
| Community | 1 |
| Counselors | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 59 |
| Indonesia | 30 |
| China | 12 |
| Germany | 12 |
| Australia | 11 |
| Canada | 10 |
| Florida | 10 |
| California | 7 |
| India | 7 |
| Iran | 7 |
| Malaysia | 7 |
Laws, Policies, & Programs
| Individuals with Disabilities… | 4 |
| Every Student Succeeds Act… | 3 |
| Rehabilitation Act 1973… | 3 |
| No Child Left Behind Act 2001 | 2 |
| Head Start | 1 |
| Job Training Partnership Act… | 1 |
| United Nations Convention on… | 1 |
Shuchen Guo; Lehong Shi; Xiaoming Zhai – Education and Information Technologies, 2025
As artificial intelligence (AI) receives wider attention in education, examining teachers' acceptance of AI (TAAI) becomes essential. However, existing instruments measuring TAAI reported limited validity evidence and faced some design challenges, such as missing informed definitions of AI to participants. To fill this gap, this study developed…
Descriptors: Artificial Intelligence, Technology Uses in Education, Teacher Attitudes, Test Construction
Eyüp Yurt – International Journal of Education in Mathematics, Science and Technology, 2025
This study aimed to develop and validate the Creative Problem-Solving Skills Test (CPSS-T), grounded in Torrance's creativity theory, to assess these skills in university students. The CPSS-T consists of five open-ended question types, each designed to measure different aspects of creative problem-solving: Alternative Use, Hypothetical Scenario,…
Descriptors: Creativity Tests, Creativity, Creative Thinking, Problem Solving
Fadime Hatice Inci; Ferhat Çelik – Psychology in the Schools, 2025
The aim of this study is to examine the validity, reliability, and responsiveness of the Turkish version of the Adolescent Health Promotion-Short Form (AHP-SF). This cross-sectional study was completed with 1483 students. Confirmatory factor analysis (CFA) supported the construct validity of the scale, demonstrating a good model fit with…
Descriptors: Foreign Countries, Measures (Individuals), Adolescents, Health Promotion
Dwi Rismi Ocy; Iva Sarifah; Riyadi – Journal of Research and Advances in Mathematics Education, 2025
Mathematical abstraction skills are fundamental for advanced reasoning and problem-solving, yet assessing these skills in senior high school students poses challenges due to limited validated instruments. This study aims to develop and validate a test instrument for measuring mathematical abstraction skills in Indonesian high school students. The…
Descriptors: Abstract Reasoning, Mathematics Tests, Mathematics Instruction, Mathematics Teachers
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveals that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Sarah K. Cowan; Michael Hout; Stuart Perrett – Sociological Methods & Research, 2024
Long-running surveys need a systematic way to reflect social change and to keep items relevant to respondents, especially when they ask about controversial subjects, or they threaten the items' validity. We propose a protocol for updating measures that preserves content and construct validity. First, substantive experts articulate the current and…
Descriptors: Surveys, Public Opinion, Social Attitudes, Pregnancy
Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025
This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…
Descriptors: Artificial Intelligence, Test Items, Automation, Test Format
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attributes. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Mohammad Nayef Ayasrah; Mohamad Ahmad Saleem Khasawneh; Mazen Omar Almulla; Amoura Hassan Aboutaleb – Journal of Computer Assisted Learning, 2025
Background: One area that has been dramatically changed by artificial intelligence (AI) is educational environments. Chatbots, Recommender Systems, Adaptive Learning Systems and Large Language Models have been emerging as practical tools for facilitating learning. However, using such tools appropriately is challenging. In this regard, the…
Descriptors: Test Construction, Test Validity, Test Reliability, Rating Scales
Leo, Francisco M.; Fernández-Río, Javier; Pulido, Juan J.; Rodríguez-González, Pablo; López-Gajardo, Miguel A. – Social Psychology of Education: An International Journal, 2023
The aim of this study was to develop and validate a psychometrically-sound instrument to assess students' perceptions about class cohesion. Two studies were conducted. In Study 1, four steps were established: (1) development of the Class Cohesion Questionnaire (CCQ); (2) item selection; (3) item compression; and (4) exploration of psychometric…
Descriptors: Classroom Environment, Group Unity, Elementary School Students, Secondary School Students
Christopher J. Anthony; Stephen N. Elliott – School Mental Health, 2025
Stress is a complex construct that is related to resilience and general health starting in childhood. Despite its importance for student health and well-being, there are few measures of stress designed for school-based applications. In this study, we developed and initially validated a Stress Indicators Scale using five samples of teachers,…
Descriptors: Test Construction, Stress Variables, Test Validity, Test Items
Meyer, J. Patrick; Hu, Ann; Li, Sylvia – NWEA, 2023
The Content Proximity Project was designed to improve the content validity of the MAP® Growth™ assessments while retaining the ability for the test to adapt off-grade and meet students wherever they are in their learning. Two main features of the project were the development of an enhanced item selection algorithm, and a spring pilot study…
Descriptors: Achievement Tests, Mathematics Achievement, Content Validity, Mathematics Tests
Güntay Tasçi – Science Insights Education Frontiers, 2024
The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed-book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30 x free-response short answer to a question, SAQ) and end-of-year paper (4 x SAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Ntumi, Simon; Agbenyo, Sheilla; Bulala, Tapela – Shanlax International Journal of Education, 2023
There is no need or point to testing the knowledge, attributes, traits, behaviours or abilities of an individual if the information obtained from the test is inaccurate. However, by and large, it seems the estimation of psychometric properties of test items in classrooms has been completely ignored or is otherwise dying slowly in most testing environments. In…
Descriptors: Psychometrics, Accuracy, Test Validity, Factor Analysis

