Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 37 |
| Since 2017 (last 10 years) | 82 |
| Since 2007 (last 20 years) | 228 |
Descriptor
| Item Analysis | 502 |
| Test Construction | 502 |
| Test Validity | 425 |
| Test Reliability | 274 |
| Test Items | 184 |
| Foreign Countries | 121 |
| Factor Analysis | 113 |
| Psychometrics | 95 |
| Multiple Choice Tests | 71 |
| Achievement Tests | 63 |
| Statistical Analysis | 61 |
| More ▼ | |
Source
Author
| Hambleton, Ronald K. | 5 |
| Dedrick, Robert F. | 4 |
| Ferron, John | 4 |
| Haladyna, Tom | 4 |
| Roid, Gale | 4 |
| Shaunessy-Dedrick, Elizabeth | 4 |
| Suldo, Shannon M. | 4 |
| Echternacht, Gary | 3 |
| Filby, Nikola N. | 3 |
| Green, Donald Ross | 3 |
| Pyrczak, Fred | 3 |
| More ▼ | |
Publication Type
Education Level
Laws, Policies, & Programs
| Individuals with Disabilities… | 2 |
| No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Güntay Tasçi – Science Insights Education Frontiers, 2024
The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
Gregory H. Peterson; Michael B. Kozlowski – Measurement and Evaluation in Counseling and Development, 2024
This study aimed to develop a scale to assess counselors' ability to provide counseling to address the mental health impacts of climate change. Over three studies, we provide reliability and validity evidence for a Climate Change Counseling Scale (3CS) in a large representative sample of counselors across the US. In study one and two, an…
Descriptors: Counselors, Mental Health, Climate, Test Construction
Lawrence Scahill; Luc Lecavalier; Michael C. Edwards; Megan L. Wenzell; Leah M. Barto; Arielle Mulligan; Auscia T. Williams; Opal Ousley; Cynthia B. Sinha; Christopher A. Taylor; Soo Youn Kim; Laura M. Johnson; Scott E. Gillespie; Cynthia R. Johnson – Autism: The International Journal of Research and Practice, 2024
This report presents a new parent-rated outcome measure of insomnia for children with autism spectrum disorder. Parents of 1185 children with autism spectrum disorder (aged 3-12; 80.3% male) completed the first draft of the measure online. Factor and item response theory analyses reduced the set of 40 items to the final 21-item Pediatric Insomnia…
Descriptors: Autism Spectrum Disorders, Children, Sleep, Test Construction
Kimmia Lyon; Jessica B. Koslouski; Sandra M. Chafouleas; Amy M. Briesch; Jacqueline M. Caemmerer – Grantee Submission, 2025
Existing educational assessments have typically been developed without appropriate attention to the intended and unintended consequences of measure implementation and interpretation. We are developing the Expanding Screening to Support Youth (ESSY) Whole Child Screener using a mixed methods approach that attends to the intended and unintended…
Descriptors: Student Attitudes, Screening Tests, Validity, Grade 3
Kimmia Lyon; Jessica B. Koslouski; Sandra M. Chafouleas; Amy M. Briesch; Jacqueline M. Caemmerer – School Mental Health, 2025
Existing educational assessments have typically been developed without appropriate attention to the intended and unintended consequences of measure implementation and interpretation. We are developing the Expanding Screening to Support Youth (ESSY) Whole Child Screener using a mixed methods approach that attends to the intended and unintended…
Descriptors: Student Attitudes, Screening Tests, Validity, Grade 3
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
Ismail, Fouzul Kareema Mohamed; Zubairi, Ainol Madziah Bt. – English Language Teaching, 2022
This paper presents the findings of a study that intended to seek the content validity (CV) evidence of an instrument to measure the reading ability of university students in Sri Lanka. The reading passages and items were adapted from CEFR aligned Learning Resource Network (LRN) materials. The items were designed based on the cognitive processing…
Descriptors: Foreign Countries, Test Items, Content Validity, Reading Tests
Meike Akveld; George Kinnear – International Journal of Mathematical Education in Science and Technology, 2024
Many universities use diagnostic tests to assess incoming students' preparedness for mathematics courses. Diagnostic test results can help students to identify topics where they need more practice and give lecturers a summary of strengths and weaknesses in their class. We demonstrate a process that can be used to make improvements to a mathematics…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Items, Item Analysis
Achmad Rante Suparman; Eli Rohaeti; Sri Wening – Journal on Efficiency and Responsibility in Education and Science, 2024
This study focuses on developing a five-tier chemical diagnostic test based on a computer-based test with 11 assessment categories with an assessment score from 0 to 10. A total of 20 items produced were validated by education experts, material experts, measurement experts, and media experts, and an average index of the Aiken test > 0.70 was…
Descriptors: Chemistry, Diagnostic Tests, Computer Assisted Testing, Credits
Dogan, Fatma; Aydin, Hasan – International Journal of Educational Reform, 2019
Applicability of multilingual education, which is applied in many countries, has increasingly proficiency and learning been a question of debate in Turkey because of the inclusion of living languages and dialects lessons into educational institutions. The purpose of this study is to develop a valid and reliable Likert-type scale to determine the…
Descriptors: Foreign Countries, Bilingual Education, Multilingualism, Test Construction
Yalalem Assefa; Bekalu Tadesse Moges; Shouket Ahmad Tilwani – Journal of Applied Research in Higher Education, 2024
Purpose: Lifelong learning has become one of the most interesting areas of research. Hence, the current study was aimed at developing and validating a tool that helps to study how well people working in higher education institutions are engaged in lifelong learning. Design/methodology/approach: A review of theories in the literature and experts'…
Descriptors: Lifelong Learning, Measures (Individuals), Likert Scales, Test Construction
Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025
Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…
Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries
Testing Anatomy: Dissecting Spatial and Non-Spatial Knowledge in Multiple-Choice Question Assessment
Julie Dickson; Darren J. Shaw; Andrew Gardiner; Susan Rhind – Anatomical Sciences Education, 2024
Limited research has been conducted on the spatial ability of veterinary students and how this is evaluated within anatomy assessments. This study describes the creation and evaluation of a split design multiple-choice question (MCQ) assessment (totaling 30 questions divided into 15 non-spatial MCQs and 15 spatial MCQs). Two cohorts were tested,…
Descriptors: Anatomy, Spatial Ability, Multiple Choice Tests, Factor Analysis

Peer reviewed
Direct link
