Abdalla, Widad – ProQuest LLC, 2019
Trend scoring is often used in large-scale assessments to monitor for rater drift when the same constructed response items are administered in multiple test administrations. In trend scoring, a set of responses from Time "A" are rescored by raters at Time "B." The purpose of this study is to examine the ability of…
Descriptors: Scoring, Interrater Reliability, Test Items, Error Patterns
Zhong Jian Chee; Anke M. Scheeren; Marieke de Vries – Autism: The International Journal of Research and Practice, 2024
Despite several psychometric advantages over the 50-item Autism Spectrum Quotient, an instrument used to measure autistic traits, the abridged AQ-28 and its cross-cultural validity have not been examined as extensively. Therefore, this study aimed to examine the factor structure and measurement invariance of the AQ-28 in 818 Dutch (M[subscript…
Descriptors: Autism Spectrum Disorders, Questionnaires, Factor Structure, Factor Analysis
Jose A. Diaz; Steven M. Nelson; A. Alexander Beaujean; Adam E. Green; Michael K. Scullin – Creativity Research Journal, 2024
The compound Remote Associates Test (RAT) is a classic measure of creativity. Participants are shown three cue words (sore-shoulder-sweat) and asked to generate a word that connects them (cold). Theoretical views of RAT performance differ in the degree to which they conceptualize performance as depending on automatic spreading activation across…
Descriptors: Test Items, Creative Thinking, Creativity Tests, Performance
Rayne Bozeman; Robyn K. Mallett; Linas Mitchell; R. Scott Tindale – Active Learning in Higher Education, 2024
Two-phase testing assesses individual performance (phase 1) and then allows collaborative learning within small groups (phase 2). While groups typically outperform individuals, less is known about the social decision schemes that influence member collaboration. In a classroom setting, we compared individual and group performance on a standard test…
Descriptors: Testing, Group Testing, Cooperative Learning, Learning Experience
Zofia Mazur-Socha; Mariola Laguna; Peter Gollwitzer – Music Education Research, 2024
This article reports on the development and validation of the Instrumental Practice Goal Realization Inventory (IPGRI) designed to assess the process of self-directed study, beginning with setting the intention to practice and ending with the evaluation of one's performance. This new tool is based on the theoretical model of action phases. The…
Descriptors: Music Education, Music Activities, Measures (Individuals), Test Construction
Büsra Kilinç; Mehmet Diyaddin Yasar – Science Insights Education Frontiers, 2024
This study aimed to develop an achievement test that takes into account the subject acquisitions of the sound and properties unit in the sixth-grade science course. In the test development phase, a literature review was first conducted. Then, 30 multiple-choice questions aligned with the subject acquisition in the 2018…
Descriptors: Science Tests, Test Construction, Grade 6, Science Instruction
Yajun Wei; Xiaotong Chen; Yi Zhong; Guangyi Liu; Mengjun Wang; Feipeng Pi; Changhong Li – Journal of Baltic Science Education, 2024
Numerous studies have compared the effectiveness of various formats of video-based teaching, yet their focus has primarily been on relatively straightforward content, such as concepts and basic procedures. Research on the effectiveness of teaching complex content through different formats of videos remains limited. This study addresses this gap by…
Descriptors: Physics, Science Instruction, Problem Solving, Video Technology
Qian Liu; Navé Wald; Chandima Daskon; Tony Harland – Innovations in Education and Teaching International, 2024
This qualitative study looks at multiple-choice questions (MCQs) in examinations and their effectiveness in testing higher-order cognition. While there are claims that MCQs can do this, we consider many assertions problematic because of the difficulty in interpreting what higher-order cognition consists of and whether or not assessment tasks…
Descriptors: Multiple Choice Tests, Critical Thinking, College Faculty, Student Evaluation
Amber DeBono; Michele Heimbauer; Elizabeth Mendelsohn – Society for Research on Educational Effectiveness, 2024
Prior Theory: Social-emotional learning (SEL) is the ability to manage emotions and social interactions adaptively to succeed in school, at work, in relationships, and in communities (Jones & Doolittle, 2017). The most cited SEL theory is the Collaborative for Academic, Social, and Emotional Learning (CASEL) 5 model, which focuses on promoting…
Descriptors: Social Emotional Learning, Students with Disabilities, Learning Disabilities, Test Validity
Hae In Park – English Teaching, 2024
The present study aimed to validate a 70-item Korean bilingual version of the Vocabulary Size Test (VST) using Rasch modeling. The goal was to assess the applicability of this Korean version of the VST for Korean learners of English in an English as a foreign language (EFL) context by examining validity evidence based on Messick's framework.…
Descriptors: Korean, Bilingualism, English (Second Language), Second Language Learning
Jo Lein; Jennifer Gripado – Learning Professional, 2024
There are many valuable sources of evaluation data, including -- but not limited to -- professional learning participants. In the authors' work on leadership development and organizational learning for Tulsa Public Schools in Oklahoma, they regularly ask educators to share feedback and perceptions of usefulness of their professional learning. The…
Descriptors: Participant Satisfaction, Surveys, Test Items, Feedback (Response)
Cobern, William W.; Adams, Betty A. J. – International Journal of Assessment Tools in Education, 2020
What follows is a practical guide for establishing the validity of a survey for research purposes. The motivation for providing this guide is our observation that researchers, not necessarily being survey researchers per se, but wanting to use a survey method, lack a concise resource on validity. There is far more to know about surveys and survey…
Descriptors: Surveys, Test Validity, Test Construction, Test Items
Antino, Mirko; Alvarado, Jesús M.; Asún, Rodrigo A.; Bliese, Paul – Sociological Methods & Research, 2020
The need to determine the correct dimensionality of theoretical constructs and generate valid measurement instruments when underlying items are categorical has generated a significant volume of research in the social sciences. This article presents two studies contrasting different categorical exploratory techniques. The first study compares…
Descriptors: Nonparametric Statistics, Factor Analysis, Item Analysis, Robustness (Statistics)
Köhler, Carmen; Robitzsch, Alexander; Hartig, Johannes – Journal of Educational and Behavioral Statistics, 2020
Testing whether items fit the assumptions of an item response theory model is an important step in evaluating a test. In the literature, numerous item fit statistics exist, many of which show severe limitations. The current study investigates the root mean squared deviation (RMSD) item fit statistic, which is used for evaluating item fit in…
Descriptors: Test Items, Goodness of Fit, Statistics, Bias
Abdelhamid, Gomaa S. M.; Gómez-Benito, Juana; Abdeltawwab, Ahmed T. M.; Abu Bakr, Mostafa H. S.; Kazem, Amina M. – Journal of Psychoeducational Assessment, 2020
The fourth edition of the Wechsler Adult Intelligence Scale (WAIS-IV) has been used extensively for assessing adult intelligence. This study uses Mokken scale analysis to investigate the psychometric properties of WAIS-IV subtests adapted for the Egyptian population in a sample of 250 adults between 18 and 25 years of age. The monotone…
Descriptors: Foreign Countries, Item Analysis, Adults, Intelligence Tests

