Publication Date
| In 2026 | 0 |
| Since 2025 | 389 |
| Since 2022 (last 5 years) | 1887 |
| Since 2017 (last 10 years) | 4031 |
| Since 2007 (last 20 years) | 6737 |
Audience
| Practitioners | 644 |
| Teachers | 455 |
| Researchers | 440 |
| Administrators | 126 |
| Policymakers | 68 |
| Students | 68 |
| Counselors | 26 |
| Parents | 24 |
| Community | 10 |
| Support Staff | 5 |
| Media Staff | 3 |
Location
| Turkey | 603 |
| Australia | 339 |
| Canada | 254 |
| China | 180 |
| Indonesia | 147 |
| United States | 143 |
| United Kingdom | 130 |
| Germany | 116 |
| Taiwan | 111 |
| California | 109 |
| Spain | 107 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 2 |
Bangun Sartono; Widha Sunarno; Baskoro Adi Prayitno; Nurma Yunita Indriyanti – Educational Process: International Journal, 2025
Background/purpose: Critical thinking (CT) is fundamental in science education, but instruments to measure CT in specific domains, such as physics, are still limited. The present study aims to develop and validate the Critical Thinking Test in Temperature and Heat (CTTH), an instrument designed to measure critical thinking skills in the topic of…
Descriptors: Test Construction, Test Validity, Critical Thinking, Evaluation Methods
Ildiko Porter-Szucs; Cynthia J. Macknish; Suzanne Toohey – John Wiley & Sons, Inc, 2025
"A Practical Guide to Language Assessment" helps educators at every level redefine their approach to language assessment. Grounded in extensive research and aligned with the latest advances in language education, this comprehensive guide introduces foundational concepts and explores key principles in test development and item writing.…
Descriptors: Student Evaluation, Language Tests, Test Construction, Test Items
Kashinath Boral; Krishna Kanta Mondal – Journal of Educational Technology Systems, 2025
This study evaluates the performance of three leading AI chatbots--OpenAI's ChatGPT, Google's Gemini, and Microsoft Bing Copilot--in answering multiple choice questions (MCQs) from the UGC-NET Education paper. Using 150 randomly selected questions from examination cycles between June 2019 and December 2023, the chatbots' accuracy was assessed…
Descriptors: Artificial Intelligence, Technology Uses in Education, Multiple Choice Tests, Program Effectiveness
Brent A. Stevenor; Nadine LeBarron McBride; Charles Anyanwu – Journal of Applied Testing Technology, 2025
Enemy items are two test items that should not be presented to a candidate on the same test. Identifying enemies is essential for personnel assessment, as they weaken the measurement precision and validity of a test. In this research, we examined the effectiveness of lexical and semantic natural language processing techniques for identifying enemy…
Descriptors: Test Items, Natural Language Processing, Occupational Tests, Test Construction
Monica Casella; Pasquale Dolce; Michela Ponticorvo; Nicola Milano; Davide Marocco – Educational and Psychological Measurement, 2024
Short-form development is an important topic in psychometric research, which requires researchers to face methodological choices at different steps. The statistical techniques traditionally used for shortening tests, which belong to the so-called exploratory model, make assumptions not always verified in psychological data. This article proposes a…
Descriptors: Artificial Intelligence, Test Construction, Test Format, Psychometrics
Dian R. Sawitri; Seger Handoyo; Hasnida; Peter A. Creed; Unika Prihatsanti; Ika F. Kristiana; Mirwan S. Perdhana; Fajrianthi; Reza L. Sari; Etti Rahmawati; Siti Zahreni – International Journal for Educational and Vocational Guidance, 2024
We developed and provided initial validation for a 15-item scale for use with academics. In Phase 1, we utilized a review of the literature, focus groups, and expert feedback to generate 36 items. In Phase 2, we conducted item and exploratory factor analyses to reduce the number of items and assess the factor structure (N = 212; 51.4% female; mean…
Descriptors: Research and Development, Test Construction, Test Validity, Factor Analysis
A Method for Generating Course Test Questions Based on Natural Language Processing and Deep Learning
Hei-Chia Wang; Yu-Hung Chiang; I-Fan Chen – Education and Information Technologies, 2024
Assessment is viewed as an important means to understand learners' performance in the learning process. A good assessment method is based on high-quality examination questions. However, generating high-quality examination questions manually by teachers is a time-consuming task, and it is not easy for students to obtain question banks. To solve…
Descriptors: Natural Language Processing, Test Construction, Test Items, Models
Charlotte Lybaert; Lies Debruyne; Eva Kyndt; Fleur Marchand – Journal of Agricultural Education and Extension, 2024
Purpose: This article describes the development and validation of a survey designed to measure the vision of European agricultural advisors towards innovation. Design/Methodology/Approach: The items of the instrument were developed based on the conceptual framework provided by the position paper for the Transformative Innovation Policy Consortium.…
Descriptors: Foreign Countries, Agriculture, Test Validity, Test Construction
Osman Birgin; Elif Seval Peker – Psychology in the Schools, 2025
The aim of this study was to develop an instrument for assessing sixth-grade students' number sense skills in fractions and decimals. This study was conducted on 452 sixth graders (10-11 years old) from the western region of Turkey. The construct validity of the number sense test (NST) was examined via exploratory factor analysis (EFA) and…
Descriptors: Foreign Countries, Grade 6, Test Construction, Mathematics Education
Haixiang Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Mediation analysis is an important statistical tool in many research fields, where the joint significance test is widely utilized for examining mediation effects. Nevertheless, the limitation of this mediation testing method stems from its conservative Type I error, which reduces its statistical power and imposes certain constraints on its…
Descriptors: Structural Equation Models, Statistical Significance, Robustness (Statistics), Comparative Testing
Ed Harris; Katherine Curry; Jentre Olsen; Ashlyn Fiegener; Jam Khojasteh – Current Issues in Education, 2025
Wide agreement exists about the value and power of learning in social contexts, and social influences on learning have been studied from multiple perspectives. However, before this study, no known measure of the value of learning that happens in social spaces had been developed. This study introduces a scale to measure value created through…
Descriptors: Foreign Countries, Professional Development, Communities of Practice, Test Validity
Laura M. Crothers; Taylor Steeves; Jered B. Kolbert; James B. Schreiber; Ara J. Schmitt; Brianna Drischler; Kelly Paulson; Jessica Cowley; Amelia Klass; Athena Vafiadis; Kayla Perfetto – Contemporary School Psychology, 2025
In this exploratory study, we adapted items from a previously developed measure of job satisfaction, the Measure of Job Satisfaction (MJS), an instrument first developed for use with community nurses in the UK, to create a brief, 15-item instrument (Job Satisfaction--Brief) applicable to practitioners of school psychology from Pennsylvania (N =…
Descriptors: Job Satisfaction, School Psychology, School Psychologists, Factor Structure
Nan Xie; Zhengxu Li; Haipeng Lu; Wei Pang; Jiayin Song; Beier Lu – IEEE Transactions on Learning Technologies, 2025
Classroom engagement is a critical factor for evaluating students' learning outcomes and teachers' instructional strategies. Traditional methods for detecting classroom engagement, such as coding and questionnaires, are often limited by delays, subjectivity, and external interference. While some neural network models have been proposed to detect…
Descriptors: Learner Engagement, Artificial Intelligence, Technology Uses in Education, Educational Technology
Mehmet Emin Ören; Servet Atik – International Journal of Assessment Tools in Education, 2025
In this study, it was aimed to adapt the DigiFuehr 2.0 Scale developed by Claassen et al. (2023) to Turkish and to conduct validity and reliability studies on three groups of participants consisting of teachers. In the study, exploratory and confirmatory factor analyses were performed in line with translation study, linguistic application, and…
Descriptors: Test Reliability, Test Validity, Test Construction, Translation
Harold Doran; Tetsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

