Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Pramudya Wahyu Pradana; Supahar – Knowledge Management & E-Learning, 2025
Sound waves are one of the important topics studied in physics. However, students' graph representation is still low, leading to their low concept understanding of physics learning. Therefore, it is necessary to optimize the learning process to improve students' graph representation by providing a valid and reliable graph representation test…
Descriptors: Graphs, Acoustics, Physics, Science Tests
Gerald Tindal; Cengiz Zopluoglu; Aaron Glasgow; James Llewellyn – Behavioral Research and Teaching, 2025
This technical report describes the entire development process for all mathematics items and tests used in CBMSkills. A brief introduction highlights the purpose of the assessment system, noting their important relationship with easyCBM. Given the items are designed to provide teachers diagnostic information on students' specific skills within…
Descriptors: Test Construction, Test Items, Mathematics Tests, Item Analysis
Jeong-eun Kim – English Teaching, 2025
This study investigated the thematic and lexical characteristics of high-difficulty English reading items--commonly referred to as "killer questions"--in the Korean College Scholastic Ability Test (CSAT) between 2018 and 2025. Using text mining methods, including Latent Dirichlet Allocation (LDA) and CEFR-based lexical profiling, the…
Descriptors: English (Second Language), Difficulty Level, Test Items, Questioning Techniques
Eyüp Yurt – International Journal of Education in Mathematics, Science and Technology, 2025
This study aimed to develop and validate the Creative Problem-Solving Skills Test (CPSS-T), grounded in Torrance's creativity theory, to assess these skills in university students. The CPSS-T consists of five open-ended question types, each designed to measure different aspects of creative problem-solving: Alternative Use, Hypothetical Scenario,…
Descriptors: Creativity Tests, Creativity, Creative Thinking, Problem Solving
Kübra Karakaya Özyer – International Journal of Technology in Education and Science, 2025
This study aims to delve into the process and perceptions of pre-service teachers as they engage in generating multiple-choice questions with the assistance of generative AI tools. Adopting a single-case study design, the research involved the participation of 35 pre-service teachers. The participants were tasked with utilizing generative AI tools…
Descriptors: Preservice Teachers, Preservice Teacher Education, Artificial Intelligence, Multiple Choice Tests
Metsämuuronen, Jari – International Journal of Educational Methodology, 2021
Although Goodman-Kruskal gamma (G) is used relatively rarely it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…
Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation
Brabec, Jordan Andrew; Pan, Steven C.; Bjork, Elizabeth Ligon; Bjork, Robert A. – Educational Psychology Review, 2021
Although widely used, the true-false test is often regarded as a superficial or even harmful test, one that lacks the pedagogical efficacy of more substantive tests (e.g., cued-recall or short-answer tests). Such charges, however, lack conclusive evidence and may, in some cases, be false. Across four experiments, we investigated how true-false…
Descriptors: Objective Tests, Accuracy, Cues, Recall (Psychology)
Deng, Jacky M.; Streja, Nicholas; Flynn, Alison B. – Journal of Chemical Education, 2021
Response process validity evidence can provide researchers with insight into how and why participants interpret items on instruments such as tests and questionnaires. In chemistry education research literature and the social sciences more broadly, response process validity evidence has been used and reported in a variety of ways. This paper's…
Descriptors: Chemistry, Science Education, Educational Research, Validity
Lanrong Li – ProQuest LLC, 2021
When developing a test, it is essential to ensure that the test is free of items with differential item functioning (DIF). DIF occurs when examinees of equal ability, but from different examinee subgroups, have different chances of getting the item correct. According to the multidimensional perspective, DIF occurs because the test measures more…
Descriptors: Test Bias, Test Items, Meta Analysis, Effect Size
Lanrong Li; Betsy Jane Becker – Journal of Educational Measurement, 2021
Differential bundle functioning (DBF) has been proposed to quantify the accumulated amount of differential item functioning (DIF) in an item cluster/bundle (Douglas, Roussos, and Stout). The simultaneous item bias test (SIBTEST, Shealy and Stout) has been used to test for DBF (e.g., Walker, Zhang, and Surber). Research on DBF may have the…
Descriptors: Test Bias, Test Items, Meta Analysis, Effect Size
Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023
This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…
Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory
Demirkaya, Onur; Bezirhan, Ummugul; Zhang, Jinming – Journal of Educational and Behavioral Statistics, 2023
Examinees with item preknowledge tend to obtain inflated test scores that undermine test score validity. With the availability of process data collected in computer-based assessments, the research on detecting item preknowledge has progressed on using both item scores and response times. Item revisit patterns of examinees can also be utilized as…
Descriptors: Test Items, Prior Learning, Knowledge Level, Reaction Time
Constantinou, Filio – Research Papers in Education, 2023
Examination questions need to be sufficiently novel if they are to be effective as measurement instruments. Novelty, however, presupposes creativity, suggesting that question writing is, or should be, a creative process. To explore the boundaries of creativity in question writing, this study made use of two data sources: two corpora of examination…
Descriptors: Test Items, Creativity, Writing (Composition), Test Construction
Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023
This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…
Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis
Man, Kaiwen; Harring, Jeffrey R. – Educational and Psychological Measurement, 2023
Preknowledge cheating jeopardizes the validity of inferences based on test results. Many methods have been developed to detect preknowledge cheating by jointly analyzing item responses and response times. Gaze fixations, an essential eye-tracker measure, can be utilized to help detect aberrant testing behavior with improved accuracy beyond using…
Descriptors: Cheating, Reaction Time, Test Items, Responses

