Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Ozyeter, Neslihan Tugce – International Journal of Assessment Tools in Education, 2022
In education, examining students' learning in detail, determining their strengths and weaknesses and giving effective feedback have gained importance over time. The aim of this study is to determine the distribution of students' answers to the reading comprehension achievement test items which were written at different cognitive levels and to…
Descriptors: Student Evaluation, Feedback (Response), Scoring Rubrics, Reading Comprehension
Hrnjicic, Anela; Alihodžic, Adis; Cunjalo, Fikret; Kamber Hamzic, Dina – European Journal of Science and Mathematics Education, 2022
It is known that students have many misconceptions about concepts related to function. By discovering misconceptions using an appropriate measurement instrument, we can determine what changes we need to make in the real functions curriculum to improve learning outcomes. Therefore, we designed an item bank for measuring conceptual understandings of…
Descriptors: Item Banks, Item Analysis, Test Items, College Freshmen
Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…
Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations
Lundgren, Erik – Journal of Educational Data Mining, 2022
Response process data have the potential to provide a rich description of test-takers' thinking processes. However, retrieving insights from these data presents a challenge for educational assessments and educational data mining as they are complex and not well annotated. The present study addresses this challenge by developing a computational…
Descriptors: Problem Solving, Classification, Accuracy, Foreign Countries
Sutiarso, Sugeng; Rosidin, Undang; Sulistiawan, Aan – European Journal of Educational Research, 2022
This research is a developmental research aiming at developing a good mathematical test instrument using polytomous responses based on classical and modern theories. This research design uses the Plomp model, which consists of five stages, (1) preliminary investigation, (2) design, (3) realization/construction, (4) revision, and (5) implementation…
Descriptors: Mathematics Instruction, Mathematics Tests, Item Response Theory, Test Items
Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022
This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…
Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages
Kim, Dong-In; Julian, Marc; Hermann, Pam – Online Submission, 2022
In test equating, one critical equating property is the group invariance property which indicates that the equating function used to convert performance on each alternate form to the reporting scale should be the same for various subgroups. To mitigate the impact of disrupted learning on the item parameters during the COVID-19 pandemic, a…
Descriptors: COVID-19, Pandemics, Test Format, Equated Scores
David J. Francis; Paulina A. Kulesz; Shiva Khalaf; Martin Walczak; Sharon R. Vaughn – Grantee Submission, 2022
Intervention research in education is sometimes criticized for the use of experimenter developed assessments, especially when these are over aligned with treatment. At the same time, intervention researchers sometimes prefer locally developed assessments because they appear to be more sensitive to treatment effects even when the test is not…
Descriptors: Intervention, Standardized Tests, Test Items, Grade 8
Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020
It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…
Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability
Inaltekin, Tufan; Goksu, Volkan – International Journal of Progressive Education, 2020
The aim of this study is to analyse the science questions in terms of visual content in the higher education entrance exams in Turkey. In this context, 1714 questions in total prepared by the Center for Measurement, Selection and Placement (CMSP) between 1999 and 2019 in the fields of Physics (n=631), Chemistry (n=553) and Biology (n=530)…
Descriptors: Foreign Countries, College Entrance Examinations, Science Tests, Test Items
Sauder, Derek; DeMars, Christine – Applied Measurement in Education, 2020
We used simulation techniques to assess the item-level and familywise Type I error control and power of an IRT item-fit statistic, the S-X². Previous research indicated that the S-X² has good Type I error control and decent power, but no previous research examined familywise Type I error control.…
Descriptors: Item Response Theory, Test Items, Sample Size, Test Length
Tang, Xiaodan; Schultz, Matthew – Practical Assessment, Research & Evaluation, 2020
This study aims to examine the potential impacts on repeat examinees' performance by reusing simulation-based items in a high-stakes standardized assessment. We examined change patterns of item scores, ability estimate, score pattern change, response time and compared the performance of repeat examinees who have received repeat items and those who…
Descriptors: Test Items, Repetition, Simulation, Standardized Tests
Wu, Haiyan; Liang, Xinya; Yürekli, Hülya; Becker, Betsy Jane; Paek, Insu; Binici, Salih – Journal of Psychoeducational Assessment, 2020
The demand for diagnostic feedback has triggered extensive research on cognitive diagnostic models (CDMs), such as the deterministic input, noisy output "and" gate (DINA) model. This study explored two Q-matrix specifications with the DINA model in a statewide large-scale mathematics assessment. The first Q-matrix was developed based on…
Descriptors: Mathematics Tests, Cognitive Measurement, Models, Test Items
Eaton, Philip; Frank, Barrett; Willoughby, Shannon – Physical Review Physics Education Research, 2020
Items that are chained, or blocked, together appear on many of the conceptual assessments utilized for physics education research. However, when items are chained together there is the potential to introduce local dependence between those items, which would violate the assumption of item independence required by classical test theory,…
Descriptors: Science Instruction, Physics, Motion, Scientific Concepts
Petscher, Yaacov; Compton, Donald L.; Steacy, Laura; Kinnon, Hannah – Annals of Dyslexia, 2020
Models of word reading that simultaneously take into account item-level and person-level fixed and random effects are broadly known as explanatory item response models (EIRM). Although many variants of the EIRM are available, the field has generally focused on the doubly explanatory model for modeling individual differences on item responses.…
Descriptors: Item Response Theory, Reading Skills, Individual Differences, Models

