ERIC - Search Results

Publication Date

In 2026	0
Since 2025	37

Publication Type

Reports - Research	37
Journal Articles	35
Tests/Questionnaires	2
Speeches/Meeting Papers	1

Education Level

Higher Education	14
Postsecondary Education	14
Secondary Education	9
Junior High Schools	4
Middle Schools	4
Elementary Education	3
High Schools	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 2	1
Grade 8	1
Primary Education	1
More ▼

Audience

Location

Indonesia	4
Germany	3
China	2
Malaysia	2
South Korea	2
Taiwan	2
United Kingdom	2
Bosnia and Herzegovina	1
Chile	1
Iran	1
Oman	1
Qatar	1
Singapore	1
Slovakia	1
Switzerland	1
Thailand (Bangkok)	1
Turkey	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

International English…	2
Ages and Stages Questionnaires	1
Big Five Inventory	1
Program for International…	1
Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation

Peer reviewed
PDF on ERIC

Download full text

Leonidas Zotos; Hedderik van Rijn; Malvina Nissim – International Educational Data Mining Society, 2025

In an educational setting, an estimate of the difficulty of Multiple-Choice Questions (MCQs), a commonly used strategy to assess learning progress, constitutes very useful information for both teachers and students. Since human assessment is costly from multiple points of view, automatic approaches to MCQ item difficulty estimation are…

Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Artificial Intelligence

Evaluation of Exam Questions Using Bootstrapping: Practical Applications in R and SPSS with a Case Study

Peer reviewed

Direct link

Changiz Mohiyeddini – Anatomical Sciences Education, 2025

This article presents a step-by-step guide to using R and SPSS to bootstrap exam questions. Bootstrapping, a versatile nonparametric analytical technique, can help to improve the psychometric qualities of exam questions in the process of quality assurance. Bootstrapping is particularly useful in disciplines such as medical education, where student…

Descriptors: Test Items, Sampling, Statistical Inference, Nonparametric Statistics

Embedding Embedded Standard Setting: An Application of Cross-Classified Item Response Theory. CRESST Report 876

Download full text

Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025

This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…

Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level

From Framework to Functionality: A Cross-Country Analysis of PISA 2018 Reading Assessment Framework's Item Features as Determinants of Item Difficulty

Peer reviewed

Direct link

Kseniia Marcq; Johan Braeken – Large-scale Assessments in Education, 2025

Background: Theoretical frameworks excel in conceptualising reading literacy, yet their value hinges on their applicability for real-world purposes, such as assessment. By combining diverse theoretical frameworks, the Programme for International Student Assessment (PISA) 2018 designed an assessment framework for assessing the reading literacy of…

Descriptors: International Assessment, Achievement Tests, Foreign Countries, Secondary School Students

Revisiting the Lexical Differences between Academic and General Training IELTS Reading Tests

Peer reviewed

Direct link

Linh Thi Thao Le; Nam Thi Phuong Ho; Nguyen Huynh Trang; Hung Tan Ha – SAGE Open, 2025

The International English Language Testing System (IELTS) has served as one of the most reliable proofs of people's English language proficiency. There have been rumors about the discrepancy in difficulty between the two modules of IELTS, namely Academic (AC) and General Training (GT); however, there is little empirical evidence to confirm such a…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Reading Tests

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

Interaction of Social Deference and Cognitive Processing in the Prediction of Acquiescence

Peer reviewed

Direct link

Patrik Havan; Michal Kohút; Peter Halama – International Journal of Testing, 2025

Acquiescence is the tendency of participants to shift their responses to agreement. Lechner et al. (2019) introduced the following mechanisms of acquiescence: social deference and cognitive processing. We added their interaction into a theoretical framework. The sample consists of 557 participants. We found significant medium strong relationship…

Descriptors: Cognitive Processes, Attention, Difficulty Level, Reflection

How Do Alternative Gendered Linguistic Forms Affect Response Behavior in Surveys?

Peer reviewed

Direct link

Cornelia E. Neuert – Field Methods, 2025

Using masculine forms in surveys is still common practice, with researchers presumably assuming they operate in a generic way. However, the generic masculine has been found to lead to male-biased representations in various contexts. This article studies the effects of alternative gendered linguistic forms in surveys. The language forms are…

Descriptors: Language Usage, Surveys, Response Style (Tests), Gender Bias

Validation and Psychometric Properties of the Computational Thinking Multidimensional Test

Peer reviewed
PDF on ERIC

Download full text

Ali Orhan; Inan Tekin; Sedat Sen – International Journal of Assessment Tools in Education, 2025

In this study, it was aimed to translate and adapt the Computational Thinking Multidimensional Test (CTMT) developed by Kang et al. (2023) into Turkish and to investigate its psychometric qualities with Turkish university students. Following the translation procedures of the CTMT with 12 multiple-choice questions developed based on real-life…

Descriptors: Cognitive Tests, Thinking Skills, Computation, Test Validity

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed
PDF on ERIC

Download full text

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Empirically Deriving Cut Scores in the Positive Behavioral Interventions and Supports (PBIS) Tiered Fidelity Inventory (TFI) through a Bookmarking Process

Peer reviewed

Direct link

Jerin Kim; Kent McIntosh – Journal of Positive Behavior Interventions, 2025

We aimed to identify empirically valid cut scores on the positive behavioral interventions and supports (PBIS) Tiered Fidelity Inventory (TFI) through an expert panel process known as bookmarking. The TFI is a measurement tool to evaluate the fidelity of implementation of PBIS. In the bookmark method, experts reviewed all TFI items and item scores…

Descriptors: Positive Behavior Supports, Cutting Scores, Fidelity, Program Evaluation

Investigating Construct Validity of Cognitive Load Measurement Using Single-Item Subjective Rating Scales

Peer reviewed

Direct link

Katrin Schuessler; Vanessa Fischer; Maik Walpuski – Instructional Science: An International Journal of the Learning Sciences, 2025

Cognitive load studies are mostly centered on information on perceived cognitive load. Single-item subjective rating scales are the dominant measurement practice to investigate overall cognitive load. Usually, either invested mental effort or perceived task difficulty is used as an overall cognitive load measure. However, the extent to which the…

Descriptors: Cognitive Processes, Difficulty Level, Rating Scales, Construct Validity

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Inventory of Galilean Transformation of Uniform Linear Motion in Position-Time Graphs

Peer reviewed

Direct link

E.?B. Merki; S.?I. Hofer; A. Vaterlaus; A. Lichtenberger – Physical Review Physics Education Research, 2025

When describing motion in physics, the selection of a frame of reference is crucial. The graph of a moving object can look quite different based on the frame of reference. In recent years, various tests have been developed to assess the interpretation of kinematic graphs, but none of these tests have specifically addressed differences in reference…

Descriptors: Graphs, Motion, Physics, Secondary School Students

Unpacking the Impact of Item Difficulty: Traditional Testing in Online Learning

Peer reviewed
PDF on ERIC

Download full text

Necati Taskin – International Journal of Technology in Education, 2025

This study examines the effect of item order (random, increasingly difficult, and decreasingly difficult) on student performance, test parameters, and student perceptions in multiple-choice tests administered in a paper-and-pencil format after online learning. In the research conducted using an explanatory sequential mixed methods design,…

Descriptors: Test Items, Difficulty Level, Online Courses, College Freshmen

Previous Page | Next Page »

Pages: 1 | 2 | 3

Physical Review Physics…	3
International Journal of…	2
SAGE Open	2
Anatomical Sciences Education	1
Chemistry Education Research…	1
ETS Research Report Series	1
Educational Process:…	1
Educational Psychology Review	1
Educational and Psychological…	1
English Language Teaching…	1
English Teaching	1
Field Methods	1
Infants and Young Children	1
Instructional Science: An…	1
International Educational…	1
International Electronic…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Applied Research…	1
Journal of Autism and…	1
Journal of Biological…	1
Journal of Education and…	1
More ▼

Maik Walpuski	2
A. Lichtenberger	1
A. Vaterlaus	1
Agus Widyantoro	1
Ahmed Al - Badri	1
Aiman Mohammad Freihat	1
Alexander Kah	1
Ali Orhan	1
Amelia Pearson	1
Anatri Desstya	1
Anggun Resdasari Prasetyo	1
Apichat Khamboonruang	1
Arandha May Rachmawati	1
Bambang Sumintono	1
Benjamin W. Domingue	1
Carolin Eitemüller	1
Changiz Mohiyeddini	1
Charlotte Broadhurst	1
Chieh-Yu Chen	1
Ching-I Chen	1
Cornelia E. Neuert	1
Cui-Yan Hoe	1
Dina Kamber Hamzic	1
Dwayne Lieck	1
E.?B. Merki	1
More ▼

Difficulty Level	37
Test Items	37
Foreign Countries	21
Test Reliability	14
Test Validity	14
Item Analysis	10
Item Response Theory	10
Multiple Choice Tests	10
Psychometrics	9
Test Construction	9
English (Second Language)	8
Undergraduate Students	8
Language Tests	7
Science Tests	7
Cognitive Processes	6
Reading Tests	5
Second Language Learning	5
College Students	4
Language Usage	4
Questionnaires	4
Second Language Instruction	4
Accuracy	3
Achievement Tests	3
Artificial Intelligence	3
Comparative Analysis	3
More ▼