Publication Date
| Date Range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 13 |
| Since 2022 (last 5 years) | 69 |
| Since 2017 (last 10 years) | 225 |
| Since 2007 (last 20 years) | 463 |
Descriptor
| Descriptor | Count |
| --- | --- |
| Difficulty Level | 584 |
| Item Response Theory | 584 |
| Test Items | 460 |
| Foreign Countries | 162 |
| Test Construction | 110 |
| Psychometrics | 98 |
| Models | 97 |
| Item Analysis | 91 |
| Comparative Analysis | 89 |
| Test Reliability | 80 |
| Multiple Choice Tests | 79 |
Author
| Author | Count |
| --- | --- |
| Tindal, Gerald | 16 |
| Alonzo, Julie | 12 |
| Anderson, Daniel | 9 |
| Park, Bitnara Jasmine | 8 |
| Paek, Insu | 7 |
| Irvin, P. Shawn | 6 |
| Petscher, Yaacov | 6 |
| Saven, Jessica L. | 6 |
| Schoen, Robert C. | 6 |
| Bulut, Okan | 5 |
| DeBoer, George E. | 5 |
Audience
| Audience | Count |
| --- | --- |
| Practitioners | 1 |
Location
| Location | Count |
| --- | --- |
| Turkey | 18 |
| Germany | 14 |
| Indonesia | 14 |
| Taiwan | 9 |
| United States | 9 |
| Australia | 8 |
| Nigeria | 8 |
| Canada | 7 |
| Florida | 7 |
| Japan | 6 |
| South Africa | 6 |
Christina Glasauer; Martin K. Yeh; Lois Anne DeLong; Yu Yan; Yanyan Zhuang – Computer Science Education, 2025
Background and Context: Feedback on one's progress is essential to new programming language learners, particularly in out-of-classroom settings. Though many study materials offer assessment mechanisms, most neither examine the accuracy of the feedback they deliver nor provide evidence of its validity. Objective: We investigate the potential use of a…
Descriptors: Novices, Computer Science Education, Programming, Accuracy
Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025
This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…
Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level
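As context for the entry above: a cross-classified IRT model of the kind described treats both person and item effects as random. A minimal sketch, with notation assumed here rather than taken from the paper, regresses the random item difficulties on each item's target performance level:

```latex
\[
  \Pr(X_{pi} = 1 \mid \theta_p, b_i) = \operatorname{logit}^{-1}(\theta_p - b_i),
  \qquad
  \theta_p \sim N(0, \sigma_\theta^2),
  \qquad
  b_i \sim N(\beta_0 + \beta_1\,\mathrm{level}_i,\; \sigma_b^2),
\]
```

where level_i encodes the target performance level at which item i was written, linking item calibration to the embedded standard-setting judgments.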
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to assess the accuracy with which multiple-choice test item parameters are estimated under item response theory (IRT) models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
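A rough illustration of the accuracy indicator the abstract describes (the absolute difference between estimated and actual parameter values) can be produced by simulation. This sketch uses a Rasch model and a crude logit-of-proportion-correct difficulty estimate; the seed, sample sizes, and estimator are assumptions for illustration, not the study's method:

```python
import numpy as np

rng = np.random.default_rng(7)
n_persons, n_items = 5000, 20
theta = rng.normal(0, 1, n_persons)      # latent abilities
b_true = rng.normal(0, 1, n_items)       # true item difficulties

# Rasch model: P(correct) = logistic(theta - b)
p = 1 / (1 + np.exp(-(theta[:, None] - b_true[None, :])))
responses = rng.binomial(1, p)

# Crude difficulty estimate: logit of the proportion incorrect.
# This ignores the shrinkage from marginalizing over ability,
# so treat it as illustrative only.
p_correct = responses.mean(axis=0)
b_hat = np.log((1 - p_correct) / p_correct)

# Accuracy indicator: mean absolute difference between
# estimated and actual parameter values.
print("mean |b_hat - b_true|:", np.abs(b_hat - b_true).mean())
```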
Zenger, Tim; Bitzenbauer, Philipp – Science Education International, 2022
This article reports on the development and piloting of a German version of a concept test to assess students' conceptual knowledge of density. The concept test was administered in paper-and-pencil format to 222 German secondary school students as a post-test after instruction in all relevant concepts of density. We provide a psychometric…
Descriptors: Foreign Countries, Secondary School Students, Concept Formation, Psychometrics
Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022
This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The simulation study manipulated the mixture IRT model (Rasch, 2PL, 3PL), sample size (600, 1,000), number of items (10, 30), and number of latent…
Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages
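For reference, the three models the entry above manipulates are nested. The 3PL item characteristic curve is

```latex
\[
  P_i(\theta) = c_i + (1 - c_i)\,\frac{1}{1 + e^{-a_i(\theta - b_i)}},
\]
```

where a_i is the discrimination, b_i the difficulty, and c_i the pseudo-guessing lower asymptote; fixing c_i = 0 gives the 2PL, and additionally fixing a_i = 1 gives the Rasch model.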
Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024
Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…
Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys
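The assumption that successive categories reflect increasing levels of the latent variable is commonly formalized with an ordered-category IRT model such as Samejima's graded response model; as a sketch (notation mine, not the authors'):

```latex
\[
  \Pr(X_i \ge k \mid \theta) = \frac{1}{1 + e^{-a_i(\theta - b_{ik})}},
  \qquad b_{i1} < b_{i2} < \cdots < b_{i,K-1},
\]
```

with category probabilities obtained by differencing, \(\Pr(X_i = k) = \Pr(X_i \ge k) - \Pr(X_i \ge k+1)\). Insufficient effort responding undermines exactly the ordering this model assumes.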
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows students' ability to be estimated from any set of items. IRT also allows the difficulty and discrimination of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
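The claim that IRT lets ability be estimated from any set of items can be made concrete with a small maximum-likelihood sketch under the 2PL model; all item parameters and responses below are hypothetical:

```python
import numpy as np
from scipy.optimize import minimize_scalar

# Hypothetical item parameters (a = discrimination, b = difficulty)
a = np.array([1.2, 0.8, 1.5, 1.0])
b = np.array([-0.5, 0.0, 0.7, 1.2])
x = np.array([1, 1, 0, 1])  # one student's scored responses

def neg_log_lik(theta):
    # 2PL response probabilities for a candidate ability value
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

theta_hat = minimize_scalar(neg_log_lik, bounds=(-4, 4), method="bounded").x
print(f"ML ability estimate: {theta_hat:.2f}")
```

Any subset of calibrated items yields an estimate on the same latent scale, which is what distinguishes IRT from total-score approaches.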
Hauenstein, Clifford E.; Embretson, Susan E. – Journal of Cognitive Education and Psychology, 2020
The Concept Formation subtest of the Woodcock Johnson Tests of Cognitive Abilities represents a dynamic test due to continual provision of feedback from examiner to examinee. Yet, the original scoring protocol for the test largely ignores this dynamic structure. The current analysis applies a dynamic adaptation of an explanatory item response…
Descriptors: Test Items, Difficulty Level, Cognitive Tests, Cognitive Ability
Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022
The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…
Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level
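A quick empirical version of the question the entry above studies (how item difficulty relates to item discrimination under classical test theory) can be simulated; everything here is an assumed toy setup, not the paper's design:

```python
import numpy as np

rng = np.random.default_rng(11)
n, k = 2000, 30
theta = rng.normal(size=n)
a = rng.lognormal(0, 0.3, k)   # generating discriminations
b = rng.normal(0, 1, k)        # generating difficulties
resp = rng.binomial(1, 1 / (1 + np.exp(-a * (theta[:, None] - b))))

total = resp.sum(axis=1)
p_vals = resp.mean(axis=0)     # CTT difficulty (p-values)
# CTT discrimination: corrected item-total correlation
r_pb = np.array([np.corrcoef(resp[:, j], total - resp[:, j])[0, 1]
                 for j in range(k)])
print("corr(p-value, item-total r):", np.corrcoef(p_vals, r_pb)[0, 1])
```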
Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025
This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…
Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests
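Of the classical reliability measures the entry above compares, Cronbach's alpha is the most common and is simple to compute from a persons-by-items score matrix; the data below are made up for illustration:

```python
import numpy as np

def cronbach_alpha(scores):
    """scores: persons x items matrix of item scores."""
    k = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1).sum()
    total_var = scores.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars / total_var)

rng = np.random.default_rng(3)
ability = rng.normal(size=(200, 1))
# items share ability variance, so alpha comes out well above 0
scores = (ability + rng.normal(size=(200, 10)) > 0).astype(int)
print(f"alpha = {cronbach_alpha(scores):.2f}")
```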
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Educational and Psychological Measurement, 2022
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Artificial Intelligence, Item Response Theory, Item Analysis
Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023
Using multiple versions of an assessment has the potential to introduce item environment effects. These effects result in version-dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and the resulting implications are important for all levels of assessment where multiple forms of an assessment…
Descriptors: Item Response Theory, Test Items, Test Format, Science Tests
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
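Effort-moderated scoring, the approach whose robustness the entry above examines, removes responses flagged as rapid guesses (response time below an item's threshold) from the scoring model. A minimal sketch under a 2PL likelihood, with all thresholds and parameters assumed for illustration:

```python
import numpy as np
from scipy.optimize import minimize_scalar

def em_ability(x, rt, a, b, rt_threshold):
    """Effort-moderated scoring sketch: responses faster than the
    response-time threshold are treated as rapid guesses and excluded
    from the 2PL likelihood. Assumes at least one effortful response."""
    keep = rt >= rt_threshold
    def nll(theta):
        p = 1 / (1 + np.exp(-a[keep] * (theta - b[keep])))
        return -np.sum(x[keep] * np.log(p) + (1 - x[keep]) * np.log(1 - p))
    return minimize_scalar(nll, bounds=(-4, 4), method="bounded").x

a = np.array([1.0, 1.2, 0.9, 1.4])
b = np.array([-0.3, 0.2, 0.8, 1.1])
x = np.array([1, 1, 1, 0])
rt = np.array([14.0, 2.1, 11.5, 9.8])  # response times in seconds
print(em_ability(x, rt, a, b, rt_threshold=3.0))
```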
Spratto, Elisabeth M.; Leventhal, Brian C.; Bandalos, Deborah L. – Educational and Psychological Measurement, 2021
In this study, we examined the results and interpretations produced from two different IRTree models--one using paths consisting of only dichotomous decisions, and one using paths consisting of both dichotomous and polytomous decisions. We used data from two versions of an impulsivity measure. In the first version, all the response options had…
Descriptors: Comparative Analysis, Item Response Theory, Decision Making, Data Analysis
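An IRTree model decomposes each observed rating into a path of node decisions, each scored as a pseudo-item. One common decomposition of a 5-point Likert response into dichotomous nodes (this particular tree is an illustrative assumption, not the authors' specification) is sketched below:

```python
import numpy as np

def irtree_pseudo_items(resp):
    """Map a 5-point response to (midpoint, direction, extremity)
    pseudo-items; np.nan marks nodes not reached on this path."""
    if resp == 3:
        return (1, np.nan, np.nan)        # midpoint chosen; path ends
    direction = 1 if resp > 3 else 0      # agree vs. disagree side
    extreme = 1 if resp in (1, 5) else 0  # endpoint vs. adjacent category
    return (0, direction, extreme)

print([irtree_pseudo_items(r) for r in (1, 2, 3, 4, 5)])
```

Each pseudo-item column is then fit with its own IRT model; in a mixed tree, a polytomous node would replace a pair of dichotomous decisions with a single graded one.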

