ERIC - Search Results

Publication Date

In 2026	0
Since 2025	8
Since 2022 (last 5 years)	31
Since 2017 (last 10 years)	79
Since 2007 (last 20 years)	136

Descriptor

Test Items	226
Test Length	226
Item Response Theory	90
Sample Size	66
Test Construction	66
Computer Assisted Testing	54
Adaptive Testing	52
Simulation	51
Test Reliability	44
Error of Measurement	41
Comparative Analysis	40
Difficulty Level	40
Accuracy	38
Item Analysis	37
Test Format	37
Test Validity	32
Correlation	30
Computation	29
Statistical Analysis	29
Test Bias	29
Monte Carlo Methods	28
Models	27
Scores	27
Item Banks	26
Goodness of Fit	23
More ▼

Publication Type

Reports - Research	155
Journal Articles	138
Reports - Evaluative	41
Speeches/Meeting Papers	32
Dissertations/Theses -…	19
Reports - Descriptive	7
Numerical/Quantitative Data	6
Guides - Non-Classroom	4
Tests/Questionnaires	3
Information Analyses	2
Opinion Papers	2
Historical Materials	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	14
Postsecondary Education	13
Secondary Education	8
Elementary Education	6
Elementary Secondary Education	6
High Schools	4
Early Childhood Education	3
Grade 3	3
Middle Schools	3
Grade 6	2
Intermediate Grades	2
Primary Education	2
Grade 11	1
Grade 12	1
Junior High Schools	1
Preschool Education	1
More ▼

Audience

Researchers	9
Administrators	1
Community	1
Practitioners	1

Location

Turkey	2
Alabama	1
Asia	1
Australia	1
Germany	1
Illinois (Chicago)	1
Indiana	1
Iran	1
Israel	1
Japan	1
Netherlands	1
New Jersey	1
Peru	1
South Korea	1
Taiwan	1
Ukraine	1
More ▼

Laws, Policies, & Programs

Job Training Partnership Act…	1
Race to the Top	1

What Works Clearinghouse Rating

Test Items X

Showing 16 to 30 of 226 results Save | Export

Evaluation of Factors Affecting the Performance of the "S - X[superscript 2]" Item-Fit Index

Peer reviewed

Direct link

Kim, Hyung Jin; Lee, Won-Chan – Journal of Educational Measurement, 2022

Orlando and Thissen (2000) introduced the "S - X[superscript 2]" item-fit index for testing goodness-of-fit with dichotomous item response theory (IRT) models. This study considers and evaluates an alternative approach for computing "S - X[superscript 2]" values and other factors associated with collapsing tables of observed…

Descriptors: Goodness of Fit, Test Items, Item Response Theory, Computation

A Comparison of the Efficacies of Differential Item Functioning Detection Methods

Peer reviewed
PDF on ERIC

Download full text

Basman, Munevver – International Journal of Assessment Tools in Education, 2023

To ensure the validity of the tests is to check that all items have similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…

Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory

The Enhanced ACT Linking Study Report. ACT Research. Research Paper. R2515

Download full text

Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025

Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…

Descriptors: College Entrance Examinations, Testing, Change, Scoring

Real-Life Applications of Competence-Based Test Development to the Construction, Improvement, and Shortening of Tests

Peer reviewed

Direct link

Pasquale Anselmi; Jürgen Heller; Luca Stefanutti; Egidio Robusto; Giulia Barillari – Education and Information Technologies, 2025

Competence-based test development (CbTD) is a novel method for constructing tests that are as informative as possible about the competence state (the set of skills an individual masters) underlying the item responses. If desired, the tests can also be minimal, meaning that no item can be eliminated without reducing their informativeness. To…

Descriptors: Competency Based Education, Test Construction, Test Length, Usability

Examining the Impact of Violations of Local Item Independence Assumption on Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Mehmet Fatih Doguyurt; Seref Tan – International Journal of Assessment Tools in Education, 2025

This study investigates the impact of violating the local item independence assumption by loading certain items onto a second dimension on test equating errors in unidimensional and dichotomous tests. The research was designed as a simulation study, using data generated based on the PISA 2018 mathematics exam. Analyses were conducted under 36…

Descriptors: Equated Scores, Test Items, Mathematics Tests, International Assessment

The Effect of Ratio of Items Indicating Differential Item Functioning on Computer Adaptive and Multi-Stage Tests

Peer reviewed
PDF on ERIC

Download full text

Erdem-Kara, Basak; Dogan, Nuri – International Journal of Assessment Tools in Education, 2022

Recently, adaptive test approaches have become a viable alternative to traditional fixed-item tests. The main advantage of adaptive tests is that they reach desired measurement precision with fewer items. However, fewer items mean that each item has a more significant effect on ability estimation and therefore those tests are open to more…

Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Test Construction

Differential Performance of Computerized Adaptive Testing in Students with and without Disabilities -- A Simulation Study

Peer reviewed

Direct link

Nikola Ebenbeck; Markus Gebhardt – Journal of Special Education Technology, 2024

Technologies that enable individualization for students have significant potential in special education. Computerized Adaptive Testing (CAT) refers to digital assessments that automatically adjust their difficulty level based on students' abilities, allowing for personalized, efficient, and accurate measurement. This article examines whether CAT…

Descriptors: Computer Assisted Testing, Students with Disabilities, Special Education, Grade 3

Evaluation of the Goodness-of-Fit Index M[subscript ord] in Polytomous DCMS with Hierarchical Attribute Structures

Direct link

Haimiao Yuan – ProQuest LLC, 2022

The application of diagnostic classification models (DCMs) in the field of educational measurement is getting more attention in recent years. To make a valid inference from the model, it is important to ensure that the model fits the data. The purpose of the present study was to investigate the performance of the limited information…

Descriptors: Goodness of Fit, Educational Assessment, Educational Diagnosis, Models

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Direct link

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

IRT Models for Learning with Item-Specific Learning Parameters

Peer reviewed

Direct link

Yu, Albert; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2023

We propose a new item response theory growth model with item-specific learning parameters, or ISLP, and two variations of this model. In the ISLP model, either items or blocks of items have their own learning parameters. This model may be used to improve the efficiency of learning in a formative assessment. We show ways that the ISLP model's…

Descriptors: Item Response Theory, Learning, Markov Processes, Monte Carlo Methods

The Recovery of Correlation between Latent Abilities Using Compensatory and Noncompensatory Multidimensional IRT Models

Peer reviewed

Direct link

Fu, Yanyan; Strachan, Tyler; Ip, Edward H.; Willse, John T.; Chen, Shyh-Huei; Ackerman, Terry – International Journal of Testing, 2020

This research examined correlation estimates between latent abilities when using the two-dimensional and three-dimensional compensatory and noncompensatory item response theory models. Simulation study results showed that the recovery of the latent correlation was best when the test contained 100% of simple structure items for all models and…

Descriptors: Item Response Theory, Models, Test Items, Simulation

Performance of the S-X[superscript 2] Statistic for the Multidimensional Graded Response Model

Peer reviewed

Direct link

Su, Shiyang; Wang, Chun; Weiss, David J. – Educational and Psychological Measurement, 2021

S-X[superscript 2] is a popular item fit index that is available in commercial software packages such as "flex"MIRT. However, no research has systematically examined the performance of S-X[superscript 2] for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was…

Descriptors: Statistics, Goodness of Fit, Test Items, Models

Can Auxiliary Information Improve Rasch Estimation at Small Sample Sizes?

Direct link

Derek Sauder – ProQuest LLC, 2020

The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…

Descriptors: Item Response Theory, Sample Size, Computation, Test Length

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Direct link

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level

Two IRT Characteristic Curve Linking Methods Weighted by Information

Peer reviewed

Direct link

Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022

Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 16

Educational and Psychological…	33
ProQuest LLC	19
Journal of Educational…	16
Applied Measurement in…	9
Applied Psychological…	9
ETS Research Report Series	9
International Journal of…	7
International Journal of…	7
Journal of Educational and…	5
Measurement:…	4
Journal of Psychoeducational…	3
Assessment & Evaluation in…	2
Education and Information…	2
Educational Sciences: Theory…	2
Eurasian Journal of…	2
Grantee Submission	2
Journal of Experimental…	2
Journal of Technology,…	2
Physical Review Physics…	2
ACT Education Corp.	1
AERA Online Paper Repository	1
Advanced Education	1
Anatomical Sciences Education	1
Asia Pacific Education Review	1
Assessment and Evaluation in…	1
More ▼

Wainer, Howard	6
Hambleton, Ronald K.	4
Wang, Wen-Chung	4
Berk, Ronald A.	3
Burton, Richard F.	3
Cohen, Allan S.	3
Huggins-Manley, Anne Corinne	3
Lee, Won-Chan	3
Lee, Yi-Hsuan	3
Pommerich, Mary	3
Reckase, Mark D.	3
Sijtsma, Klaas	3
Wang, Chun	3
Weiss, David J.	3
Zhang, Jinming	3
Bradshaw, Laine	2
Bulut, Okan	2
Chen, Shu-Ying	2
Cheng, Ying	2
Chernyshenko, Oleksandr S.	2
Cui, Ying	2
De Ayala, R. J.	2
Diao, Qi	2
Dogan, Nuri	2
More ▼

Program for International…	4
Test of English as a Foreign…	3
Trends in International…	3
SAT (College Admission Test)	2
ACT Assessment	1
Advanced Placement…	1
Armed Forces Qualification…	1
COMPASS (Computer Assisted…	1
Comprehensive Tests of Basic…	1
Force Concept Inventory	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Medical College Admission Test	1
National Longitudinal Study…	1
New Jersey College Basic…	1
Otis Lennon School Ability…	1
Raven Advanced Progressive…	1
School and College Ability…	1
Stanford Binet Intelligence…	1
Texas Assessment of Basic…	1
Texas Educational Assessment…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1
More ▼