ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	11
Since 2017 (last 10 years)	23
Since 2007 (last 20 years)	38

Descriptor

Accuracy	38
Test Items	38
Test Length	38
Item Response Theory	18
Computer Assisted Testing	16
Adaptive Testing	13
Computation	13
Sample Size	13
Classification	10
Comparative Analysis	10
Correlation	9
Measurement	9
Monte Carlo Methods	8
Foreign Countries	7
Simulation	7
Test Construction	7
Item Banks	6
Reliability	6
Test Format	6
Ability	5
Bayesian Statistics	5
Error of Measurement	5
Goodness of Fit	5
Item Analysis	5
Scoring	5
Publication Type

Reports - Research	29
Journal Articles	27
Dissertations/Theses -…	8
Speeches/Meeting Papers	2
Numerical/Quantitative Data	1
Reports - Evaluative	1

Education Level

Higher Education	3
Postsecondary Education	3
Secondary Education	3
Early Childhood Education	2
Elementary Education	2
Elementary Secondary Education	1
Grade 3	1
High Schools	1
Junior High Schools	1
Middle Schools	1
Preschool Education	1
Primary Education	1
Location

Germany	1
Japan	1
Turkey	1
Ukraine	1

Assessments and Surveys

Program for International…	2
Force Concept Inventory	1
Trends in International…	1

Showing 1 to 15 of 38 results

The Effect of Polytomous Item Ratio on Ability Estimation in Multistage Tests

Peer reviewed
PDF on ERIC

Hasibe Yahsi Sari; Hulya Kelecioglu – International Journal of Assessment Tools in Education, 2025

The aim of the study is to examine the effect of polytomous item ratio on ability estimation in different conditions in multistage tests (MST) using mixed tests. The study is simulation-based research. In the PISA 2018 application, the ability parameters of the individuals and the item pool were created by using the item parameters estimated from…

Descriptors: Test Items, Test Format, Accuracy, Test Length

The Impact of Scoring Later on Mixed Format Adaptive Testing

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, and numbers and locations of polytomous items. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

A Simulation Study on the Performance of Different Reliability Estimation Methods

Peer reviewed

Edwards, Ashley A.; Joyner, Keanan J.; Schatschneider, Christopher – Educational and Psychological Measurement, 2021

The accuracy of certain internal consistency estimators has been questioned in recent years. The present study tests the accuracy of six reliability estimators (Cronbach's alpha, omega, omega hierarchical, Revelle's omega, and greatest lower bound) in 140 simulated conditions of unidimensional continuous data with uncorrelated errors with varying…

Descriptors: Reliability, Computation, Accuracy, Sample Size

Utilizing Response Time for Item Selection in On-the-Fly Multistage Adaptive Testing for PISA Assessment

Peer reviewed

Xiuxiu Tang; Yi Zheng; Tong Wu; Kit-Tai Hau; Hua-Hua Chang – Journal of Educational Measurement, 2025

Multistage adaptive testing (MST) has recently been adopted for international large-scale assessments such as the Programme for International Student Assessment (PISA). MST offers improved measurement efficiency over traditional nonadaptive tests and improved practical convenience over single-item-adaptive computerized adaptive testing (CAT). As a…

Descriptors: Reaction Time, Test Items, Achievement Tests, Foreign Countries

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Exploring Number of Response Categories in Factor Analysis: Implications for Sample Size

Peer reviewed
PDF on ERIC

Fatih Orçan – International Journal of Assessment Tools in Education, 2025

Factor analysis is a statistical method to explore the relationships among observed variables and identify latent structures. It is crucial in scale development and validity analysis. Key factors affecting the accuracy of factor analysis results include the type of data, sample size, and the number of response categories. While some studies…

Descriptors: Factor Analysis, Factor Structure, Item Response Theory, Sample Size

Real-Life Applications of Competence-Based Test Development to the Construction, Improvement, and Shortening of Tests

Peer reviewed

Pasquale Anselmi; Jürgen Heller; Luca Stefanutti; Egidio Robusto; Giulia Barillari – Education and Information Technologies, 2025

Competence-based test development (CbTD) is a novel method for constructing tests that are as informative as possible about the competence state (the set of skills an individual masters) underlying the item responses. If desired, the tests can also be minimal, meaning that no item can be eliminated without reducing their informativeness. To…

Descriptors: Competency Based Education, Test Construction, Test Length, Usability

Differential Performance of Computerized Adaptive Testing in Students with and without Disabilities -- A Simulation Study

Peer reviewed

Nikola Ebenbeck; Markus Gebhardt – Journal of Special Education Technology, 2024

Technologies that enable individualization for students have significant potential in special education. Computerized Adaptive Testing (CAT) refers to digital assessments that automatically adjust their difficulty level based on students' abilities, allowing for personalized, efficient, and accurate measurement. This article examines whether CAT…

Descriptors: Computer Assisted Testing, Students with Disabilities, Special Education, Grade 3

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

Two IRT Characteristic Curve Linking Methods Weighted by Information

Peer reviewed

Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022

Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods

Improving Test Security and Efficiency of Computerized Adaptive Testing for the Force Concept Inventory

Peer reviewed

Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2022

This paper presents improvements made to a computerized adaptive testing (CAT)-based version of the FCI (FCI-CAT) with regard to test security and test efficiency. First, we will discuss measures to enhance test security by controlling for item overexposure, decreasing the risk that respondents may (i) memorize the content of a pretest for use on…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Risk Management

A Special Case of Brennan's Index for Tests That Aim to Select a Limited Number of Students: A Monte Carlo Simulation Study

Peer reviewed

Arikan, Serkan; Aybek, Eren Can – Educational Measurement: Issues and Practice, 2022

Many scholars have compared various item discrimination indices in real or simulated data. Item discrimination indices, such as the item-total correlation, the item-rest correlation, and the IRT item discrimination parameter, provide information about individual differences among all participants. However, there are tests that aim to select a very limited number…

Descriptors: Monte Carlo Methods, Item Analysis, Correlation, Individual Differences

An Empirical Research on Identifiability and Q-Matrix Design for DINA Model

Peer reviewed
PDF on ERIC

Xu, Peng; Desmarais, Michel C. – International Educational Data Mining Society, 2018

In most contexts of student skills assessment, whether the test material is administered by the teacher or within a learning environment, there is a strong incentive to minimize the number of questions or exercises administered in order to get an accurate assessment. This minimization objective can be framed as a Q-matrix design problem: given a…

Descriptors: Test Items, Accuracy, Test Construction, Skills

How the Length and Characteristics of Routing Module Affect Ability Estimation in ca-MST?

Peer reviewed
PDF on ERIC

Öztürk, Nagihan Boztunç – Universal Journal of Educational Research, 2019

This study examines how the length and characteristics of the routing module in different panel designs affect measurement precision. Within the scope of the study, six different routing module lengths, nine different routing module characteristics, and two different panel designs are considered. At the end of the study, the effects of conditions on…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Test Format

The Effect of Person Misfit on Item Parameter Estimation and Classification Accuracy: A Simulation Study

Peer reviewed
PDF on ERIC

Mousavi, Amin; Cui, Ying – Education Sciences, 2020

Often, important decisions regarding accountability and placement of students in performance categories are made on the basis of test scores; therefore, it is important to evaluate the validity of the inferences derived from test results. One of the threats to the validity of such inferences is aberrant responding. Several…

Descriptors: Student Evaluation, Educational Testing, Psychological Testing, Item Response Theory

Source

Educational and Psychological…	10
ProQuest LLC	8
International Journal of…	3
Journal of Educational…	2
Advanced Education	1
Applied Measurement in…	1
Applied Psychological…	1
Education Sciences	1
Education and Information…	1
Educational Measurement:…	1
Educational Sciences: Theory…	1
Grantee Submission	1
International Educational…	1
Journal of Educational and…	1
Journal of Special Education…	1
Measurement:…	1
Pearson	1
Physical Review Physics…	1
Universal Journal of…	1
Author

Bradshaw, Laine	2
He, Wei	2
Huggins-Manley, Anne Corinne	2
Wang, Chun	2
Allan S. Cohen	1
Anil, Duygu	1
Arikan, Serkan	1
Aybek, Eren Can	1
Bao, Yu	1
Cheng, Ying	1
Chien, Yuehmei	1
Cui, Ying	1
David J. Weiss	1
Deng, Nina	1
Desmarais, Michel C.	1
Diao, Qi	1
Dodd, Barbara G.	1
Dogan, Nuri	1
Edwards, Ashley A.	1
Egidio Robusto	1
Fatih Orçan	1
Fu, Qiong	1
Galindo, Jennifer L.	1
Gawliczek, Piotr	1
Geisinger, Kurt F.	1