Showing 1 to 15 of 61 results
Peer reviewed
PDF on ERIC
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to assess the accuracy of estimating multiple-choice test item parameters under item response theory models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
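The study's accuracy indicators compare estimated item parameters against their true generating values. A minimal parameter-recovery check in that spirit (not the authors' procedure; the Rasch setup, sample sizes, and mean-absolute-error criterion below are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(7)
n_persons, n_items = 2000, 20
theta = rng.normal(0.0, 1.0, n_persons)      # true abilities
b_true = rng.uniform(-2.0, 2.0, n_items)     # true item difficulties

# Simulate dichotomous responses under the Rasch model.
p = 1.0 / (1.0 + np.exp(-(theta[:, None] - b_true[None, :])))
x = (rng.random(p.shape) < p).astype(float)

# Grid-search MLE for each item's difficulty, conditioning on the true
# abilities (a deliberate simplification to keep the recovery logic visible).
grid = np.linspace(-3.0, 3.0, 601)
p_grid = 1.0 / (1.0 + np.exp(-(theta[:, None] - grid[None, :])))
log_p, log_q = np.log(p_grid), np.log(1.0 - p_grid)
b_hat = np.array([grid[np.argmax(x[:, j] @ log_p + (1 - x[:, j]) @ log_q)]
                  for j in range(n_items)])

# Accuracy indicator: absolute difference between estimated and actual values.
print("mean |b_hat - b_true| =", np.mean(np.abs(b_hat - b_true)))
```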
Peer reviewed
Direct link
Yue Liu; Zhen Li; Hongyun Liu; Xiaofeng You – Applied Measurement in Education, 2024
Low test-taking effort of examinees has been considered a source of construct-irrelevant variance in item response modeling, with serious consequences for parameter estimation. This study aims to investigate how non-effortful response (NER) influences the estimation of item and person parameters in item-pool scale linking (IPSL) and whether…
Descriptors: Item Response Theory, Computation, Simulation, Responses
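One common operationalization of non-effortful responding (not necessarily the one used in this study) flags responses whose times fall below an item-specific threshold, such as a fixed fraction of the item's median response time. A sketch with invented data and an illustrative cutoff:

```python
import numpy as np

rng = np.random.default_rng(1)
# Hypothetical response-time matrix (seconds), persons x items.
rt = rng.lognormal(mean=3.0, sigma=0.5, size=(500, 30))
# Inject rapid-guessing times for a subset of persons.
rt[:50, :] = rng.uniform(0.5, 2.0, size=(50, 30))

# Threshold rule: a response counts as non-effortful if its time is
# below 10% of the item's median time (an illustrative cutoff).
thresholds = 0.10 * np.median(rt, axis=0)
flagged = rt < thresholds
print("share of responses flagged:", flagged.mean())
```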
Peer reviewed
Direct link
Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023
Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…
Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items
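Online calibration estimates a new item's parameters from examinees whose traits have already been provisionally estimated during the CAT session. A minimal sketch of that idea for one dichotomous Rasch item, holding the provisional θ estimates fixed, roughly in the spirit of Stocking's Method A (all numbers invented):

```python
import numpy as np

rng = np.random.default_rng(3)
theta_hat = rng.normal(0, 1, 400)    # provisional trait estimates from CAT
b_new = 0.8                          # true difficulty of the pretest item
p = 1 / (1 + np.exp(-(theta_hat - b_new)))
x = (rng.random(400) < p).astype(float)

# Newton-Raphson MLE for the new item's difficulty, theta_hat held fixed.
b = 0.0
for _ in range(25):
    p_b = 1 / (1 + np.exp(-(theta_hat - b)))
    step = np.sum(p_b - x) / np.sum(p_b * (1 - p_b))   # score / information
    b += step
    if abs(step) < 1e-8:
        break
print("estimated difficulty:", round(b, 3), "true:", b_new)
```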
Peer reviewed
Direct link
Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling researchers to address new questions that are not answerable by any single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…
Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory
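Creating commensurate measures across studies typically runs through linking separately calibrated scales via shared anchor items. A sketch of classical mean/sigma linking (one standard approach, not necessarily the authors'; the difficulty values are invented):

```python
import numpy as np

# Hypothetical anchor-item difficulties estimated in two independent studies.
b_study1 = np.array([-1.20, -0.45, 0.10, 0.62, 1.30])
b_study2 = np.array([-0.85, -0.10, 0.43, 0.98, 1.72])   # same items, other scale

# Mean/sigma linking: put study-1 estimates onto the study-2 metric.
A = np.std(b_study2) / np.std(b_study1)
B = np.mean(b_study2) - A * np.mean(b_study1)
b_study1_linked = A * b_study1 + B
print("slope A =", round(A, 3), "intercept B =", round(B, 3))
print("linked difficulties:", np.round(b_study1_linked, 2))
```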
Peer reviewed
Direct link
Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021
Clinical, medical, and health psychologists use difference scores obtained from pretest–posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorders, or addiction. Reliability of difference scores is important for interpreting observed…
Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
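For background (classical test theory, e.g., Lord and Novick; not this article's new result), the reliability of a difference score D = Y − X combines the component reliabilities, standard deviations, and the pretest–posttest correlation, and can be much lower than the reliability of either test:

```python
def difference_score_reliability(sd_x, sd_y, rel_x, rel_y, r_xy):
    """Classical reliability of D = Y - X (pretest X, posttest Y)."""
    num = sd_x**2 * rel_x + sd_y**2 * rel_y - 2 * sd_x * sd_y * r_xy
    den = sd_x**2 + sd_y**2 - 2 * sd_x * sd_y * r_xy
    return num / den

# Two equally reliable tests (0.85) with a substantial pretest-posttest
# correlation yield a markedly less reliable difference score:
print(difference_score_reliability(10, 10, 0.85, 0.85, 0.70))  # 0.50
```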
Peer reviewed
Direct link
Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Grantee Submission, 2024
Explanatory item response models (EIRMs) have been applied to investigate the effects of person covariates, item covariates, and their interactions in the fields of reading education and psycholinguistics. In practice, it is often assumed that the relationships between the covariates and the logit transformation of item response probability are…
Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics
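An EIRM replaces free item parameters with a linear function of covariates on the logit scale, which is the linearity assumption the abstract refers to. A minimal LLTM-style sketch via ordinary logistic regression, with an invented item covariate and abilities entered as a known offset for simplicity (real EIRM software would also model person random effects):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
n_persons, n_items = 300, 12
theta = rng.normal(0, 1, n_persons)
word_count = rng.integers(5, 30, n_items)        # hypothetical item covariate
b = 0.08 * (word_count - word_count.mean())      # difficulty built from covariate

# Long format: one row per person-item response.
P, I = np.meshgrid(np.arange(n_persons), np.arange(n_items), indexing="ij")
eta = theta[P] - b[I]
y = (rng.random(eta.shape) < 1 / (1 + np.exp(-eta))).astype(int).ravel()

# Logit of response probability modeled as linear in the item covariate.
X = sm.add_constant(word_count[I].ravel().astype(float))
fit = sm.GLM(y, X, family=sm.families.Binomial(),
             offset=theta[P].ravel()).fit()
print(fit.params)   # covariate slope should be near -0.08
```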
Peer reviewed
Direct link
Sun-Joo Cho; Amanda Goodwin; Matthew Naveiras; Paul De Boeck – Journal of Educational Measurement, 2024
Explanatory item response models (EIRMs) have been applied to investigate the effects of person covariates, item covariates, and their interactions in the fields of reading education and psycholinguistics. In practice, it is often assumed that the relationships between the covariates and the logit transformation of item response probability are…
Descriptors: Item Response Theory, Test Items, Models, Maximum Likelihood Statistics
Peer reviewed
Direct link
Chen, Chia-Wen; Wang, Wen-Chung; Chiu, Ming Ming; Ro, Sage – Journal of Educational Measurement, 2020
The use of computerized adaptive testing algorithms for ranking items (e.g., college preferences, career choices) involves two major challenges: unacceptably high computation times (selecting from a large item pool with many dimensions) and biased results (enhanced preferences or intensified examinee responses because of repeated statements across…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
Peer reviewed
Direct link
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
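In the signal-detection framing, discrimination and difficulty can be expressed through the rates of correct response among examinees who do and do not "know" the item. The basic SDT machinery looks as follows (illustrative rates only; DeCarlo's actual model is a latent-class SDT model, so this is just the underlying computation):

```python
from scipy.stats import norm

# Proportion correct among examinees who know the item ("hit rate") and
# among those who do not ("false-alarm rate", i.e., successful guessing).
hit_rate, fa_rate = 0.92, 0.35

d_prime = norm.ppf(hit_rate) - norm.ppf(fa_rate)             # discrimination
criterion = -0.5 * (norm.ppf(hit_rate) + norm.ppf(fa_rate))  # difficulty-like
print(f"d' = {d_prime:.2f}, c = {criterion:.2f}")
```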
Derek Sauder – ProQuest LLC, 2020
The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…
Descriptors: Item Response Theory, Sample Size, Computation, Test Length
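For orientation on why small samples hurt: the asymptotic standard error of a Rasch difficulty estimate is the inverse square root of the item's expected information, which grows with sample size. A back-of-envelope sketch (illustrative, not from the dissertation):

```python
import numpy as np

def se_difficulty(n, b, theta_sd=1.0, rng=np.random.default_rng(0)):
    """Approximate SE of a Rasch difficulty MLE via expected information."""
    theta = rng.normal(0.0, theta_sd, n)
    p = 1 / (1 + np.exp(-(theta - b)))
    return 1.0 / np.sqrt(np.sum(p * (1 - p)))

for n in (50, 200, 1000):
    print(n, "examinees -> SE about", round(se_difficulty(n, b=0.0), 3))
```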
Peer reviewed
Direct link
Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020
Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait (θ) estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…
Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics
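MST data are incomplete because each examinee sees only the modules they were routed to; since routing depends on observed responses, the missingness is ignorable for likelihood inference, and one standard device is to evaluate the likelihood over administered items only. A sketch with a NaN mask (illustrative, not the authors' estimator):

```python
import numpy as np

def rasch_loglik(theta, b, resp):
    """Log-likelihood over administered items only; resp uses NaN for
    items the examinee was never routed to."""
    seen = ~np.isnan(resp)
    p = 1 / (1 + np.exp(-(theta - b[seen])))
    x = resp[seen]
    return np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0, 1.5])
resp = np.array([1, 1, 0, np.nan, np.nan, np.nan])  # routed to easy module only
print(rasch_loglik(0.2, b, resp))
```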
Peer reviewed
Direct link
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2018
This study examined the use of Bayesian analysis methods for the estimation of item parameters in a two-parameter logistic item response theory model. Using simulated data under various design conditions with both informative and non-informative priors, the parameter recovery of Bayesian analysis methods was examined. Overall results showed that…
Descriptors: Bayesian Statistics, Item Response Theory, Probability, Difficulty Level
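A toy illustration of the prior's role in Bayesian item-parameter estimation: the posterior for a single 2PL difficulty on a grid, under a diffuse versus an informative normal prior (all settings invented for the sketch; the study itself covers a much broader design):

```python
import numpy as np

rng = np.random.default_rng(11)
theta = rng.normal(0, 1, 60)                 # small sample of known abilities
a, b_true = 1.2, 0.5                         # 2PL discrimination and difficulty
p = 1 / (1 + np.exp(-a * (theta - b_true)))
x = (rng.random(60) < p).astype(float)

grid = np.linspace(-3, 3, 1201)
pg = 1 / (1 + np.exp(-a * (theta[:, None] - grid[None, :])))
loglik = x @ np.log(pg) + (1 - x) @ np.log(1 - pg)

for name, sd in [("diffuse N(0,10)", 10.0), ("informative N(0,0.5)", 0.5)]:
    logpost = loglik - 0.5 * (grid / sd) ** 2      # log prior added
    post = np.exp(logpost - logpost.max())
    w = post / post.sum()                          # normalize on the grid
    print(name, "-> posterior mean b =", round(float(grid @ w), 3))
```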
Peer reviewed
Direct link
Choe, Edison M.; Kern, Justin L.; Chang, Hua-Hua – Journal of Educational and Behavioral Statistics, 2018
Despite common operationalization, measurement efficiency of computerized adaptive testing should be assessed not only in terms of the number of items administered but also in terms of the time it takes to complete the test. To this end, a recent study introduced a novel item selection criterion that maximizes Fisher information per unit of expected response…
Descriptors: Computer Assisted Testing, Reaction Time, Item Response Theory, Test Items
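The criterion described above picks, at the current θ estimate, the item maximizing Fisher information divided by expected response time rather than information alone. A compact 2PL sketch (item pool and time model invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(2)
n_pool = 200
a = rng.lognormal(0, 0.3, n_pool)            # discriminations
b = rng.normal(0, 1, n_pool)                 # difficulties
exp_time = rng.uniform(20, 120, n_pool)      # expected response time (seconds)

def fisher_info(theta):
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return a**2 * p * (1 - p)                # 2PL item information

theta_hat = 0.4
best_mi = np.argmax(fisher_info(theta_hat))               # max information
best_mipt = np.argmax(fisher_info(theta_hat) / exp_time)  # info per second
print("MI pick:", best_mi, "| MI-per-time pick:", best_mipt)
```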
Peer reviewed
Direct link
Henson, Robert; DiBello, Lou; Stout, Bill – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs, also known as cognitive diagnosis models) hold the promise of providing detailed classroom information about the skills a student has or has not mastered. Specifically, DCMs are special cases of constrained latent class models where classes are defined based on mastery/nonmastery of a set of attributes (or…
Descriptors: Classification, Diagnostic Tests, Models, Mastery Learning
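The DINA model is perhaps the simplest member of this family: an item's Q-matrix row lists its required attributes, and the response probability depends only on whether the examinee has mastered all of them, up to slip and guess parameters. A minimal sketch (Q-matrix row and parameters invented):

```python
import numpy as np

q_row = np.array([1, 0, 1])       # item requires attributes 1 and 3
slip, guess = 0.10, 0.20          # hypothetical item parameters

def p_correct(alpha, q=q_row, s=slip, g=guess):
    """DINA: correct with prob 1-s iff all required attributes are mastered,
    otherwise with guessing prob g."""
    eta = np.all(alpha[q == 1] == 1)
    return (1 - s) if eta else g

print(p_correct(np.array([1, 0, 1])))   # master of both attributes: 0.90
print(p_correct(np.array([1, 1, 0])))   # lacks attribute 3: 0.20
```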
Peer reviewed
Direct link
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
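Where closed-form standard errors for a reliability estimate are unavailable, a bootstrap offers a straightforward empirical fallback, sketched here for coefficient alpha on invented data (the article itself derives analytical expressions for IRT-based coefficients, which this does not reproduce):

```python
import numpy as np

def cronbach_alpha(data):
    """Coefficient alpha for a persons x items score matrix."""
    k = data.shape[1]
    item_vars = data.var(axis=0, ddof=1).sum()
    total_var = data.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars / total_var)

rng = np.random.default_rng(8)
true_scores = rng.normal(0, 1, (400, 1))
data = true_scores + rng.normal(0, 1, (400, 8))   # 8 roughly parallel items

# Resample persons with replacement; the SD of the replicates estimates
# the standard error of the reliability coefficient.
boot = [cronbach_alpha(data[rng.integers(0, 400, 400)]) for _ in range(2000)]
print("alpha =", round(cronbach_alpha(data), 3),
      "| bootstrap SE =", round(np.std(boot), 4))
```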