ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	30

Descriptor

Monte Carlo Methods	42
Item Response Theory	20
Test Items	17
Models	11
Markov Processes	10
Computation	9
Bayesian Statistics	8
Evaluation Methods	8
Test Bias	8
Accuracy	7
Achievement Tests	7
Comparative Analysis	6
Correlation	5
Difficulty Level	5
Goodness of Fit	5
Mathematical Models	5
Reaction Time	5
Sampling	5
Simulation	5
Computer Simulation	4
Error of Measurement	4
Foreign Countries	4
International Assessment	4
Item Analysis	4
Measurement	4
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	41
Reports - Research	32
Reports - Evaluative	7
Reports - Descriptive	2
Speeches/Meeting Papers	2

Education Level

Secondary Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	4
National Assessment of…	1
Program for the International…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 42 results Save | Export

A Unified Comparison of IRT-Based Effect Sizes for DIF Investigations

Peer reviewed

Direct link

Chalmers, R. Philip – Journal of Educational Measurement, 2023

Several marginal effect size (ES) statistics suitable for quantifying the magnitude of differential item functioning (DIF) have been proposed in the area of item response theory; for instance, the Differential Functioning of Items and Tests (DFIT) statistics, signed and unsigned item difference in the sample statistics (SIDS, UIDS, NSIDS, and…

Descriptors: Test Bias, Item Response Theory, Definitions, Monte Carlo Methods

Several Variations of Simple-Structure MIRT Equating

Peer reviewed

Direct link

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2023

The current study proposed several variants of simple-structure multidimensional item response theory equating procedures. Four distinct sets of data were used to demonstrate feasibility of proposed equating methods for two different equating designs: a random groups design and a common-item nonequivalent groups design. Findings indicated some…

Descriptors: Item Response Theory, Equated Scores, Monte Carlo Methods, Research Methodology

A New Bayesian Person-Fit Analysis Method Using Pivotal Discrepancy Measures

Peer reviewed

Direct link

Combs, Adam – Journal of Educational Measurement, 2023

A common method of checking person-fit in Bayesian item response theory (IRT) is the posterior-predictive (PP) method. In recent years, more powerful approaches have been proposed that are based on resampling methods using the popular L*[subscript z] statistic. There has also been proposed a new Bayesian model checking method based on pivotal…

Descriptors: Bayesian Statistics, Goodness of Fit, Evaluation Methods, Monte Carlo Methods

An Exponentially Weighted Moving Average Procedure for Detecting Back Random Responding Behavior

Peer reviewed

Direct link

He, Yinhong – Journal of Educational Measurement, 2023

Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…

Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods

A Recursion-Based Analytical Approach to Evaluate the Performance of MST

Peer reviewed

Direct link

Lim, Hwanggyu; Davey, Tim; Wells, Craig S. – Journal of Educational Measurement, 2021

This study proposed a recursion-based analytical approach to assess measurement precision of ability estimation and classification accuracy in multistage adaptive tests (MSTs). A simulation study was conducted to compare the proposed recursion-based analytical method with an analytical method proposed by Park, Kim, Chung, and Dodd and with the…

Descriptors: Adaptive Testing, Measurement, Accuracy, Classification

A Nonparametric Composite Group DIF Index for Focal Groups Stemming from Multicategorical Variables

Peer reviewed

Direct link

Corinne Huggins-Manley; Anthony W. Raborn; Peggy K. Jones; Ted Myers – Journal of Educational Measurement, 2024

The purpose of this study is to develop a nonparametric DIF method that (a) compares focal groups directly to the composite group that will be used to develop the reported test score scale, and (b) allows practitioners to explore for DIF related to focal groups stemming from multicategorical variables that constitute a small proportion of the…

Descriptors: Nonparametric Statistics, Test Bias, Scores, Statistical Significance

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

Incorporating Test-Taking Engagement into Multistage Adaptive Testing Design for Large-Scale Assessments

Peer reviewed

Direct link

Okan Bulut; Guher Gorgun; Hacer Karamese – Journal of Educational Measurement, 2025

The use of multistage adaptive testing (MST) has gradually increased in large-scale testing programs as MST achieves a balanced compromise between linear test design and item-level adaptive testing. MST works on the premise that each examinee gives their best effort when attempting the items, and their responses truly reflect what they know or can…

Descriptors: Response Style (Tests), Testing Problems, Testing Accommodations, Measurement

Standard Errors of Variance Components, Measurement Errors and Generalizability Coefficients for Crossed Designs

Peer reviewed

Direct link

Almehrizi, Rashid S. – Journal of Educational Measurement, 2021

Estimates of various variance components, universe score variance, measurement error variances, and generalizability coefficients, like all statistics, are subject to sampling variability, particularly in small samples. Such variability is quantified traditionally through estimated standard errors and/or confidence intervals. The paper derived new…

Descriptors: Error of Measurement, Statistics, Design, Generalizability Theory

Bayesian Extension of Biweight and Huber Weight for Robust Ability Estimation

Peer reviewed

Direct link

Maeda, Hotaka; Zhang, Bo – Journal of Educational Measurement, 2020

When a response pattern does not fit a selected measurement model, one may resort to robust ability estimation. Two popular robust methods are biweight and Huber weight. So far, research on these methods has been quite limited. This article proposes the maximum a posteriori biweight (BMAP) and Huber weight (HMAP) estimation methods. These methods…

Descriptors: Bayesian Statistics, Robustness (Statistics), Computation, Monte Carlo Methods

Two IRT Characteristic Curve Linking Methods Weighted by Information

Peer reviewed

Direct link

Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022

Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods

Linking via Pseudo-Equivalent Group Design: Methodological Considerations and an Application to the PISA and PIACC Assessments

Peer reviewed

Direct link

Pokropek, Artur; Borgonovi, Francesca – Journal of Educational Measurement, 2020

This article presents the pseudo-equivalent group approach and discusses how it can enhance the quality of linking in the presence of nonequivalent groups. The pseudo-equivalent group approach allows to achieve pseudo-equivalence using propensity score reweighting techniques. We use it to perform linking to establish scale concordance between two…

Descriptors: Foreign Countries, Secondary School Students, Achievement Tests, International Assessment

Multiple-Group Joint Modeling of Item Responses, Response Times, and Action Counts with the Conway-Maxwell-Poisson Distribution

Peer reviewed

Direct link

Qiao, Xin; Jiao, Hong; He, Qiwei – Journal of Educational Measurement, 2023

Multiple group modeling is one of the methods to address the measurement noninvariance issue. Traditional studies on multiple group modeling have mainly focused on item responses. In computer-based assessments, joint modeling of response times and action counts with item responses helps estimate the latent speed and action levels in addition to…

Descriptors: Multivariate Analysis, Models, Item Response Theory, Statistical Distributions

Explanatory Cognitive Diagnostic Modeling Incorporating Response Times

Peer reviewed

Direct link

Qiao, Xin; Jiao, Hong – Journal of Educational Measurement, 2021

This study proposes explanatory cognitive diagnostic model (CDM) jointly incorporating responses and response times (RTs) with the inclusion of item covariates related to both item responses and RTs. The joint modeling of item responses and RTs intends to provide more information for cognitive diagnosis while item covariates can be used to predict…

Descriptors: Cognitive Measurement, Models, Reaction Time, Test Items

On-the-Fly Constraint-Controlled Assembly Methods for Multistage Adaptive Testing for Cognitive Diagnosis

Peer reviewed

Direct link

Liu, Shuchang; Cai, Yan; Tu, Dongbo – Journal of Educational Measurement, 2018

This study applied the mode of on-the-fly assembled multistage adaptive testing to cognitive diagnosis (CD-OMST). Several and several module assembly methods for CD-OMST were proposed and compared in terms of measurement precision, test security, and constrain management. The module assembly methods in the study included the maximum priority index…

Descriptors: Adaptive Testing, Monte Carlo Methods, Computer Security, Clinical Diagnosis

Previous Page | Next Page »

Pages: 1 | 2 | 3

Jiao, Hong	4
Lee, Won-Chan	3
Wang, Wen-Chung	3
Ankenmann, Robert D.	2
Chalmers, R. Philip	2
Qiao, Xin	2
Wang, Shudong	2
Wilson, Mark	2
Ackerman, Terry A.	1
Allen, Nancy L.	1
Almehrizi, Rashid S.	1
Ames, Allison	1
Anthony W. Raborn	1
Barcikowski, Robert S.	1
Borgonovi, Francesca	1
Briggs, Derek C.	1
Cai, Yan	1
Chang, Hua-Hua	1
Cohen, Jon	1
Combs, Adam	1
Corinne Huggins-Manley	1
Davey, Tim	1
Donoghue, John R.	1
Dunbar, Stephen B.	1
More ▼