ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	35

Descriptor

Computation	37
Simulation	37
Item Response Theory	17
Test Items	14
Correlation	9
Error of Measurement	9
Models	9
Statistical Analysis	9
Test Bias	9
Comparative Analysis	8
Maximum Likelihood Statistics	8
Monte Carlo Methods	8
Sample Size	6
Equations (Mathematics)	5
Goodness of Fit	5
Sampling	5
Test Reliability	5
Accuracy	4
Classification	4
Data Analysis	4
Evaluation Methods	4
Factor Analysis	4
Mathematics	4
Probability	4
Adaptive Testing	3
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	37
Reports - Research	25
Reports - Evaluative	12

Education Level

Junior High Schools	2
Middle Schools	2
Secondary Education	2
Early Childhood Education	1
Grade 9	1
High Schools	1
Higher Education	1
Postsecondary Education	1
Preschool Education	1

Audience

Location

China	1
Germany	1
Hong Kong	1
Taiwan	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	1
Law School Admission Test	1
Program for International…	1
Self Directed Search	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

KR20 and KR21 for Some Nondichotomous Data (It's Not Just Cronbach's Alpha)

Peer reviewed

Direct link

Foster, Robert C. – Educational and Psychological Measurement, 2021

This article presents some equivalent forms of the common Kuder-Richardson Formula 21 and 20 estimators for nondichotomous data belonging to certain other exponential families, such as Poisson count data, exponential data, or geometric counts of trials until failure. Using the generalized framework of Foster (2020), an equation for the reliability…

Descriptors: Test Reliability, Data, Computation, Mathematical Formulas

Non-Iterative Conditional Pairwise Estimation for the Rating Scale Model

Peer reviewed

Direct link

Elliott, Mark; Buttery, Paula – Educational and Psychological Measurement, 2022

We investigate two non-iterative estimation procedures for Rasch models, the pair-wise estimation procedure (PAIR) and the Eigenvector method (EVM), and identify theoretical issues with EVM for rating scale model (RSM) threshold estimation. We develop a new procedure to resolve these issues--the conditional pairwise adjacent thresholds procedure…

Descriptors: Item Response Theory, Rating Scales, Computation, Simulation

Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients

Peer reviewed

Direct link

Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018

In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…

Descriptors: Item Response Theory, Test Reliability, Test Items, Scores

Unidimensional IRT Item Parameter Estimates across Equivalent Test Forms with Confounding Specifications within Dimensions

Peer reviewed

Direct link

Matlock, Ki Lynn; Turner, Ronna – Educational and Psychological Measurement, 2016

When constructing multiple test forms, the number of items and the total test difficulty are often equivalent. Not all test developers match the number of items and/or average item difficulty within subcontent areas. In this simulation study, six test forms were constructed having an equal number of items and average item difficulty overall.…

Descriptors: Item Response Theory, Computation, Test Items, Difficulty Level

A Simulation Study on Methods of Correcting for the Effects of Extreme Response Style

Peer reviewed

Direct link

Wetzel, Eunike; Böhnke, Jan R.; Rose, Norman – Educational and Psychological Measurement, 2016

The impact of response styles such as extreme response style (ERS) on trait estimation has long been a matter of concern to researchers and practitioners. This simulation study investigated three methods that have been proposed for the correction of trait estimates for ERS effects: (a) mixed Rasch models, (b) multidimensional item response models,…

Descriptors: Response Style (Tests), Simulation, Methods, Computation

Partially Compensatory Multidimensional Item Response Theory Models: Two Alternate Model Forms

Peer reviewed

Direct link

DeMars, Christine E. – Educational and Psychological Measurement, 2016

Partially compensatory models may capture the cognitive skills needed to answer test items more realistically than compensatory models, but estimating the model parameters may be a challenge. Data were simulated to follow two different partially compensatory models, a model with an interaction term and a product model. The model parameters were…

Descriptors: Item Response Theory, Models, Thinking Skills, Test Items

A Cautionary Note on the Use of the Vale and Maurelli Method to Generate Multivariate, Nonnormal Data for Simulation Purposes

Peer reviewed

Direct link

Olvera Astivia, Oscar L.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2015

To further understand the properties of data-generation algorithms for multivariate, nonnormal data, two Monte Carlo simulation studies comparing the Vale and Maurelli method and the Headrick fifth-order polynomial method were implemented. Combinations of skewness and kurtosis found in four published articles were run and attention was…

Descriptors: Data, Simulation, Monte Carlo Methods, Comparative Analysis

Correcting Model Fit Criteria for Small Sample Latent Growth Models with Incomplete Data

Peer reviewed

Direct link

McNeish, Daniel; Harring, Jeffrey R. – Educational and Psychological Measurement, 2017

To date, small sample problems with latent growth models (LGMs) have not received the amount of attention in the literature as related mixed-effect models (MEMs). Although many models can be interchangeably framed as a LGM or a MEM, LGMs uniquely provide criteria to assess global data-model fit. However, previous studies have demonstrated poor…

Descriptors: Growth Models, Goodness of Fit, Error Correction, Sampling

Scale Reliability Evaluation with Heterogeneous Populations

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015

A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…

Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation

Assessing Spurious Interaction Effects in Structural Equation Modeling

Peer reviewed

Direct link

Harring, Jeffrey R.; Weiss, Brandi A.; Li, Ming – Educational and Psychological Measurement, 2015

Several studies have stressed the importance of simultaneously estimating interaction and quadratic effects in multiple regression analyses, even if theory only suggests an interaction effect should be present. Specifically, past studies suggested that failing to simultaneously include quadratic effects when testing for interaction effects could…

Descriptors: Structural Equation Models, Statistical Analysis, Monte Carlo Methods, Computation

Effects of Design Properties on Parameter Estimation in Large-Scale Assessments

Peer reviewed

Direct link

Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015

The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…

Descriptors: Measurement, Computation, Test Format, Test Items

Best Design for Multidimensional Computerized Adaptive Testing with the Bifactor Model

Peer reviewed

Direct link

Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2015

Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CATs. This study investigated the accuracy, fidelity, and efficiency of a fully multidimensional CAT algorithm…

Descriptors: Computer Assisted Testing, Adaptive Testing, Accuracy, Fidelity

The Langer-Improved Wald Test for DIF Testing with Multiple Groups: Evaluation and Comparison to Two-Group IRT

Peer reviewed

Direct link

Woods, Carol M.; Cai, Li; Wang, Mian – Educational and Psychological Measurement, 2013

Differential item functioning (DIF) occurs when the probability of responding in a particular category to an item differs for members of different groups who are matched on the construct being measured. The identification of DIF is important for valid measurement. This research evaluates an improved version of Lord's X[superscript 2] Wald test for…

Descriptors: Test Bias, Item Response Theory, Computation, Comparative Analysis

Numerical Differentiation Methods for Computing Error Covariance Matrices in Item Response Theory Modeling: An Evaluation and a New Proposal

Peer reviewed

Direct link

Tian, Wei; Cai, Li; Thissen, David; Xin, Tao – Educational and Psychological Measurement, 2013

In item response theory (IRT) modeling, the item parameter error covariance matrix plays a critical role in statistical inference procedures. When item parameters are estimated using the EM algorithm, the parameter error covariance matrix is not an automatic by-product of item calibration. Cai proposed the use of Supplemented EM algorithm for…

Descriptors: Item Response Theory, Computation, Matrices, Statistical Inference

Establishing Effect Size Guidelines for Interpreting the Results of Differential Bundle Functioning Analyses Using SIBTEST

Peer reviewed

Direct link

Walker, Cindy M.; Zhang, Bo; Banks, Kathleen; Cappaert, Kevin – Educational and Psychological Measurement, 2012

The purpose of this simulation study was to establish general effect size guidelines for interpreting the results of differential bundle functioning (DBF) analyses using simultaneous item bias test (SIBTEST). Three factors were manipulated: number of items in a bundle, test length, and magnitude of uniform differential item functioning (DIF)…

Descriptors: Test Bias, Test Length, Simulation, Guidelines

Previous Page | Next Page »

Pages: 1 | 2 | 3

Cai, Li	2
Harring, Jeffrey R.	2
Woods, Carol M.	2
Xin, Tao	2
Yang, Xiangdong	2
Zumbo, Bruno D.	2
Andersson, Björn	1
Banks, Kathleen	1
Bjornstad, Jan F.	1
Brennan, Robert L.	1
Buttery, Paula	1
Böhnke, Jan R.	1
Cappaert, Kevin	1
Carvajal, Jorge	1
Chan, Wai	1
Cui, Ying	1
Davison, Mark L.	1
DeMars, Christine E.	1
Ding, Cody S.	1
Dumenci, Levent	1
Elliott, Mark	1
Enders, Craig K.	1
Foster, Robert C.	1
Frey, Andreas	1
Gierl, Mark J.	1
More ▼