ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	19

Descriptor

Error Patterns	39
Simulation	13
Evaluation Methods	10
Statistical Analysis	9
Correlation	8
Test Items	8
Item Response Theory	7
Monte Carlo Methods	7
Probability	7
Classification	6
Models	6
Rating Scales	6
Comparative Analysis	5
Measurement Techniques	5
Sample Size	5
Test Bias	5
Test Reliability	5
Computation	4
Multiple Choice Tests	4
Sampling	4
Scores	4
Test Validity	4
Data Analysis	3
Multiple Regression Analysis	3
Research Methodology	3
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	28
Reports - Research	24
Reports - Evaluative	3
Reports - Descriptive	1

Education Level

Elementary Secondary Education

Audience

Location

Lebanon

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 39 results Save | Export

DIF Detection with Zero-Inflation under the Factor Mixture Modeling Framework

Peer reviewed

Direct link

Lee, Sooyong; Han, Suhwa; Choi, Seung W. – Educational and Psychological Measurement, 2022

Response data containing an excessive number of zeros are referred to as zero-inflated data. When differential item functioning (DIF) detection is of interest, zero-inflation can attenuate DIF effects in the total sample and lead to underdetection of DIF items. The current study presents a DIF detection procedure for response data with excess…

Descriptors: Test Bias, Monte Carlo Methods, Simulation, Models

Extended Multivariate Generalizability Theory with Complex Design Structures

Peer reviewed

Direct link

Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022

This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…

Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction

Hybrid Threshold-Based Sequential Procedures for Detecting Compromised Items in a Computerized Adaptive Testing Licensure Exam

Peer reviewed

Direct link

Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022

Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in a variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…

Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory

A Graphical Method for Displaying the Model Fit of Item Response Theory Trace Lines

Peer reviewed

Direct link

Kalinowski, Steven T. – Educational and Psychological Measurement, 2019

Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph,…

Descriptors: Item Response Theory, Educational Assessment, Goodness of Fit, Probability

Detecting Test Tampering Using Item Response Theory

Peer reviewed

Direct link

Wollack, James A.; Cohen, Allan S.; Eckerly, Carol A. – Educational and Psychological Measurement, 2015

Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malfeasance. Few statistical approaches exist for detecting fraudulent erasures, and those that do largely do not…

Descriptors: Tests, Cheating, Item Response Theory, Accountability

Testing Mediation in Structural Equation Modeling: The Effectiveness of the Test of Joint Significance

Peer reviewed

Direct link

Leth-Steensen, Craig; Gallitto, Elena – Educational and Psychological Measurement, 2016

A large number of approaches have been proposed for estimating and testing the significance of indirect effects in mediation models. In this study, four sets of Monte Carlo simulations involving full latent variable structural equation models were run in order to contrast the effectiveness of the currently popular bias-corrected bootstrapping…

Descriptors: Mediation Theory, Structural Equation Models, Monte Carlo Methods, Simulation

Assessing Spurious Interaction Effects in Structural Equation Modeling

Peer reviewed

Direct link

Harring, Jeffrey R.; Weiss, Brandi A.; Li, Ming – Educational and Psychological Measurement, 2015

Several studies have stressed the importance of simultaneously estimating interaction and quadratic effects in multiple regression analyses, even if theory only suggests an interaction effect should be present. Specifically, past studies suggested that failing to simultaneously include quadratic effects when testing for interaction effects could…

Descriptors: Structural Equation Models, Statistical Analysis, Monte Carlo Methods, Computation

Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality

Peer reviewed

Direct link

Bishara, Anthony J.; Hittner, James B. – Educational and Psychological Measurement, 2015

It is more common for educational and psychological data to be nonnormal than to be approximately normal. This tendency may lead to bias and error in point estimates of the Pearson correlation coefficient. In a series of Monte Carlo simulations, the Pearson correlation was examined under conditions of normal and nonnormal data, and it was compared…

Descriptors: Research Methodology, Monte Carlo Methods, Correlation, Simulation

A Skew-Normal Mixture Regression Model

Peer reviewed

Direct link

Liu, Min; Lin, Tsung-I – Educational and Psychological Measurement, 2014

A challenge associated with traditional mixture regression models (MRMs), which rest on the assumption of normally distributed errors, is determining the number of unobserved groups. Specifically, even slight deviations from normality can lead to the detection of spurious classes. The current work aims to (a) examine how sensitive the commonly…

Descriptors: Regression (Statistics), Evaluation Methods, Indexes, Models

An Investigation of Sample Size Splitting on ATFIND and DIMTEST

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013

Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…

Descriptors: Sample Size, Test Length, Correlation, Test Format

Testing Measurement Invariance Using MIMIC: Likelihood Ratio Test with a Critical Value Adjustment

Peer reviewed

Direct link

Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012

Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…

Descriptors: Test Items, Simulation, Testing, Statistical Analysis

Adjusting the Adjusted X[superscript 2]/df Ratio Statistic for Dichotomous Item Response Theory Analyses: Does the Model Fit?

Peer reviewed

Direct link

Tay, Louis; Drasgow, Fritz – Educational and Psychological Measurement, 2012

Two Monte Carlo simulation studies investigated the effectiveness of the mean adjusted X[superscript 2]/df statistic proposed by Drasgow and colleagues and, because of problems with the method, a new approach for assessing the goodness of fit of an item response theory model was developed. It has been previously recommended that mean adjusted…

Descriptors: Test Length, Monte Carlo Methods, Goodness of Fit, Item Response Theory

Using a Model of Analysts' Judgments to Augment an Item Calibration Process

Peer reviewed

Direct link

Hauser, Carl; Thum, Yeow Meng; He, Wei; Ma, Lingling – Educational and Psychological Measurement, 2015

When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of human reviews (HR) to be completed is large. Furthermore, such a process leads to decisions that are susceptible…

Descriptors: Test Items, Item Response Theory, Research Methodology, Decision Making

Performance of the S - [chi][squared] Statistic for Full-Information Bifactor Models

Peer reviewed

Direct link

Li, Ying; Rupp, Andre A. – Educational and Psychological Measurement, 2011

This study investigated the Type I error rate and power of the multivariate extension of the S - [chi][squared] statistic using unidimensional and multidimensional item response theory (UIRT and MIRT, respectively) models as well as full-information bifactor (FI-bifactor) models through simulation. Manipulated factors included test length, sample…

Descriptors: Test Length, Item Response Theory, Statistical Analysis, Error Patterns

The Effects of Initially Misclassified Data on the Effectiveness of Discriminant Function Analysis and Finite Mixture Modeling

Peer reviewed

Direct link

Holden, Jocelyn E.; Kelley, Ken – Educational and Psychological Measurement, 2010

Classification procedures are common and useful in behavioral, educational, social, and managerial research. Supervised classification techniques such as discriminant function analysis assume training data are perfectly classified when estimating parameters or classifying. In contrast, unsupervised classification techniques such as finite mixture…

Descriptors: Discriminant Analysis, Classification, Computation, Behavioral Science Research

Previous Page | Next Page »

Pages: 1 | 2 | 3

Drasgow, Fritz	2
Albaum, Gerald	1
Bailit, Howard	1
Barton, Karen E.	1
Benson, Philip G.	1
Bishara, Anthony J.	1
Boruch, Robert F.	1
Brennan, Robert L.	1
Bullard, Cedric G.	1
Carter, Walter H., Jr.	1
Choi, Seung W.	1
Claudy, John G.	1
Cohen, Allan S.	1
Daubney, John H.	1
DeMars, Christine E.	1
Dickinson, Terry L.	1
Eckerly, Carol A.	1
Ferreres, Doris	1
Fidalgo, Angel M.	1
Fisicaro, Sebastiano A.	1
Gallitto, Elena	1
Goodstadt, Michael S.	1
Green, Kathy	1
Han, Suhwa	1
More ▼