ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	12

Descriptor

Computation	14
Sample Size	14
Item Response Theory	8
Test Length	7
Error of Measurement	5
Models	5
Monte Carlo Methods	5
Statistical Analysis	5
Simulation	4
Accuracy	3
Computer Software	3
Maximum Likelihood Statistics	3
Test Bias	3
Comparative Analysis	2
Correlation	2
Evaluation Methods	2
Factor Analysis	2
Least Squares Statistics	2
Markov Processes	2
Nonparametric Statistics	2
Statistical Bias	2
Test Items	2
Bias	1
Classification	1
Cutting Scores	1
More ▼

Source

Applied Psychological…

Publication Type

Journal Articles	14
Reports - Research	8
Reports - Evaluative	5
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Small-Sample DIF Estimation Using SIBTEST, Cochran's Z, and Log-Linear Smoothing

Peer reviewed

Direct link

Lei, Pui-Wa; Li, Hongli – Applied Psychological Measurement, 2013

Minimum sample sizes of about 200 to 250 per group are often recommended for differential item functioning (DIF) analyses. However, there are times when sample sizes for one or both groups of interest are smaller than 200 due to practical constraints. This study attempts to examine the performance of Simultaneous Item Bias Test (SIBTEST),…

Descriptors: Sample Size, Test Bias, Computation, Accuracy

Two Approaches to Estimation of Classification Accuracy Rate under Item Response Theory

Peer reviewed

Direct link

Lathrop, Quinn N.; Cheng, Ying – Applied Psychological Measurement, 2013

Within the framework of item response theory (IRT), there are two recent lines of work on the estimation of classification accuracy (CA) rate. One approach estimates CA when decisions are made based on total sum scores, the other based on latent trait estimates. The former is referred to as the Lee approach, and the latter, the Rudner approach,…

Descriptors: Item Response Theory, Accuracy, Classification, Computation

Effects of Vertical Scaling Methods on Linear Growth Estimation

Peer reviewed

Direct link

Lei, Pui-Wa; Zhao, Yu – Applied Psychological Measurement, 2012

Vertical scaling is necessary to facilitate comparison of scores from test forms of different difficulty levels. It is widely used to enable the tracking of student growth in academic performance over time. Most previous studies on vertical scaling methods assume relatively long tests and large samples. Little is known about their performance when…

Descriptors: Scaling, Item Response Theory, Test Length, Sample Size

Accuracy of DIF Estimates and Power in Unbalanced Designs Using the Mantel-Haenszel DIF Detection Procedure

Peer reviewed

Direct link

Paek, Insu; Guo, Hongwen – Applied Psychological Measurement, 2011

This study examined how much improvement was attainable with respect to accuracy of differential item functioning (DIF) measures and DIF detection rates in the Mantel-Haenszel procedure when employing focal and reference groups with notably unbalanced sample sizes where the focal group has a fixed small sample which does not satisfy the minimum…

Descriptors: Test Bias, Accuracy, Reference Groups, Investigations

A Comparison of Bias Correction Adjustments for the DETECT Procedure

Peer reviewed

Direct link

Nandakumar, Ratna; Yu, Feng; Zhang, Yanwei – Applied Psychological Measurement, 2011

DETECT is a nonparametric methodology to identify the dimensional structure underlying test data. The associated DETECT index, "D[subscript max]," denotes the degree of multidimensionality in data. Conditional covariances (CCOV) are the building blocks of this index. In specifying population CCOVs, the latent test composite [theta][subscript TT]…

Descriptors: Nonparametric Statistics, Statistical Analysis, Tests, Data

Marginal Maximum A Posteriori Item Parameter Estimation for the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Roberts, James S.; Thompson, Vanessa M. – Applied Psychological Measurement, 2011

A marginal maximum a posteriori (MMAP) procedure was implemented to estimate item parameters in the generalized graded unfolding model (GGUM). Estimates from the MMAP method were compared with those derived from marginal maximum likelihood (MML) and Markov chain Monte Carlo (MCMC) procedures in a recovery simulation that varied sample size,…

Descriptors: Statistical Analysis, Markov Processes, Computation, Monte Carlo Methods

An Extension of Least Squares Estimation of IRT Linking Coefficients for the Graded Response Model

Peer reviewed

Direct link

Kim, Seonghoon – Applied Psychological Measurement, 2010

The three types (generalized, unweighted, and weighted) of least squares methods, proposed by Ogasawara, for estimating item response theory (IRT) linking coefficients under dichotomous models are extended to the graded response model. A simulation study was conducted to confirm the accuracy of the extended formulas, and a real data study was…

Descriptors: Least Squares Statistics, Computation, Item Response Theory, Models

SIMREL: Software for Coefficient Alpha and Its Confidence Intervals with Monte Carlo Studies

Peer reviewed

Direct link

Yurdugul, Halil – Applied Psychological Measurement, 2009

This article describes SIMREL, a software program designed for the simulation of alpha coefficients and the estimation of its confidence intervals. SIMREL runs on two alternatives. In the first one, if SIMREL is run for a single data file, it performs descriptive statistics, principal components analysis, and variance analysis of the item scores…

Descriptors: Intervals, Monte Carlo Methods, Computer Software, Factor Analysis

Item Parameter Estimation for the MIRT Model: Bias and Precision of Confirmatory Factor Analysis-Based Models

Peer reviewed

Direct link

Finch, Holmes – Applied Psychological Measurement, 2010

The accuracy of item parameter estimates in the multidimensional item response theory (MIRT) model context is one that has not been researched in great detail. This study examines the ability of two confirmatory factor analysis models specifically for dichotomous data to properly estimate item parameters using common formulae for converting factor…

Descriptors: Item Response Theory, Computation, Factor Analysis, Models

Ramsay-Curve Item Response Theory for the Three-Parameter Logistic Item Response Model

Peer reviewed

Direct link

Woods, Carol M. – Applied Psychological Measurement, 2008

In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…

Descriptors: Test Length, Computation, Item Response Theory, Maximum Likelihood Statistics

A Comparison of Methods for Nonparametric Estimation of Item Characteristic Curves for Binary Items

Peer reviewed

Direct link

Lee, Young-Sun – Applied Psychological Measurement, 2007

This study compares the performance of three nonparametric item characteristic curve (ICC) estimation procedures: isotonic regression, smoothed isotonic regression, and kernel smoothing. Smoothed isotonic regression, employed along with an appropriate kernel function, provides better estimates and also satisfies the assumption of strict…

Descriptors: Nonparametric Statistics, Computation, Item Response Theory, Evaluation Methods

A Monte Carlo Approach to Unidimensionality Testing in Polytomous Rasch Models

Peer reviewed

Direct link

Christensen, Karl Bang; Kreiner, Svend – Applied Psychological Measurement, 2007

Many statistical tests are designed to test the different assumptions of the Rasch model, but only few are directed at detecting multidimensionality. The Martin-Lof test is an attractive approach, the disadvantage being that its null distribution deviates strongly from the asymptotic chi-square distribution for most realistic sample sizes. A Monte…

Descriptors: Item Response Theory, Monte Carlo Methods, Testing, Models

Standard Errors of the Kernel Equating Methods under the Common-Item Design.

Peer reviewed

Liou, Michelle; Cheng, Philip E.; Johnson, Eugene G. – Applied Psychological Measurement, 1997

Derived simplified equations to compute the standard error of the frequency estimation method for equating score distributions that are continuized using a uniform or Gaussian kernel function. Results from two empirical studies indicate that these equations work reasonably well for moderate size samples. (SLD)

Descriptors: Computation, Equated Scores, Error of Measurement, Estimation (Mathematics)

Markov Chain Monte Carlo Estimation of Item Parameters for the Generalized Graded Unfolding Model

Peer reviewed

Direct link

de la Torre, Jimmy; Stark, Stephen; Chernyshenko, Oleksandr S. – Applied Psychological Measurement, 2006

The authors present a Markov Chain Monte Carlo (MCMC) parameter estimation procedure for the generalized graded unfolding model (GGUM) and compare it to the marginal maximum likelihood (MML) approach implemented in the GGUM2000 computer program, using simulated and real personality data. In the simulation study, test length, number of response…

Descriptors: Computation, Monte Carlo Methods, Markov Processes, Item Response Theory

Lei, Pui-Wa	2
Cheng, Philip E.	1
Cheng, Ying	1
Chernyshenko, Oleksandr S.	1
Christensen, Karl Bang	1
Finch, Holmes	1
Guo, Hongwen	1
Johnson, Eugene G.	1
Kim, Seonghoon	1
Kreiner, Svend	1
Lathrop, Quinn N.	1
Lee, Young-Sun	1
Li, Hongli	1
Liou, Michelle	1
Nandakumar, Ratna	1
Paek, Insu	1
Roberts, James S.	1
Stark, Stephen	1
Thompson, Vanessa M.	1
Woods, Carol M.	1
Yu, Feng	1
Yurdugul, Halil	1
Zhang, Yanwei	1
Zhao, Yu	1
de la Torre, Jimmy	1
More ▼