ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	24

Source

Educational and Psychological…

Publication Type

Journal Articles	32
Reports - Evaluative	32
Speeches/Meeting Papers	1

Education Level

Audience

Location

Indiana

Laws, Policies, & Programs

Assessments and Surveys

Law School Admission Test	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

On the Pitfalls of Estimating and Using Standardized Reliability Coefficients

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2021

The population discrepancy between unstandardized and standardized reliability of homogeneous multicomponent measuring instruments is examined. Within a latent variable modeling framework, it is shown that the standardized reliability coefficient for unidimensional scales can be markedly higher than the corresponding unstandardized reliability…

Descriptors: Test Reliability, Computation, Measures (Individuals), Research Problems

Generalized Linear Factor Score Regression: A Comparison of Four Methods

Peer reviewed

Direct link

Andersson, Gustaf; Yang-Wallentin, Fan – Educational and Psychological Measurement, 2021

Factor score regression has recently received growing interest as an alternative for structural equation modeling. However, many applications are left without guidance because of the focus on normally distributed outcomes in the literature. We perform a simulation study to examine how a selection of factor scoring methods compare when estimating…

Descriptors: Regression (Statistics), Statistical Analysis, Computation, Scoring

Multiple-Component Measurement Instruments in Heterogeneous Populations: Is There a Single Coefficient Alpha?

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Harrison, Michael; Menold, Natalja – Educational and Psychological Measurement, 2019

This note confronts the common use of a single coefficient alpha as an index informing about reliability of a multicomponent measurement instrument in a heterogeneous population. Two or more alpha coefficients could instead be meaningfully associated with a given instrument in finite mixture settings, and this may be increasingly more likely the…

Descriptors: Statistical Analysis, Test Reliability, Measures (Individuals), Computation

Hypothesis Testing, "p" Values, Confidence Intervals, Measures of Effect Size, and Bayesian Methods in Light of Modern Robust Techniques

Peer reviewed

Direct link

Wilcox, Rand R.; Serang, Sarfaraz – Educational and Psychological Measurement, 2017

The article provides perspectives on p values, null hypothesis testing, and alternative techniques in light of modern robust statistical methods. Null hypothesis testing and "p" values can provide useful information provided they are interpreted in a sound manner, which includes taking into account insights and advances that have…

Descriptors: Hypothesis Testing, Bayesian Statistics, Computation, Effect Size

Thou Shalt Not Bear False Witness against Null Hypothesis Significance Testing

Peer reviewed

Direct link

García-Pérez, Miguel A. – Educational and Psychological Measurement, 2017

Null hypothesis significance testing (NHST) has been the subject of debate for decades and alternative approaches to data analysis have been proposed. This article addresses this debate from the perspective of scientific inquiry and inference. Inference is an inverse problem and application of statistical methods cannot reveal whether effects…

Descriptors: Hypothesis Testing, Statistical Inference, Effect Size, Bayesian Statistics

Power Analysis for Models of Change in Cluster Randomized Designs

Peer reviewed

Direct link

Li, Wei; Konstantopoulos, Spyros – Educational and Psychological Measurement, 2017

Field experiments in education frequently assign entire groups such as schools to treatment or control conditions. These experiments incorporate sometimes a longitudinal component where for example students are followed over time to assess differences in the average rate of linear change, or rate of acceleration. In this study, we provide methods…

Descriptors: Educational Experiments, Field Studies, Models, Randomized Controlled Trials

Interrater Agreement Evaluation: A Latent Variable Modeling Approach

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; von Eye, Alexander; Marcoulides, George A. – Educational and Psychological Measurement, 2013

A latent variable modeling method for evaluation of interrater agreement is outlined. The procedure is useful for point and interval estimation of the degree of agreement among a given set of judges evaluating a group of targets. In addition, the approach allows one to test for identity in underlying thresholds across raters as well as to identify…

Descriptors: Interrater Reliability, Models, Statistical Analysis, Computation

Comparison between Dichotomous and Polytomous Scoring of Innovative Items in a Large-Scale Computerized Adaptive Test

Peer reviewed

Direct link

Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012

This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…

Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring

Numerical Differentiation Methods for Computing Error Covariance Matrices in Item Response Theory Modeling: An Evaluation and a New Proposal

Peer reviewed

Direct link

Tian, Wei; Cai, Li; Thissen, David; Xin, Tao – Educational and Psychological Measurement, 2013

In item response theory (IRT) modeling, the item parameter error covariance matrix plays a critical role in statistical inference procedures. When item parameters are estimated using the EM algorithm, the parameter error covariance matrix is not an automatic by-product of item calibration. Cai proposed the use of Supplemented EM algorithm for…

Descriptors: Item Response Theory, Computation, Matrices, Statistical Inference

Confidence Intervals for Squared Semipartial Correlation Coefficients: The Effect of Nonnormality

Peer reviewed

Direct link

Algina, James; Keselman, H. J.; Penfield, Randall D. – Educational and Psychological Measurement, 2010

The increase in the squared multiple correlation coefficient ([delta]R[superscript 2]) associated with a variable in a regression equation is a commonly used measure of importance in regression analysis. Algina, Keselman, and Penfield found that intervals based on asymptotic principles were typically very inaccurate, even though the sample size…

Descriptors: Computation, Statistical Analysis, Correlation, Statistical Inference

Assessment of the Maximal Split-Half Coefficient to Estimate Reliability

Peer reviewed

Direct link

Thompson, Barry L.; Green, Samuel B.; Yang, Yanyun – Educational and Psychological Measurement, 2010

The maximal split-half coefficient is computed by calculating all possible split-half reliability estimates for a scale and then choosing the maximal value as the reliability estimate. Osburn compared the maximal split-half coefficient with 10 other internal consistency estimates of reliability and concluded that it yielded the most consistently…

Descriptors: Reliability, Computation, Simulation, Statistical Analysis

Computerized Classification Testing under the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011

The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…

Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory

Expected Equating Error Resulting from Incorrect Handling of Item Parameter Drift among the Common Items

Peer reviewed

Direct link

Miller, G. Edward; Fitzpatrick, Steven J. – Educational and Psychological Measurement, 2009

Incorrect handling of item parameter drift during the equating process can result in equating error. If the item parameter drift is due to construct-irrelevant factors, then inclusion of these items in the estimation of the equating constants can be expected to result in equating error. On the other hand, if the item parameter drift is related to…

Descriptors: Equated Scores, Computation, Item Response Theory, Test Items

The Effects of Small Sample Size on Identifying Polytomous DIF Using the Liu-Agresti Estimator of the Cumulative Common Odds Ratio

Peer reviewed

Direct link

Carvajal, Jorge; Skorupski, William P. – Educational and Psychological Measurement, 2010

This study is an evaluation of the behavior of the Liu-Agresti estimator of the cumulative common odds ratio when identifying differential item functioning (DIF) with polytomously scored test items using small samples. The Liu-Agresti estimator has been proposed by Penfield and Algina as a promising approach for the study of polytomous DIF but no…

Descriptors: Test Bias, Sample Size, Test Items, Computation

Assessing Fit and Dimensionality in Least Squares Metric Multidimensional Scaling Using Akaike's Information Criterion

Peer reviewed

Direct link

Ding, Cody S.; Davison, Mark L. – Educational and Psychological Measurement, 2010

Akaike's information criterion is suggested as a tool for evaluating fit and dimensionality in metric multidimensional scaling that uses least squares methods of estimation. This criterion combines the least squares loss function with the number of estimated parameters. Numerical examples are presented. The results from analyses of both simulation…

Descriptors: Multidimensional Scaling, Least Squares Statistics, Criteria, Computation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Computation	32
Simulation	12
Statistical Analysis	12
Item Response Theory	8
Sample Size	8
Test Items	8
Correlation	7
Models	7
Monte Carlo Methods	7
Error of Measurement	6
Evaluation Methods	6
Data Analysis	5
Effect Size	5
Classification	4
Comparative Analysis	4
Factor Analysis	4
Sampling	4
Statistical Inference	4
Computer Assisted Testing	3
Equations (Mathematics)	3
Evaluation Research	3
Goodness of Fit	3
Hypothesis Testing	3
Measurement Techniques	3
Measures (Individuals)	3
More ▼

Marcoulides, George A.	3
Raykov, Tenko	3
Wang, Wen-Chung	3
Penfield, Randall D.	2
Algina, James	1
Andersson, Gustaf	1
Bandalos, Deborah	1
Bentler, Peter M.	1
Bjornstad, Jan F.	1
Brennan, Robert L.	1
Cai, Li	1
Carvajal, Jorge	1
Chen, Cheng-Te	1
Cho, Sun-Joo	1
Cota, Albert A.	1
Davison, Mark L.	1
Dimitrov, Dimiter M.	1
Ding, Cody S.	1
Feldt, Leonard S.	1
Finstuen, Kenn	1
Fitzpatrick, Steven J.	1
García-Pérez, Miguel A.	1
Gorham, Jerry	1
Green, Samuel B.	1
More ▼