Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 26 |
Descriptor
| Computation | 31 |
| Reliability | 31 |
| Error of Measurement | 11 |
| Simulation | 11 |
| Correlation | 6 |
| Monte Carlo Methods | 6 |
| Statistical Analysis | 6 |
| Item Response Theory | 5 |
| Measures (Individuals) | 5 |
| Scores | 5 |
| Comparative Analysis | 4 |
Author
| Green, Samuel B. | 2 |
| Raykov, Tenko | 2 |
| Sinharay, Sandip | 2 |
| Yang, Yanyun | 2 |
| Bailey, Paul | 1 |
| Beauducel, Andre | 1 |
| Becker, Gilbert | 1 |
| Bonett, Douglas G. | 1 |
| Chatman, Steve | 1 |
| Christ, Theodore J. | 1 |
| Deatline-Buchman, Andria | 1 |
Publication Type
| Reports - Evaluative | 31 |
| Journal Articles | 27 |
Education Level
| Elementary Education | 2 |
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Elementary Secondary Education | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
Audience
| Researchers | 1 |
Assessments and Surveys
| Eysenck Personality Inventory | 1 |
| Kaufman Test of Educational… | 1 |
| Marlowe Crowne Social… | 1 |
| Rosenberg Self Esteem Scale | 1 |
| Stanford Binet Intelligence… | 1 |
| Wide Range Achievement Test | 1 |
The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models
Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020
One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…
Descriptors: Reliability, Probability, Skill Development, Classification
Song, Yue; Sun, Feng; Redline, Susan; Wang, Rui – Research Synthesis Methods, 2020
Meta-analyses of clinical trials typically focus on one outcome at a time. However, treatment decision-making depends on an overall assessment of outcomes balancing benefit in various domains and potential risks. This calls for meta-analysis methods for combined outcomes that encompass information from different domains. When individual patient…
Descriptors: Meta Analysis, Patients, Data, Outcomes of Treatment
Bailey, Paul; Emad, Ahmad; Zhang, Ting; Xie, Qingshu; Sikali, Emmanuel – American Institutes for Research, 2018
Correlation analysis has been used widely by researchers and analysts when analyzing large-scale assessment data. Limited research has provided reliable methods to estimate various correlations and their standard errors with the complex sampling design and multiple plausible values taken into account. This report introduces the methodology used by the…
Descriptors: Correlation, Educational Assessment, Measurement, Statistical Bias
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2019
Longitudinal data analysis has received widespread interest throughout educational, behavioral, and social science research, with latent growth curve modeling currently being one of the most popular methods of analysis. Despite the popularity of latent growth curve modeling, limited attention has been directed toward understanding the issues of…
Descriptors: Reliability, Longitudinal Studies, Growth Models, Structural Equation Models
Green, Samuel B.; Yang, Yanyun – Educational Measurement: Issues and Practice, 2015
In the lead article, Davenport, Davison, Liou, & Love demonstrate the relationship among homogeneity, internal consistency, and coefficient alpha, and also distinguish among them. These distinctions are important because too often coefficient alpha--a reliability coefficient--is interpreted as an index of homogeneity or internal consistency.…
Descriptors: Reliability, Factor Analysis, Computation, Factor Structure
Ernest, Paul – Educational Studies in Mathematics, 2016
Two questions about certainty in mathematics are asked. First, is mathematical knowledge known with certainty? Second, why is the belief in the certainty of mathematical knowledge so widespread and where does it come from? This question is little addressed in the literature. In explaining the reasons for these beliefs, both cultural-historical and…
Descriptors: Mathematics Education, Mathematical Concepts, Mathematical Logic, Epistemology
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's α, Feldt-Raju, stratified α, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
Tesio, Luigi – International Journal of Rehabilitation Research, 2012
Outcome studies in biomedical research usually focus on testing mean changes across samples of subjects and, in so doing, often obscure changes in individuals. These changes, however, may be very informative in studies in which large or homogeneous samples are unavailable and mechanisms of action are still under scrutiny, as is often the case for…
Descriptors: Biomedicine, Correlation, Computation, Behavioral Sciences
Lee, Guemin; Park, In-Yong – Asia Pacific Education Review, 2012
Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…
Descriptors: Generalizability Theory, Simulation, Computation, Item Response Theory
Moses, Tim – Journal of Educational Measurement, 2012
The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…
Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores
Beauducel, Andre – Applied Psychological Measurement, 2013
The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…
Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement
Bonett, Douglas G. – Psychological Methods, 2010
The conventional fixed-effects (FE) and random-effects (RE) confidence intervals that are used to assess the average alpha reliability across multiple studies have serious limitations. The FE method, which is based on a constant coefficient model, assumes equal reliability coefficients across studies and breaks down under minor violations of this…
Descriptors: Meta Analysis, Reliability, Computation, Statistical Analysis
Romano, Jeanine L.; Kromrey, Jeffrey D.; Owens, Corina M.; Scott, Heather M. – Journal of Experimental Education, 2011
In this study, the authors aimed to examine 8 of the different methods for computing confidence intervals around alpha that have been proposed to determine which of these, if any, is the most accurate and precise. Monte Carlo methods were used to simulate samples under known and controlled population conditions wherein the underlying item…
Descriptors: Intervals, Monte Carlo Methods, Rating Scales, Computation
Thompson, Barry L.; Green, Samuel B.; Yang, Yanyun – Educational and Psychological Measurement, 2010
The maximal split-half coefficient is computed by calculating all possible split-half reliability estimates for a scale and then choosing the maximal value as the reliability estimate. Osburn compared the maximal split-half coefficient with 10 other internal consistency estimates of reliability and concluded that it yielded the most consistently…
Descriptors: Reliability, Computation, Simulation, Statistical Analysis
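The maximal split-half procedure described in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes an even number of items, equal-sized halves, and a Spearman-Brown step-up of each half-score correlation, none of which the abstract specifies. The function names `split_half_estimates` and `maximal_split_half` are hypothetical.

```python
from itertools import combinations
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length score lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def split_half_estimates(scores):
    """Return one reliability estimate per distinct equal split.

    scores: list of rows (one per respondent), each a list of item scores.
    Assumption: each estimate is the Spearman-Brown corrected correlation
    between the two half-test sum scores.
    """
    k = len(scores[0])
    items = range(k)
    estimates = []
    for half in combinations(items, k // 2):
        if 0 not in half:
            continue  # keep item 0 in the first half so each split is counted once
        other = [i for i in items if i not in half]
        a = [sum(row[i] for i in half) for row in scores]
        b = [sum(row[i] for i in other) for row in scores]
        r = pearson(a, b)
        estimates.append(2 * r / (1 + r))  # Spearman-Brown step-up
    return estimates


def maximal_split_half(scores):
    """Choose the largest of all split-half estimates."""
    return max(split_half_estimates(scores))


# Made-up illustrative data: 5 respondents, 4 items.
scores = [
    [1, 2, 3, 4],
    [2, 2, 4, 5],
    [3, 4, 4, 4],
    [5, 4, 5, 6],
    [1, 1, 2, 3],
]
estimates = split_half_estimates(scores)
best = maximal_split_half(scores)
```

Because the maximum is taken over every split, the coefficient capitalizes on chance in any one sample. The number of splits also grows combinatorially with the number of items, so exhaustive enumeration like this is practical only for short scales.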
Harring, Jeffrey R.; Weiss, Brandi A.; Hsu, Jui-Chen – Psychological Methods, 2012
Two Monte Carlo simulations were performed to compare methods for estimating and testing hypotheses of quadratic effects in latent variable regression models. The methods considered in the current study were (a) a 2-stage moderated regression approach using latent variable scores, (b) an unconstrained product indicator approach, (c) a latent…
Descriptors: Structural Equation Models, Geometric Concepts, Computation, Comparative Analysis

