Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 411 |
| Since 2017 (last 10 years) | 914 |
| Since 2007 (last 20 years) | 1965 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Enders, Craig K.; Peugh, James L. – Structural Equation Modeling, 2004
Two methods, direct maximum likelihood (ML) and the expectation maximization (EM) algorithm, can be used to obtain ML parameter estimates for structural equation models with missing data (MD). Although the 2 methods frequently produce identical parameter estimates, it may be easier to satisfy missing at random assumptions using EM. However, no…
Descriptors: Inferences, Structural Equation Models, Factor Analysis, Error of Measurement
Lei, Pui-Wa; Dunbar, Stephen B. – Structural Equation Modeling, 2004
The primary purpose of this study was to examine relative performance of 2 power estimation methods in structural equation modeling. Sample size, alpha level, type of manifest variable, type of specification errors, and size of correlation between constructs were manipulated. Type 1 error rate of the model chi-square test, empirical critical…
Descriptors: Measures (Individuals), Structural Equation Models, Computation, Scores
Rindskopf, David; Strauss, Shiela – Structural Equation Modeling, 2004
We demonstrate a model for categorical data that parallels the MIMIC model for continuous data. The model is equivalent to a latent class model with observed covariates; further, it includes simple handling of missing data. The model is used on data from a large-scale study of HIV that had both biological measures of infection and self-report…
Descriptors: Sexually Transmitted Diseases, Communicable Diseases, Predictor Variables, Error of Measurement
Peer reviewedOnwuegbuzie, Anthony J.; Roberts, J. Kyle; Daniel, Larry G. – Measurement and Evaluation in Counseling and Development, 2005
In this article, the authors (a) illustrate how displaying disattenuated correlation coefficients alongside their unadjusted counterparts will allow researchers to assess the impact of unreliability on bivariate relationships and (b) demonstrate how a proposed new "what if reliability" analysis can complement null hypothesis significance…
Descriptors: Correlation, Statistical Significance, Reliability, Error of Measurement
Marcus, Gary F. – Journal of Cognition and Development, 2004
"Little by little, the child develops," wrote an undergraduate in a friend's cognitive development class, and so, for the most part, it is. But what explains the U's of cognitive development? Namy, Campbell, and Tomasello and Cashon and Cohen take a standard approach to understanding U-shaped curves: as the product of a mix of different cognitive…
Descriptors: Measurement Techniques, Error of Measurement, Cognitive Processes, Cognitive Development
Yuan, Ke-Hai; Maxwell, Scott – Journal of Educational and Behavioral Statistics, 2005
Retrospective or post hoc power analysis is recommended by reviewers and editors of many journals. Little literature has been found that gave a serious study of the post hoc power. When the sample size is large, the observed effect size is a good estimator of the true power. This article studies whether such a power estimator provides valuable…
Descriptors: Effect Size, Computation, Monte Carlo Methods, Bias
Charles, Eric P. – Psychological Methods, 2005
The correction for attenuation due to measurement error (CAME) has received many historical criticisms, most of which can be traced to the limited ability to use CAME inferentially. Past attempts to determine confidence intervals for CAME are summarized and their limitations discussed. The author suggests that inference requires confidence sets…
Descriptors: Error of Measurement, Error Correction, Intervals, Inferences
Adams, Raymond J. – Studies in Educational Evaluation, 2005
Test reliability is a concept central to classical test theory and it is commonly stated as a requirement that a test attain a certain level of reliability before it be considered of sufficient quality for practical use. This article discusses the role of reliability in item response theory, and in particular the role of reliability in contexts…
Descriptors: Test Reliability, Error of Measurement, Item Sampling, Item Response Theory
Li, Deping; Oranje, Andreas – ETS Research Report Series, 2007
Two versions of a general method for approximating standard error of regression effect estimates within an IRT-based latent regression model are compared. The general method is based on Binder's (1983) approach, accounting for complex samples and finite populations by Taylor series linearization. In contrast, the current National Assessment of…
Descriptors: Error of Measurement, Regression (Statistics), Trend Analysis, National Competency Tests
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Hartig, Johannes; Holzel, Britta; Moosbrugger, Helfried – Multivariate Behavioral Research, 2007
Numerous studies have shown increasing item reliabilities as an effect of the item position in personality scales. Traditionally, these context effects are analyzed based on item-total correlations. This approach neglects that trends in item reliabilities can be caused either by an increase in true score variance or by a decrease in error…
Descriptors: True Scores, Error of Measurement, Structural Equation Models, Simulation
Van Hulle, C. A.; Lemery-Chalfant, K.; Goldsmith, H. H. – Journal of Child Psychology and Psychiatry, 2007
Background: Relatively little is known about the genetic architecture of childhood behavioral disorders in very young children. Method: In this study, parents completed the Infant-Toddler Social and Emotional Assessment, a questionnaire that assesses symptoms of childhood disorders, as well as socio-emotional competencies, for 822 twin pairs…
Descriptors: Twins, Behavior Disorders, Toddlers, Infants
Reardon, Sean F. – Education and the Public Interest Center, 2009
"How New York City's Charter Schools Affect Achievement" estimates the effects on student achievement of attending a New York City charter school rather than a traditional public school and investigates the characteristics of charter schools associated with the most positive effects on achievement. Because the report relies on an…
Descriptors: Charter Schools, Academic Achievement, Achievement Gains, Achievement Rating
Haberman, Shelby J.; Sinharay, Sadip; Puhan, Gautam – ETS Research Report Series, 2006
Recently, there has been an increasing level of interest in reporting subscores. This paper examines the issue of reporting subscores at an aggregate level, especially at the level of institutions that the examinees belong to. A series of statistical analyses is suggested to determine when subscores at the institutional level have any added value…
Descriptors: Scores, Statistical Analysis, Error of Measurement, Reliability
Ryan, Robert S. – Teaching of Psychology, 2006
One of the most difficult concepts for statistics students is the standard error of the mean. To improve understanding of this concept, 1 group of students used a hands-on procedure to sample from small populations representing either a true or false null hypothesis. The distribution of 120 sample means (n = 3) from each population had standard…
Descriptors: Statistics, Error of Measurement, Experiential Learning, Hypothesis Testing

Direct link
