NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 2,671 to 2,685 of 3,311 results Save | Export
Peer reviewed Peer reviewed
Cornwell, John M. – Educational and Psychological Measurement, 1993
A comparison is made of the power and actual alpha levels of three tests of homogeneity for independent product-moment correlation coefficients using Monte Carlo methods while selectively studying sample size and varying the number of correlation reliabilities. How robust these are in applied work is discussed. (SLD)
Descriptors: Comparative Analysis, Correlation, Error of Measurement, Monte Carlo Methods
Peer reviewed Peer reviewed
Blickensderfer, Roger – Physics Teacher, 1996
Presents an exercise that gives students a real feeling for experimental errors and demonstrates the importance of frequent repetition and averaging. (JRH)
Descriptors: Error of Measurement, Higher Education, Measurement, Physics
Peer reviewed Peer reviewed
Thompson, Paul A. – Multivariate Behavioral Research, 1991
Application of the bootstrap method to complex psychological analysis is illustrated using a simulation experiment with two populations with small and large samples. The method provides variance estimates, allows testing of nested competing models, and gives a preliminary idea about parameter variability. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
Peer reviewed Peer reviewed
Stanley, T. D. – Evaluation Review, 1991
W. M. K. Trochim and others defend the record of the regression-discontinuity (RD) design and blur the statistical tests for treatment effect. Their Monte Carlo results show the problematic nature of RD and its potential bias. New testing strategies and restrictions for the application of RD are proposed. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
Peer reviewed Peer reviewed
Goldstein, Harvey – Journal of Educational Statistics, 1991
Multilevel models used to assess and compare school effectiveness, particularly their application to multivariate data, are critiqued. Problems associated with standard errors of measurement and the various components of "school effect" are highlighted in terms of how they could affect incautious application of the models. (TJH)
Descriptors: Comparative Analysis, Educational Assessment, Effect Size, Elementary Secondary Education
Peer reviewed Peer reviewed
Goodwin, Laura D.; Goodwin, William L. – Journal of Early Intervention, 1991
Four approaches to estimating interrater reliability in early childhood special education research are illustrated and compared: correlation, comparison of means, percentage of agreement, and generalizability theory techniques. Generalizability theory techniques are proposed as a method for estimating the amount of variance attributable to…
Descriptors: Analysis of Variance, Disabilities, Early Childhood Education, Educational Research
Peer reviewed Peer reviewed
Molenaar, Peter C. M.; Nesselroade, John R. – Multivariate Behavioral Research, 1998
Pseudo-Maximum Likelihood (p-ML) and Asymptotically Distribution Free (ADF) estimation methods for estimating dynamic factor model parameters within a covariance structure framework were compared through a Monte Carlo simulation. Both methods appear to give consistent model parameter estimates, but only ADF gives standard errors and chi-square…
Descriptors: Chi Square, Comparative Analysis, Error of Measurement, Estimation (Mathematics)
Peer reviewed Peer reviewed
Sabatelli, Ronald M.; Waldron, Rebecca J. – Journal of Marriage and the Family, 1995
Presents a general discussion of the conceptual and operational limitations found within the existing efforts to assess constructs central to the study of parents. The discussion is followed by a review and critique of measures designed to assess concepts central to research on parents. Includes recommendations to improve the quality of measures.…
Descriptors: Content Validity, Error of Measurement, Higher Education, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Monahan, Patrick O.; Ankenmann, Robert D. – Journal of Educational Measurement, 2005
Empirical studies demonstrated Type-I error (TIE) inflation (especially for highly discriminating easy items) of the Mantel-Haenszel chi-square test for differential item functioning (DIF), when data conformed to item response theory (IRT) models more complex than Rasch, and when IRT proficiency distributions differed only in means. However, no…
Descriptors: Sample Size, Item Response Theory, Test Items, Test Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Kang, Sun-Mee; Waller, Niels G. – Applied Psychological Measurement, 2005
Two Monte Carlo studies were conducted to explore the Type I error rates in moderated multiple regression (MMR) of observed scores and estimated latent trait scores from a two-parameter logistic item response theory (IRT) model. The results of both studies showed that MMR Type I error rates were substantially higher than the nominal alpha levels…
Descriptors: Multiple Regression Analysis, Interaction, Monte Carlo Methods, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Leark, Robert A.; Wallace, Denise R.; Fitzgerald, Robert – Assessment, 2004
Test-retest reliability of the Test of Variables of Attention (T.O.V.A.) was investigated in two studies using two different time intervals: 90 min and 1 week (plus or minus 2 days). To investigate the 90-min reliability, 31 school-age children (M = 10 years, SD = 2.66) were administered the T.O.V.A. then read ministered the test 90 min afterward.…
Descriptors: Intervals, Reaction Time, Error of Measurement, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Hui, Stanley Sai-chuen; Cheung, Peggy Pui-yee – Measurement in Physical Education and Exercise Science, 2004
The purpose of this study was to evaluate the criterion-related validity of a 3-min step test using 3 stepping cadences. Chinese children (N = 50; age = 8.92 plus or minus 1.64 years) performed three 3-min bouts of a bench stepping exercise on a 12-in.-high (30.5 cm) bench. Stepping cadences for the 3 step tests were 22, 26, and 30 steps per…
Descriptors: Metabolism, Physical Fitness, Foreign Countries, Field Tests
Peer reviewed Peer reviewed
Luh, Wei-Ming; Guo, Jiin-Huarng – Journal of Experimental Education, 2005
To deal with nonnormal and heterogeneous data for the one-way fixed effect analysis of variance model, the authors adopted a trimmed means method in conjunction with Hall's invertible transformation into a heteroscedastic test statistic (Alexander-Govern test or Welch test). The results of simulation experiments showed that the proposed technique…
Descriptors: Robustness (Statistics), Computer Simulation, Educational Research, Error Patterns
Peer reviewed Peer reviewed
Direct linkDirect link
Wu, Margaret – Studies in Educational Evaluation, 2005
In large-scale assessment programs such as NAEP, TIMSS and PISA, students' achievement data sets provided for secondary analysts contain so-called "plausible values." Plausible values are multiple imputations of the unobservable latent achievement for each student. In this article it has been shown how plausible values are used to: (1)…
Descriptors: Error of Measurement, Computation, Educational Research, Educational Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Battistin, Erich; Miniaci, Raffaele; Weber, Guglielmo – Journal of Human Resources, 2003
In this paper, we use two complementary Italian data sources (the 1995 ISTAT and Bank of Italy household surveys) to generate household-specific nondurable expenditure in the Bank of Italy sample that contains relatively high-quality income data. We show that food expenditure data are of comparable quality and informational content across the two…
Descriptors: Expenditures, Data, Prediction, Foreign Countries
Pages: 1  |  ...  |  175  |  176  |  177  |  178  |  179  |  180  |  181  |  182  |  183  |  ...  |  221