NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)1
Since 2007 (last 20 years)5
Education Level
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Slavin, Robert E.; Cheung, Alan C. K. – Journal of Education for Students Placed at Risk, 2017
Large-scale randomized studies provide the best means of evaluating practical, replicable approaches to improving educational outcomes. This article discusses the advantages, problems, and pitfalls of these evaluations, focusing on alternative methods of randomization, recruitment, ensuring high-quality implementation, dealing with attrition, and…
Descriptors: Randomized Controlled Trials, Evaluation Methods, Recruitment, Attrition (Research Studies)
Peer reviewed Peer reviewed
Direct linkDirect link
Argyrous, George – Evidence & Policy: A Journal of Research, Debate and Practice, 2015
This paper illustrates the use of a quality assessment tool for regression analysis. It is designed for non-specialist "consumers" of evidence, such as policy makers. The tool provides a series of questions such consumers of evidence can ask to interrogate regression analysis, and is illustrated with reference to a recent study published…
Descriptors: Evaluation Methods, Regression (Statistics), Evidence, Critical Thinking
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013
A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…
Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
McCoach, D. Betsy; Adelson, Jill L. – Gifted Child Quarterly, 2010
This article provides a conceptual introduction to the issues surrounding the analysis of clustered (nested) data. We define the intraclass correlation coefficient (ICC) and the design effect, and we explain their effect on the standard error. When the ICC is greater than 0, then the design effect is greater than 1. In such a scenario, the…
Descriptors: Statistical Significance, Error of Measurement, Correlation, Data Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Ryden, Jesper – International Journal of Mathematical Education in Science and Technology, 2008
Extreme-value statistics is often used to estimate so-called return values (actually related to quantiles) for environmental quantities like wind speed or wave height. A basic method for estimation is the method of block maxima which consists in partitioning observations in blocks, where maxima from each block could be considered independent.…
Descriptors: Simulation, Probability, Computation, Nonparametric Statistics
Peer reviewed Peer reviewed
Levin, Joel R.; Robinson, Daniel H. – Educational Researcher, 2000
Supports a two-step approach to the estimation and discussion of effect sizes, making a distinction between single-study decision-oriented research and multiple-study synthesis. Introduces and illustrates the concept of "conclusion coherence." (Author/SLD)
Descriptors: Effect Size, Evaluation Methods, Research Methodology, Sample Size
Ciechalski, Joseph C.; Pinkney, James W.; Weaver, Florence S. – 2002
This paper illustrates the use of the McNemar Test, using a hypothetical problem. The McNemar Test is a nonparametric statistical test that is a type of chi square test using dependent, rather than independent, samples to assess before-after designs in which each subject is used as his or her own control. Results of the McNemar test make it…
Descriptors: Attitude Change, Chi Square, Evaluation Methods, Nonparametric Statistics
Peer reviewed Peer reviewed
Posavac, E. J. – Evaluation and Program Planning, 1998
Misuses of null hypothesis significance testing are reviewed and alternative approaches are suggested for carrying out and reporting statistical tests that might be useful to program evaluators. Several themes, including the importance of respecting the magnitude of Type II errors and describing effect sizes in units stakeholders can understand,…
Descriptors: Effect Size, Evaluation Methods, Hypothesis Testing, Program Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Mallinckrodt, Brent; Abraham, W. Todd; Wei, Meifen; Russell, Daniel W. – Journal of Counseling Psychology, 2006
P. A. Frazier, A. P. Tix, and K. E. Barron (2004) highlighted a normal theory method popularized by R. M. Baron and D. A. Kenny (1986) for testing the statistical significance of indirect effects (i.e., mediator variables) in multiple regression contexts. However, simulation studies suggest that this method lacks statistical power relative to some…
Descriptors: Statistical Significance, Multiple Regression Analysis, Simulation, Evaluation Methods
Hanes, John C.; Hail, Michael – 1999
Many program evaluations involve some type of statistical testing to verify that the program has succeeded in accomplishing initially established goals. In many cases, this takes the form of null hypothesis significance testing (NHST) with t-tests, analysis of variance, or some form of the general linear model. This paper contends that, at least…
Descriptors: Change, Educational Indicators, Evaluation Methods, Hypothesis Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Marsh, Herbert W.; Hau, Kit-Tai; Wen, Zhonglin – Structural Equation Modeling, 2004
Goodness-of-fit (GOF) indexes provide "rules of thumb"?recommended cutoff values for assessing fit in structural equation modeling. Hu and Bentler (1999) proposed a more rigorous approach to evaluating decision rules based on GOF indexes and, on this basis, proposed new and more stringent cutoff values for many indexes. This article discusses…
Descriptors: Statistical Significance, Structural Equation Models, Evaluation Methods, Evaluation Research
Peer reviewed Peer reviewed
Direct linkDirect link
Atkins, David C.; Bedics, Jamie D.; Mcglinchey, Joseph B.; Beauchaine, Theodore P. – Journal of Consulting and Clinical Psychology, 2005
Measures of clinical significance are frequently used to evaluate client change during therapy. Several alternatives to the original method devised by N. S. Jacobson, W. C. Follette, & D. Revenstorf (1984) have been proposed, each purporting to increase accuracy. However, researchers have had little systematic guidance in choosing among…
Descriptors: Psychotherapy, Statistical Significance, Outcomes of Treatment, Behavior Change
Peer reviewed Peer reviewed
Suen, Hoi K.; Stevens, Robert J. – American Journal of Distance Education, 1993
Reports on a review of empirical research reports submitted over the past several years to the "American Journal of Distance Education" that identified common analytic problems and errors often overlooked by distance education researchers. Topics discussed include significance testing; assessment issues, including reliability and…
Descriptors: Distance Education, Educational Research, Evaluation Methods, Evaluation Problems
Malone, Linda C.; And Others – Microcomputers for Information Management, 1990
Discussion of automated indexing techniques focuses on ways to statistically document improvements in the development of an automated keywording system over time. The system developed by the Joint Chiefs of Staff to automate the storage, categorization, and retrieval of information from military exercises is explained, and performance measures are…
Descriptors: Artificial Intelligence, Automatic Indexing, Evaluation Methods, Graphs
Peer reviewed Peer reviewed
Cohen, Patricia – Evaluation and Program Planning: An International Journal, 1982
The various costs of Type I and Type II errors of inference from data are discussed. Six methods for minimizing each error type are presented, which may be employed even after data collection for Type I and which minimizes Type II errors by a study design and analytical means combination. (Author/CM)
Descriptors: Analysis of Variance, Data Analysis, Data Collection, Error of Measurement
Previous Page | Next Page ยป
Pages: 1  |  2