Kennedy, Charlotte A. – 2002
The use of and emphasis on statistical significance testing has pervaded educational and behavioral research for many decades in spite of criticism by prominent researchers in this field. Much of the controversy is caused by lack of understanding or misinterpretations. This paper reviews criticisms of statistical significance testing and discusses…
Descriptors: Educational Research, Hypothesis Testing, Research Methodology, Sampling
Yang, Wen-Ling; Dorans, Neil J.; Tateneni, Krishna – 2002
Scores on the multiple-choice sections of alternate forms are equated through anchor-test equating for the Advanced Placement Program (AP) examinations. There is no linkage of free-response sections since different free-response items are given yearly. However, the free-response and multiple-choice sections are combined to produce a composite.…
Descriptors: Cutting Scores, Equated Scores, Multiple Choice Tests, Sample Size
Fan, Xitao; Chen, Michael – 1999
It is erroneous to extend or generalize the inter-rater reliability coefficient estimated from only a (small) proportion of the sample to the rest of the sample data where only one rater is used for scoring, although such generalization is often made implicitly in practice. It is shown that if inter-rater reliability estimate from part of a sample…
Descriptors: Estimation (Mathematics), Generalizability Theory, Interrater Reliability, Sample Size
Gruber, Kerry; Rohr, Carol L.; Perona, Mia; Fondelier, Sharon E. – 1999
This codebook contains the layout and descriptive information for survey and sampling variables for the public-use version of the 1993-94 Schools and Staffing Survey (SASS). Information in this volume allows the user to access the data file. This volume opens with a discussion of created variables and some user notes. Four appendixes contain the…
Descriptors: Coding, Elementary Secondary Education, National Surveys, Research Utilization
Peer reviewed: Coleman, Edmund B. – Journal of Reading Behavior, 1972
Points out that one flaw in statistical foundations of much reading research is that researchers neglect to consider whether their findings could be replicated using a different sample of materials. Also shows that many topics in reading research do not quite fit the model for the null hypothesis. (TO)
Descriptors: Reading Research, Research Criteria, Research Methodology, Research Problems
Peer reviewed: Howell, John F.; Games, Paul A. – Journal of Experimental Education, 1973
The present study contrasted the robustness of the WSD and the F test to heterogeneity of variances. (Author)
Descriptors: Analysis of Variance, Computers, Educational Experiments, Educational Research
Peer reviewed: Bergman, L. R. – Human Development, 1972
Problems in making inferences from a sample to a population and from one cohort to other cohorts are discussed. It is concluded that in most cases a longitudinal design using repeated measures is preferable to an independent samples design. (Author)
Descriptors: Longitudinal Studies, Measurement Objectives, Research Design, Research Methodology
Peer reviewed: Shontz, Franklin C. – Journal of Counseling Psychology, 1972
The approach to outcome research described here has the double merit of preserving individuality and permitting pooling of data for overall analysis of results. (Author)
Descriptors: Counseling Effectiveness, Individual Development, Individualism, Item Sampling
Steinhorst, R. Kirk; Miller, C. Dean – Educational and Psychological Measurement, 1969
Descriptors: Analysis of Variance, Classification, Education, Psychology
Sirotnik, Ken – Journal of Educational Measurement, 1970
The assumption that an examinee's response to an item is independent of the context in which it occurs is tested. (PR)
Descriptors: Attitude Measures, Item Sampling, Measurement Techniques, Research Methodology
Pedersen, Frank A.; Bell, Richard Q. – Developmental Psychology, 1970
Despite the precautions in subject selection many of the usual sex differences appeared, including boys' higher level of aggression toward peers. (MH)
Descriptors: Aggression, Behavior Patterns, Behavioral Science Research, Preschool Children
Peer reviewed: Elster, Richard S.; Dunnette, Marvin D. – Educational and Psychological Measurement, 1971
Descriptors: Hypothesis Testing, Measurement Techniques, Probability, Sampling
Peer reviewed: Sanjur, D.; And Others – American Journal of Clinical Nutrition, 1971
Descriptors: Dietetics, Eating Habits, Food Standards, Nutrition
Hsu, Tse-Chi; Feldt, Leonard S. – American Educational Research Journal, 1969
Descriptors: Analysis of Variance, Data Analysis, Ratios (Mathematics), Sampling
Gumm, George; Chambers, Gurney – Phi Delta Kappan, 1970
A previous article, EA 500 335, is criticized for claiming the applicability of the CRAM model without research documentation. (MF)
Descriptors: Feedback, Models, Performance Criteria, Problem Solving


