Baranowski, Tom – Journal of School Health, 1985 (Peer reviewed)
The most commonly used method of collecting outcome data in health education programs is self-report, which produces a variety of measurement errors. A model is proposed to systematically identify major influences for accuracy of self-reported health behavior. Methodologic studies are described, and eight steps to increase accuracy are proposed.…
Descriptors: Error of Measurement, Health Behavior, Health Education, Research Methodology
Basch, Charles E.; Gold, Robert S. – Journal of School Health, 1985 (Peer reviewed)
Reliability guides research design and is used as a standard for judging the credibility of findings and inferences. Using data gathered in a school health education curriculum evaluation as an example, possible errors in hypothesis testing are examined. Appropriateness of internal consistency as a measure of reliability is discussed and…
Descriptors: Cognitive Tests, Elementary Secondary Education, Error of Measurement, Health Education
Modjeski, Richard B.; Michael, William B. – Educational and Psychological Measurement, 1983 (Peer reviewed)
Two tests of critical thinking (the Cornell Critical Thinking Test and the Watson-Glaser Critical Thinking Appraisal) were evaluated by a panel of psychologists relative to the validity, reliability, and error of measurement standards stated in the "Standards for Educational and Psychological Tests," 1974. (PN)
Descriptors: Cognitive Tests, Critical Thinking, Error of Measurement, Evaluation Criteria
De Santi, Roger J.; Sullivan, Vicki Gallo – Journal of Research and Development in Education, 1985 (Peer reviewed)
Cloze-based evaluations of reading comprehension leave room for a greater amount of subjectivity in rating reader response. A study was designed to ascertain the nature of potential subjectivity within a single rater's ratings of cloze-based assessments of reading comprehension. (DF)
Descriptors: Cloze Procedure, Elementary Secondary Education, Error of Measurement, Interrater Reliability
Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1984 (Peer reviewed)
Three types of test were compared: a completion test, a matching test, and a multiple-choice test. The completion test was more reliable than the matching test, and the matching test was more reliable than the multiple-choice test. (Author/BW)
Descriptors: Comparative Analysis, Error of Measurement, Higher Education, Mathematical Models
Miller, M. David – 2002
In 1994 the State Collaborative on Assessment and Student Standards of the Council of Chief State School Officers began a study to examine the generalizability of performance-based assessments (PBAs) for state-mandated assessment programs. The intent was to examine the major sources of error associated with PBAs and the generalizability and…
Descriptors: Elementary Secondary Education, Error of Measurement, Generalizability Theory, Performance Based Assessment
Zwick, Rebecca; Thayer, Dorothy T. – 1994
Several recent studies have investigated the application of statistical inference procedures to the analysis of differential item functioning (DIF) in test items that are scored on an ordinal scale. Mantel's extension of the Mantel-Haenszel test is a possible hypothesis-testing method for this purpose. The development of descriptive statistics for…
Descriptors: Error of Measurement, Evaluation Methods, Hypothesis Testing, Item Bias
DeMars, Christine E. – 2002
When students are nested within course sections, the assumption of independence of residuals is unlikely to be met, unless the course section is explicitly included in the model. Hierarchical linear modeling (HLM) allows for modeling the course section as a random effect, leading to more accurate standard errors. In this study, students chose one…
Descriptors: College Entrance Examinations, College Students, Course Organization, Error of Measurement
Blais, Jean-Guy; Raiche, Gilles – 2002
This paper examines some characteristics of the statistics associated with the sampling distribution of the proficiency level estimate when the Rasch model is used. These characteristics allow the judgment of the meaning to be given to the proficiency level estimate obtained in adaptive testing, and as a consequence, they can illustrate the…
Descriptors: Ability, Adaptive Testing, Error of Measurement, Estimation (Mathematics)
Harris, Chester W. – Journal of Educational Measurement, 1973 (Peer reviewed)
A brief note presenting algebraically equivalent formulas for the variances of three error types. (Author)
Descriptors: Algebra, Analysis of Covariance, Analysis of Variance, Error of Measurement
Kristof, Walter – Psychometrika, 1973 (Peer reviewed)
Paper is concerned with the hypothesis that two variables have a perfect disattenuated correlation, hence measure the same trait except for errors of measurement. (Author/RK)
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Mathematical Models
Werts, Charles E.; Linn, Robert L. – Educational and Psychological Measurement, 1972 (Peer reviewed)
The general problem of using group status to estimate true scores given multiple measures is considered in this paper. (Authors)
Descriptors: Error of Measurement, Group Status, Mathematical Applications, Multiple Regression Analysis
Mislevy, Robert J.; Bock, R. Darrell – Educational and Psychological Measurement, 1982 (Peer reviewed)
An alternative biweight estimator based on Tukey's is examined in which (1) test disturbances are not assumed to be the same for all subjects, (2) each response is utilized proportional to its value, and (3) the biweight and maximum likelihood estimate agree when no disturbances are present. Smaller mean-squared errors are shown. (Author/CM)
Descriptors: Error of Measurement, Estimation (Mathematics), Guessing (Tests), Latent Trait Theory
Westermann, Rainer; Hager, Willi – Perceptual and Motor Skills, 1983 (Peer reviewed)
Two psychological experiments--Anderson and Shanteau (1970), Berkowitz and LePage (1967)--are reanalyzed to present the problem of the relative importance of low Type 1 error probability and high power when answering a research question by testing several statistical hypotheses. (Author/PN)
Descriptors: Error of Measurement, Hypothesis Testing, Power (Statistics), Research Design
Winne, Philip H.; Belfry, M. Joan – Journal of Educational Measurement, 1982 (Peer reviewed)
This review of issues about correcting for attenuation concludes that the basic difficulty lies in being able to identify and equate sources of variance in estimates of validity and reliability. Recommendations are proposed for cautious use of correction for attenuation. (Author/CM)
Descriptors: Correlation, Error of Measurement, Research Methodology, Statistical Analysis


