Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 411 |
| Since 2017 (last 10 years) | 914 |
| Since 2007 (last 20 years) | 1965 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Cheshier, Stephen R. – Engineering Education, 1975
Describes a simplified method for converting raw scores to standard scores and transforming them to "T-scores" for easy comparison of performance. Obtaining letter grades from T-scores is discussed. A reading list is included. (GH)
Descriptors: Achievement Rating, Error of Measurement, Evaluation Methods, Grades (Scholastic)
Williams, Rick L.; And Others – 1981
The National Assessment of Educational Progress in-school sampling design is a three-stage stratified design. Stratification variables include region, size of community and socioeconomic status. The three levels of sample selection are Primary Sampling Units (PSUs), schools and students. In general, two and sometimes three PSUs are selected from…
Descriptors: Educational Assessment, Elementary Secondary Education, Error of Measurement, National Competency Tests
Reid, Jerry B.; Roberts, Dennis M. – 1978
Comparisons of corresponding values of phi and kappa coefficients were made for 270 instances of data generated by a Monte Carlo technique to simulate a test-retest situation. Data were generated for distributions with the same mean but three different levels of standard deviation, standard error of measurement and cutting score. Ten samples of…
Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores
Divgi, D. R. – 1978
One aim of criterion-referenced testing is to classify an examinee without reference to a norm group; therefore, any statements about the dependability of such classification ought to be group-independent also. A population-independent index is proposed in terms of the probability of incorrect classification near the cutoff true score. The…
Descriptors: Criterion Referenced Tests, Cutting Scores, Difficulty Level, Error of Measurement
Divgi, D. R. – 1980
Because it is difficult to ascertain the dimensionality of a test composed of binary items through the use of factor analysis alone, a method is proposed that combines item characteristic curve (ICC) theory with factor analysis. Factor structure of tetrachoric correlations is distorted by non-normal distribution of ability. Item characteristics…
Descriptors: Achievement Tests, Error of Measurement, Factor Analysis, Factor Structure
PDF pending restorationCalkins, Dick S. – 1974
The mathematical derivation of the statistics used for inference in some linear models assumes that the values of the independent variables are measured without error. This assumption is often disregarded when these models are utilized in research. This study is an investigation of the consequences of the violation of this assumption for one…
Descriptors: Analysis of Covariance, Computer Programs, Error of Measurement, Error Patterns
Stanley, Julian C.; Livingston, Samuel A. – 1971
Besides the ubiquitous Pearson product-moment r, there are a number of other measures of relationship that are attenuated by errors of measurement and for which the relationship between true measures can be estimated. Among these are the correlation ratio (eta squared), Kelley's unbiased correlation ratio (epsilon squared), Hays' omega squared,…
Descriptors: Analysis of Variance, Cluster Grouping, Correlation, Data Analysis
Werts, Charles E.; Linn, Robert L. – 1972
The Werts-Linn procedure for dealing with categorical errors of measurement in "Comments on Boyle's 'Path Analysis and Ordinal Data'" in The American Journal of Sociology, volume 76, number 6, May 1971, is shown to be inappropriate to the problem of ordered categories. (For related document, see TM 002 301.) (DB)
Descriptors: Data Analysis, Error of Measurement, Goodness of Fit, Mathematical Models
Blai, Boris, Jr. – 1971
Statistics are an essential tool for making proper judgement decisions. It is concerned with probability distribution models, testing of hypotheses, significance tests and other means of determining the correctness of deductions and the most likely outcome of decisions. Measures of central tendency include the mean, median and mode. A second…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Hypothesis Testing
Peer reviewedHelvey, T. Charles – Journal of Experimental Education, 1975
This article describes a new testing method which can be used to screen learning-deficient children fast, reliably, and inexpensively out of any population of public school systems. (Editor)
Descriptors: Bayesian Statistics, Electroencephalography, Error of Measurement, Intelligence Tests
Peer reviewedStokes, Elizabeth H.; And Others – Educational and Psychological Measurement, 1978
The Wechsler Intelligence Scale for Children, and the revised form of that measure, were administered to a sample of sixth grade pupils. Although the correlation between measures was high, scores on the revised form were significantly lower. (JKS)
Descriptors: Comparative Testing, Correlation, Error of Measurement, Grade 6
Peer reviewedWright, Benjamin D. – Journal of Educational Measurement, 1977
Statements made in a previous article of this journal concerning the Rasch latent trait test model are questioned. Methods of estimation, necessary sample sizes, several formuli, and the general usefulness of the Rasch model are discussed. (JKS)
Descriptors: Computers, Error of Measurement, Item Analysis, Mathematical Models
Peer reviewedBrennan, Robert L.; Kane, Michael T. – Psychometrika, 1977
Using the assumption of randomly parallel tests and concepts from generalizability theory, three signal/noise ratios for domain-referenced tests are developed, discussed, and compared. The three ratios have the same noise but different signals depending upon the kind of decision to be made as a result of measurement. (Author/JKS)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Error of Measurement, Mathematical Models
Peer reviewedFiske, Donald W. – Educational and Psychological Measurement, 1987
This paper analyzes ways in which the methods used to measure psychological constructs contribute invalidity to measurements. The analysis distinguishes between inadequacies stemming from the behaviors selected for measurement and the harmful effects generated by the measurement operations themselves. (BS)
Descriptors: Behavioral Science Research, Construct Validity, Data Analysis, Error of Measurement
Peer reviewedNorcini, John J. – Journal of Educational Measurement, 1987
Answer keys for physician and teacher licensing examinations were studied. The impact of variability on total errors of measurement was examined for answer keys constructed using the aggregate method. Results indicated that, in some cases, scorers contributed to a sizable reduction in measurement error. (Author/GDC)
Descriptors: Adults, Answer Keys, Error of Measurement, Evaluators


