Wildman, Beth G.; And Others – Child Development, 1975 (peer reviewed)
Observer reliability and number of behaviors rated were compared following two types of observer training: (1) training of observers by one individual, and (2) self-training, or each pair of observers training itself. Results suggest that more adequate standards for the training of observers and the reporting of observer reliability be adopted by…
Descriptors: Behavior Rating Scales, Observation, Reliability, Training Methods
Joe, George W.; Woodward, J. Arthur – Multivariate Behavioral Research, 1975 (peer reviewed)
Descriptors: Correlation, Matrices, Sampling, Statistical Analysis
Zimmerman, Donald W. – Educational and Psychological Measurement, 1969
Descriptors: Item Analysis, Mathematical Models, Measurement, Test Reliability
Wasserman, Miriam – Urban Review, 1969
Descriptors: Disadvantaged Youth, Reading Tests, Test Reliability, Testing
Carron, Albert V.; Bailey, Donald A. – Perceptual and Motor Skills, 1969
Descriptors: Ability, Individual Differences, Motor Development, Reliability
Marshall, J. Laird; Haertel, Edward H. – 1975
For classical, norm-referenced test reliability, Cronbach's alpha has been shown to be equal to the mean of all possible split-half Pearson product-moment correlation coefficients, adjusted by the Spearman-Brown prophecy formula. For criterion-referenced test reliability, in an analogous vein, this paper provides the rationale behind, the analysis…
Descriptors: Criterion Referenced Tests, Statistical Analysis, Test Reliability
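The abstract above mentions Cronbach's alpha and the Spearman-Brown prophecy formula for classical, norm-referenced tests. As a minimal sketch of those two textbook quantities only (not the criterion-referenced coefficient the paper develops), the code below computes alpha from a small, made-up score matrix. The function names and the example data are illustrative assumptions, and sample variances (n - 1 denominator) are assumed throughout.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a score matrix.

    scores: list of rows, one per examinee, each holding k item scores.
    Uses sample variances (n - 1 denominator) for items and totals.
    """
    k = len(scores[0])
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


def spearman_brown(r, factor=2):
    """Projected reliability when a test is lengthened by `factor`.

    With factor=2 this steps a half-test correlation up to full length.
    """
    return factor * r / (1 + (factor - 1) * r)


# Hypothetical data: 4 examinees, 3 items (illustration only).
example_scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [3, 3, 4],
]

alpha = cronbach_alpha(example_scores)
stepped_up = spearman_brown(0.5)
print(round(alpha, 4), round(stepped_up, 4))
```

Keeping the two functions separate makes it easy to compare alpha against stepped-up split-half correlations on the same data.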
Garvin, Alfred D. – 1976
Three successively simpler formulas for approximating the standard error of measurement were derived by applying successively more simplifying assumptions to the standard formula based on the standard deviation and the Kuder-Richardson formula 20 estimate of reliability. The accuracy of each of these three formulas, with respect to the standard…
Descriptors: Error of Measurement, Statistical Analysis, Test Reliability
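The abstract above refers to the standard formula for the standard error of measurement, based on the standard deviation and the Kuder-Richardson formula 20 (KR-20) reliability estimate. The sketch below illustrates only that standard formula, SEM = SD * sqrt(1 - reliability); the three simplified approximations derived in the paper are not reproduced here. The function names and the response matrix are hypothetical, and the population variance (n denominator) is assumed for both KR-20 and the standard deviation.

```python
from math import sqrt
from statistics import pvariance


def kr20(responses):
    """KR-20 reliability for dichotomous (0/1) item responses.

    responses: list of rows, one per examinee, each holding k item scores.
    Assumes the population variance of total scores.
    """
    n = len(responses)
    k = len(responses[0])
    pq_sum = 0.0
    for i in range(k):
        p = sum(row[i] for row in responses) / n  # proportion correct
        pq_sum += p * (1 - p)
    total_variance = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - pq_sum / total_variance)


def standard_error_of_measurement(sd, reliability):
    """Standard formula: SEM = SD * sqrt(1 - reliability)."""
    return sd * sqrt(1 - reliability)


# Hypothetical data: 5 examinees, 4 right/wrong items (illustration only).
example_responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

reliability = kr20(example_responses)
total_sd = sqrt(pvariance([sum(row) for row in example_responses]))
sem = standard_error_of_measurement(total_sd, reliability)
print(round(reliability, 4), round(sem, 4))
```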
Fleiss, Joseph L.; Cicchetti, Domenic V. – Applied Psychological Measurement, 1978 (peer reviewed)
The accuracy of the large sample standard error of weighted kappa appropriate to the non-null case was studied by computer simulation for the hypothesis that two independently derived estimates of weighted kappa are equal, and for setting confidence limits around a single value of weighted kappa. (Author/CTM)
Descriptors: Correlation, Hypothesis Testing, Nonparametric Statistics, Reliability
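For readers unfamiliar with the statistic discussed in the abstract above, the following sketch computes a weighted kappa point estimate from a square agreement table. It does not implement the large-sample standard error that the study evaluates. The function name, the linear and quadratic disagreement weights, and the example table are assumptions chosen for illustration.

```python
def weighted_kappa(table, weighting="linear"):
    """Weighted kappa for a k x k table of counts (rater A rows, rater B columns).

    Uses disagreement weights w_ij = (|i - j| / (k - 1)) ** p, with p = 1
    for linear and p = 2 for quadratic weighting, so that
    kappa_w = 1 - (weighted observed disagreement / weighted chance disagreement).
    """
    if weighting not in ("linear", "quadratic"):
        raise ValueError("weighting must be 'linear' or 'quadratic'")
    power = 1 if weighting == "linear" else 2
    k = len(table)
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]

    observed = 0.0
    expected = 0.0
    for i in range(k):
        for j in range(k):
            weight = (abs(i - j) / (k - 1)) ** power
            observed += weight * table[i][j] / n
            expected += weight * row_totals[i] * col_totals[j] / n ** 2
    return 1 - observed / expected


# Hypothetical 2 x 2 table; with two categories weighted kappa
# reduces to ordinary (unweighted) Cohen's kappa.
example_table = [
    [20, 5],
    [10, 15],
]

kappa = weighted_kappa(example_table)
print(round(kappa, 4))
```

Disagreement weights are used here rather than agreement weights; the two formulations give the same coefficient when the agreement weights are one minus the disagreement weights.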
Wackerly, D. D.; And Others – Psychometrika, 1978 (peer reviewed)
Two designs for comparing a judge's ratings with a known standard are compared: (1) Subjects are categorized into classes, with no knowledge of the size of either group; and (2) the judge is told the actual number allowed in each class. The probability distribution of the total number of correct choices is developed for each case, and their power…
Descriptors: Measurement Techniques, Probability, Rating Scales, Test Reliability
Kaiser, Henry F.; Michael, William B. – Educational and Psychological Measurement, 1977 (peer reviewed)
A formula is derived for ascertaining factor scores for the factor analytic method: Little Jiffy, Mark IV. This formula is then employed to derive a second formula giving an exact determination of the generalized Kuder-Richardson estimate of the reliability of scores on a Little Jiffy factor. (Author/JKS)
Descriptors: Factor Analysis, Reliability, Scores, Scoring Formulas
Wood, Terry M.; Safrit, Margaret J. – Research Quarterly for Exercise and Sport, 1987 (peer reviewed)
A comparison of three multivariate models (canonical reliability model, maximum generalizability model, canonical correlation model) for estimating test battery reliability indicated that the maximum generalizability model showed the least degree of bias, smallest errors in estimation, and the greatest relative efficiency across all experimental…
Descriptors: Comparative Analysis, Models, Research Methodology, Test Reliability
Subkoviak, Michael J. – Journal of Educational Measurement, 1988 (peer reviewed)
Current methods for obtaining reliability indices for mastery tests can be laborious. This paper offers practitioners tables from which agreement and kappa coefficients can be read directly and provides criteria for acceptable values of agreement and kappa coefficients. (TJH)
Descriptors: Mastery Tests, Statistical Analysis, Test Reliability, Testing
Griggs, Richard A.; Ransdell, Sarah E. – Teaching of Psychology, 1987 (peer reviewed)
States that taking a high school psychology course did not improve the performance of college students in an introductory psychology class on a modified version of Vaughan's misconceptions test (Test of Common Beliefs). Concludes that while college experience did lead to some improvement, comparison with other studies indicates that perhaps the…
Descriptors: High Schools, Higher Education, Psychology, Test Reliability
Baldasare, John; And Others – Journal of Visual Impairment and Blindness, 1986 (peer reviewed)
An instrument for assessing the visual skills required for reading among individuals with macular loss is presented, along with a study involving 48 adults to evaluate its reliability. Data are included to verify claims regarding aspects of reading that are particularly problematic for individuals with macular loss; rehabilitation techniques are suggested.…
Descriptors: Adults, Partial Vision, Reading Tests, Test Reliability
Del Greco, Linda; And Others – Adolescence, 1986 (peer reviewed)
Examined the reliability of the 30-item Modified Rathus Assertiveness Schedule (MRAS) using the test-retest method over a three-week period. The MRAS yielded correlations of .74 using the Pearson product-moment and Spearman-Brown coefficients. Correlations for males were .77 and .72. For females, both correlations were .72.…
Descriptors: Adolescents, Assertiveness, Correlation, Sex Differences


