Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 23 |
| Since 2017 (last 10 years) | 563 |
| Since 2007 (last 20 years) | 1786 |
Descriptor
| Statistical Analysis | 2533 |
| Reliability | 1278 |
| Test Reliability | 1074 |
| Foreign Countries | 940 |
| Correlation | 633 |
| Test Validity | 630 |
| Factor Analysis | 559 |
| Validity | 508 |
| Questionnaires | 479 |
| Measures (Individuals) | 411 |
| Test Construction | 338 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 12 |
| Price, Gary G. | 12 |
| Tindal, Gerald | 10 |
| Lai, Cheng-Fei | 9 |
| Brennan, Robert L. | 8 |
| Raykov, Tenko | 8 |
| Feldt, Leonard S. | 7 |
| Livingston, Samuel A. | 7 |
| Park, Bitnara Jasmine | 7 |
| Irvin, P. Shawn | 6 |
| Anderson, Daniel | 5 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 34 |
| Practitioners | 21 |
| Teachers | 10 |
| Students | 8 |
| Administrators | 5 |
| Counselors | 2 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 204 |
| Nigeria | 57 |
| Jordan | 38 |
| Australia | 35 |
| Iran | 35 |
| Taiwan | 35 |
| Canada | 31 |
| China | 30 |
| Germany | 29 |
| California | 28 |
| United Kingdom | 25 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Peer reviewedConger, Anthony J. – Multivariate Behavioral Research, 1974
Two indices of profile reliability are shown to be equivalent in terms of the individual independent canonical composites; however, because of different weighting procedures, they yield different overall indices of profile reliability. A common formula is provided from which both indices can be derived. (Author)
Descriptors: Analysis of Variance, Correlation, Matrices, Measurement Techniques
Huynh, Huynh – 1977
Three techniques for estimating Kuder Richardson reliability (KR20) coefficients for incomplete data are contrasted. The methods are: (1) Henderson's Method 1 (analysis of variance, or ANOVA); (2) Henderson's Method 3 (FITCO); and (3) Koch's method of symmetric sums (SYSUM). A Monte Carlo simulation was used to assess the precision of the three…
Descriptors: Analysis of Variance, Comparative Analysis, Mathematical Models, Monte Carlo Methods
Peer reviewedOakland, Thomas; And Others – Journal of School Psychology, 1975
Interrater differences in scoring actual WISC protocols were determined for three different IQ levels. In general, differences among the 94 examiners tended to be within an acceptable range as established by the standard error of measurement; variance on two Verbal subtests occasionally exceeded their corresponding standard error of measurement.…
Descriptors: Evaluation Criteria, Examiners, Intelligence Tests, Measurement
Baumgartner, Ted A. – Res Quart AAHPER, 1969
Descriptors: Measurement, Physical Education, Physical Examinations, Physical Fitness
Marcus, Robert F.; And Others – 1980
Time sampled observations of the cooperative behavior of two samples of 31 preschool children were analyzed for stability (that is, short term reliability of behavior) over a 2-month period using Cronbach's generalizability coefficient. Observations were made during free play periods on nursery school settings. The observation schedule required…
Descriptors: Behavioral Science Research, Cooperation, Observation, Play
Naccarato, Richard W.; Gillmore, Gerald M. – 1976
This paper involves an application of generalizability theory in assessing the dependability of a foreign language placement exam. The French Cloze test was administered to students within five levels of French classes and the results were scored by four different raters. Three specific generalizability coefficients are discussed along with…
Descriptors: College Students, French, Higher Education, Measurement Techniques
Smith, Sandra E.; And Others – 1978
A correction of the standard F-ratio for unreliability of the dependent measure has recently been proposed by Winne; the rationale is analogous to that of correcting a correlation for attenuation. However, there are two problems associated with Winne's correction of which potential users should be aware. First, the corrected statistic, F*, has…
Descriptors: Analysis of Variance, Hypothesis Testing, Reliability, Research Problems
Moyer, Judith E.; Fishbein, Ronald L. – 1977
The problem that this research addressed was one of decision making. Given three sets of criterion-referenced tests which were designed to be parallel in content, would a traditional reliability coefficient produce different decisions about the reliability of those tests than would kappa? The procedure used collected statewide results on 136 test…
Descriptors: Analysis of Variance, Comparative Analysis, Criterion Referenced Tests, Measurement Techniques
Rowley, Glenn – 1975
The use of the intraclass correlation in determining reliability is discussed and shown to be both appropriate and simple to use in the case of an observational measure, provided that observations are made on at least two occasions. The interpretation of such coefficients is explained in terms of generalizability theory, and real data are used to…
Descriptors: Behavior, Classroom Observation Techniques, Correlation, Evaluation Methods
Friedman, Martin R.; And Others – 1974
The present study attempted to modify the latencies and errors of adult women on the Matching Familiar Figures test (MFF) by systematically altering task instructions. The results indicated that latencies of impulsive subjects could be altered with "reflective" instructions, while the latencies of reflective subjects were resistent to…
Descriptors: Adults, Cognitive Processes, Females, Individual Differences
Stocking, Martha; And Others – 1973
For two tests measuring the same trait, the program, BIV20, equates the scores using the two True score distributions estimated by the univariate method 20 program (see Wingersky, Lees, Lennon, and Lord, 1969) and, with these equated true scores and their distributions, estimates the bivariate distribution scores and the relative efficiency of the…
Descriptors: Computer Programs, Equated Scores, Statistical Analysis, Test Reliability
Lord, Frederic M. – 1972
The stepped-up reliability coefficient does not have the same standard error as an ordinary correlation coefficient. Fisher's Z -transformation should not be applied to it. Appropriate procedures are suggested. (Author)
Descriptors: Analysis of Variance, Mathematical Models, Research, Research Reports
Mandeville, Garrett K. – 1973
An investigation is conducted which presents extensive Monte Carlo results which indicate the conditions under which a procedure using the F distribution can be used to study the robustness of the confidence interval procedures for small samples. A review of the literature is presented. Procedure uses a binary data matrix. Results indicate that…
Descriptors: Confidence Testing, Item Sampling, Literature Reviews, Monte Carlo Methods
Rafacz, Bernard A.; Foley, Paul P. – 1973
A study was conducted by the Navy to develop and evaluate human performance reliability estimates for electronic maintenance. Data were collected using the Personnel Identification Information Forms, the Technical Proficiency Checkout Form, and the Job Performance Questionnaire. On the basis of the total number of uncommonly effective and the…
Descriptors: Military Personnel, Norms, Performance Criteria, Predictor Variables
PDF pending restorationKristof, Walter – 1973
This study in parametric test theory deals with the statistics of reliability estimation when scores on two parts of a test follow a binormal distribution with equal (case 1) or unequal (case 2) expectations. In each case biased maximum-likelihood estimators of reliability are obtained and converted into unbiased estimators. Sampling distributions…
Descriptors: Expectation, Research Reports, Sample Size, Sampling


