Rowley, Glenn L. – American Educational Research Journal, 1976 (Peer reviewed)
A rationale for, and a relatively simple method of, estimating the reliability of an observational method are described. A separate "reliability study," while not required, may be employed when examining data. Empirical findings show that high frequencies of occurrence are not necessary prerequisites for the reliable measurement of behavior.…
Descriptors: Classroom Observation Techniques, Observation, Test Reliability
Lord, Frederic M. – Journal of Educational Measurement, 1974 (Peer reviewed)
Descriptors: Statistical Analysis, Test Reliability, Transformations (Mathematics)
Zimmerman, Donald W. – Educ Psychol Meas, 1970
Results of this study indicate that the correlation between half-test scores, taken over repeated splits, over persons, and over repeated testings that yield different sets of observed scores, is given by Kuder-Richardson Formula 21. (RF)
Descriptors: Statistical Analysis, Statistics, Test Reliability, Tests
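For readers unfamiliar with the formula named in the abstract above, the following is a minimal sketch of the standard textbook form of Kuder-Richardson Formula 21, not a reproduction of the article's derivation. The function name `kr21`, the sample scores, and the use of the population variance are assumptions made for illustration.

```python
from statistics import mean, pvariance


def kr21(total_scores, n_items):
    """Estimate reliability with Kuder-Richardson Formula 21.

    total_scores: each person's total score on a test of n_items
    dichotomously scored (0/1) items.
    Assumption: the population variance of the totals is used.
    """
    if n_items < 2:
        raise ValueError("KR-21 needs at least two items")
    m = mean(total_scores)
    var = pvariance(total_scores)
    if var == 0:
        raise ValueError("total scores have zero variance")
    # KR-21 = k/(k-1) * (1 - M(k - M) / (k * s^2))
    return (n_items / (n_items - 1)) * (1 - m * (n_items - m) / (n_items * var))


# Hypothetical total scores for five people on a 10-item test.
scores = [2, 4, 6, 8, 10]
print(round(kr21(scores, 10), 3))  # prints 0.778
```

KR-21 needs only the mean and variance of total scores, which is why it is often used when item-level data are unavailable.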
Philip, Alistair E. – British Journal of Psychology, 1970 (Peer reviewed)
Descriptors: Analysis of Variance, Anxiety, Test Reliability
Pepin, Arthur C. – Clearing House, 1971 (Peer reviewed)
Descriptors: Educational Testing, Intelligence Tests, Test Reliability
Mandel, Robert; McLeod, Philip – Exceptional Children, 1970 (Peer reviewed)
Descriptors: Intelligence Tests, Socioeconomic Status, Test Reliability
Kroll, Walter – Res Quart AAHPER, 1970
Descriptors: Error Patterns, Muscular Strength, Test Reliability
Wilcox, Rand R. – Educational and Psychological Measurement, 1981 (Peer reviewed)
This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)
Descriptors: Criterion Referenced Tests, Scoring, Test Reliability
Uebersax, John S. – Educational and Psychological Measurement, 1982 (Peer reviewed)
A more general method for calculating the Kappa measure of nominal rating agreement among multiple raters is presented. It can be used across a broad range of rating designs, including those in which raters vary with respect to their base rates and how many subjects they rate in common. (Author/BW)
Descriptors: Mathematical Formulas, Statistical Significance, Test Reliability
Feldt, Leonard S.; Charter, Richard A. – Measurement and Evaluation in Counseling and Development, 2003 (Peer reviewed)
Evaluating a test's reliability often requires dividing it into 3 or more unequal parts, which causes violation of the tau equivalence assumption of Cronbach's alpha. This article presents a criterion for abandoning alpha and an approach for computing a more appropriate estimate of reliability, the Gilmer-Feldt coefficient. (Author)
Descriptors: Counseling, Evaluation Methods, Psychometrics, Test Reliability
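As background to the abstract above, here is a minimal sketch of Cronbach's alpha computed from part scores, using the standard textbook formula. It does not implement the Gilmer-Feldt coefficient or the article's criterion for abandoning alpha; the function name `cronbach_alpha` and the sample data are assumptions made for illustration.

```python
from statistics import pvariance


def cronbach_alpha(part_scores):
    """Compute Cronbach's alpha.

    part_scores: one row per person, each row holding that person's
    scores on the k parts (or items) of the test.
    """
    k = len(part_scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two parts")
    # Variance of each part across persons (columns of the table).
    part_vars = [pvariance(col) for col in zip(*part_scores)]
    # Variance of each person's total score.
    total_var = pvariance([sum(row) for row in part_scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    # alpha = k/(k-1) * (1 - sum of part variances / total variance)
    return (k / (k - 1)) * (1 - sum(part_vars) / total_var)


# Hypothetical data: three persons, three parts that rank persons identically.
rows = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
print(round(cronbach_alpha(rows), 3))  # prints 1.0
```

In this example the parts have equal variances and are perfectly correlated, so alpha reaches 1. When parts differ in length, alpha's tau-equivalence assumption is violated, which is the situation the abstract addresses.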
Blau, Gary J. – Journal of Vocational Behavior, 1988 (Peer reviewed)
Examined the reliability and validity of a career commitment measure using employees (N=266) of newspaper and insurance companies. Results showed career commitment could be reliably measured and was operationally distinct from job involvement and organizational commitment. Discusses findings in terms of meaning of career commitment. (Author/ABL)
Descriptors: Careers, Employees, Test Reliability, Test Validity
van der Linden, Wim J.; Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1988 (Peer reviewed)
Gulliksen's matched random subtests method is a graphical method to split a test into parallel test halves, allowing maximization of coefficient alpha as a lower bound to the classical test reliability coefficient. This problem is formulated as a zero-one programming problem solvable by algorithms that already exist. (TJH)
Descriptors: Algorithms, Equations (Mathematics), Programing, Test Reliability
Traub, Ross E.; Rowley, Glenn L. – Educational Measurement: Issues and Practice, 1991 (Peer reviewed)
The idea of test consistency is illustrated, with reference to two sets of test scores. A mathematical model is used to explain the relative consistency and relative inconsistency of measurements, and a means of indexing reliability is derived using the model. Practical aspects of estimating reliability are considered. (TJH)
Descriptors: Mathematical Models, Test Reliability, True Scores
Zinbarg, Richard E.; Revelle, William; Yovel, Iftah; Li, Wen – Psychometrika, 2005
We make theoretical comparisons among five coefficients--Cronbach's α, Revelle's β, McDonald's ω_h, and two alternative conceptualizations of reliability. Though many end users and psychometricians alike may not distinguish among these five coefficients, we demonstrate formally their nonequivalence. Specifically, whereas…
Descriptors: Psychometrics, Test Reliability, Rating Scales, Scores
Ashvind Nand Singh – ProQuest LLC, 2008
Due to the relative inability of individuals with intellectual disabilities (ID) to provide an accurate and reliable self-report, assessment in this population is more difficult than with individuals in the general population. As such, assessment procedures must be adjusted to compensate for the relative lack of information that the individual can…
Descriptors: Test Items, Item Analysis, Test Construction, Behavior Rating Scales

