Publication Date
| In 2026 | 2 |
| Since 2025 | 462 |
| Since 2022 (last 5 years) | 1941 |
| Since 2017 (last 10 years) | 4513 |
| Since 2007 (last 20 years) | 6998 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10004 |
| Test Construction | 4369 |
| Foreign Countries | 3831 |
| Psychometrics | 2428 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 838 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 162 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Peer reviewedHolmes, Susan E. – Journal of Educational Measurement, 1982
Two tests were created from a standardized reading achievement test and vertically equated using a sample of third and fourth grade students. Based on differences in ability estimates for the same student, the Rasch model did not provide a satisfactory means of vertical equating. (Author/CM)
Descriptors: Achievement Tests, Elementary Education, Equated Scores, Latent Trait Theory
Peer reviewedPassmore, David Lynn; And Others – Journal of Studies in Technical Careers, 1982
The National Longitudinal Survey of Labor Force Behavior concluded that underprivileged youth lack occupational information. But the Knowledge of the World of Work Test administered to the young people surveyed does not appear to be reliable. (SK)
Descriptors: Adolescents, Disadvantaged Youth, Occupational Information, Occupational Tests
Peer reviewedReynolds, William M.; Bernstein, Sydna M. – Journal of Abnormal Child Psychology, 1982
The factorial validity and internal consistency reliability of the Devereux Elementary School Behavior Rating Scale were examined with a random sample of elementary school children. Given the problem of multicollinearity that was shown to exist among subscales, the authors suggest caution in the interpretation of Devereux subscales as discrete…
Descriptors: Behavior Rating Scales, Elementary Education, Emotional Problems, Test Reliability
Peer reviewedReynolds, Cecil R. – Perceptual and Motor Skills, 1981
The Revised Children's Manifest Anxiety Scale was administered to 534 children (grades 4-6) at the beginning and end of the school year. The total anxiety score correlated .68 across the nine-month period, supporting the temporal stability of the test. (Author/BW)
Descriptors: Anxiety, Intermediate Grades, Personality Measures, Personality Traits
Peer reviewedWilcox, Rand R. – Educational and Psychological Measurement, 1982
Results in the engineering literature on "k out of n system reliability" can be used to characterize tests based on estimates of the probability of correctly determining whether the examinee knows the correct response. In particular, the minimum number of distractors required for multiple-choice tests can be empirically determined.…
Descriptors: Achievement Tests, Mathematical Models, Multiple Choice Tests, Test Format
Peer reviewedBattle, James – Perceptual and Motor Skills, 1981
Two hundred seventy-four boys and girls, aged 7 through 15 years, participated in a test-retest reliability study of the Developmental Tests of Visual-Motor Association. Test-retest correlations for the total group and subjects comprising categories A through D were significant. (Author/SJL)
Descriptors: Elementary Secondary Education, Nonverbal Tests, Perceptual Motor Coordination, Test Reliability
Peer reviewedLuftig, Jeffrey T.; Norton, Willis P. – Journal of Studies in Technical Careers, 1981
The purpose of this article is to review applications of reliability formulas and to recommend more appropriate methods of determining the reliability of affective instruments. (SK)
Descriptors: Affective Measures, Error of Measurement, Measurement Techniques, Test Reliability
Peer reviewedZimmerman, Donald W.; Williams, Richard H. – Psychometrika, 1982
Formulas for the standard error of measurement of three measures of change (simple differences; residualized difference scores; and a measure introduced by Tucker, Damarin, and Messick) are derived. A practical guide for determining the relative error of the three measures is developed. (Author/JKS)
Descriptors: Achievement Gains, Algorithms, Differences, Error of Measurement
Peer reviewedGati, Itamar – Educational and Psychological Measurement, 1981
This paper examines the properties of the Item Efficiency Index proposed by Neill and Jackson (1976; EJ 137 077) for minimum redundancy item analysis. (Author/BW)
Descriptors: Correlation, Factor Structure, Item Analysis, Mathematical Models
Peer reviewedCallender, John C.; Osburn, H. G. – Journal of Educational Measurement, 1979
Some procedures for estimating internal consistency reliability may be superior mathematically to the more commonly used methods such as Coefficient Alpha. One problem is computational difficulty; the other is the possibility of overestimation due to capitalization on chance. (Author/CTM)
Descriptors: Higher Education, Mathematical Formulas, Research Problems, Sampling
Peer reviewedWilcox, Rand R. – Journal of Educational Statistics, 1981
Both the binomial and beta-binomial models are applied to various problems occurring in mental test theory. The paper reviews and critiques these models. The emphasis is on the extensions of the models that have been proposed in recent years, and that might not be familiar to many educators. (Author)
Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Test Reliability
Peer reviewedZingale, Salvatore A.; And Others – Learning Disability Quarterly, 1980
The temporal stability of three levels (primary I, primary II, and elementary) of the Metropolitan Achievement Test (MAT) when administered to children with learning disabilities was investigated. It was concluded that the MAT is reliable when used with learning disabled children. (Author/SBH)
Descriptors: Achievement Tests, Elementary Education, Exceptional Child Research, Learning Disabilities
Peer reviewedDyer, Frank J. – Educational and Psychological Measurement, 1980
Power analysis is in essence a technique for estimating the probability of obtaining a specific minimum observed effect size. Power analysis techniques are applied to research planning problems in test reliability studies. A table for use in research planning and hypothesis testing is presented. (Author)
Descriptors: Hypothesis Testing, Mathematical Formulas, Power (Statistics), Probability
Peer reviewedGrow, Richard; And Others – Journal of Clinical Psychology, 1980
Evaluated the effectiveness of 13 different Minnesota Multiphasic Personality Inventory taking direction techniques. Results indicated that from 10 percent to 81 percent of the variance associated with faking can be accounted for by various detection strategies. These techniques are recommended for clinical use. (Author)
Descriptors: Evaluation Methods, Measurement Techniques, Personality Measures, Test Reliability
Peer reviewedVestewig, Richard E. – Social Behavior and Personality, 1978
Item endorsement rates and reliability of the Smith Quick Measure of Achievement Motivation were studied. Concluded that the scale may have too little variance to discriminate college students on achievement motivation and that, for students, one item may be inappropriately keyed. (Author)
Descriptors: Achievement, Analysis of Variance, College Students, Motivation


