Huynh, Huynh – Journal of Educational Statistics, 1979 (peer reviewed)
In mastery testing, the raw agreement index and the kappa index may be estimated via one test administration when the test scores follow beta-binomial distributions. This paper reports formulae, tables, and a computer program which facilitate the computation of the standard errors of the estimates. (Author/CTM)
Descriptors: Computer Programs, Cutting Scores, Decision Making, Mastery Tests
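For readers unfamiliar with the two indices named in this abstract, the following is a minimal sketch of their textbook two-administration definitions. It is not the single-administration beta-binomial estimator the paper reports, and it does not compute standard errors. The function name `agreement_and_kappa`, the cutoff rule (score at or above the cutoff counts as mastery), and the sample scores are all assumptions made for illustration.

```python
def agreement_and_kappa(scores_a, scores_b, cutoff):
    """Return (raw agreement p0, Cohen's kappa) for mastery decisions.

    scores_a, scores_b: scores of the same examinees on two administrations.
    cutoff: hypothetical mastery cutting score (score >= cutoff is a master).
    """
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("need two equal-length, non-empty score lists")
    n = len(scores_a)

    # Classify each examinee as master (True) or non-master (False).
    masters_a = [s >= cutoff for s in scores_a]
    masters_b = [s >= cutoff for s in scores_b]

    # Raw agreement: proportion classified the same way both times.
    p0 = sum(a == b for a, b in zip(masters_a, masters_b)) / n

    # Chance agreement from the marginal mastery proportions.
    pa = sum(masters_a) / n
    pb = sum(masters_b) / n
    pc = pa * pb + (1 - pa) * (1 - pb)
    if pc == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")

    # Kappa: agreement beyond chance, rescaled by its maximum possible value.
    kappa = (p0 - pc) / (1 - pc)
    return p0, kappa


# Illustrative data only: eight examinees, cutting score of 7.
first = [8, 9, 5, 4, 7, 3, 9, 6]
second = [9, 8, 6, 3, 5, 4, 8, 7]
p0, kappa = agreement_and_kappa(first, second, cutoff=7)
print(p0, kappa)
```

With these made-up scores, six of eight decisions agree, so raw agreement exceeds kappa, which discounts the agreement expected by chance.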
Tinsley, Howard E. A.; And Others – Educational and Psychological Measurement, 1981 (peer reviewed)
Two procedures for scoring the Recreation Experience Preference scales were investigated using data obtained from respondents engaged in outdoor recreational activities. Both procedures yielded acceptable levels of reliability and concurrent validity. When time is unimportant, the scale score strategy is preferred over the domain score strategy.…
Descriptors: Methods, Outdoor Activities, Participant Satisfaction, Recreational Activities
Hiscock, Merrill – Journal of Consulting and Clinical Psychology, 1978 (peer reviewed)
Examined imagery questionnaires and addressed issues of reliability, agreement among questionnaires, social desirability, and construct validity. The Betts and Gordon scales and the Paivio Individual Differences Questionnaire were examined. Reliability of the Paivio inventory was satisfactory and equivalent to other imagery questionnaires. Imagery…
Descriptors: Imagery, Males, Measurement, Questionnaires
Feldhusen, John F.; Kolloff, Margaret Britton – Perceptual and Motor Skills, 1981 (peer reviewed)
The purpose of this research was to develop a self-concept scale for gifted students which focused on gifted children's talents and abilities and associated or related behaviors. Preliminary development analyses for 224 gifted girls and 188 boys in Grades 3 through 6 indicate promising reliability and possibly good validity. (Author/SJL)
Descriptors: Gifted, Intermediate Grades, Self Concept Measures, Test Construction
Kraemer, Helena Chmura – Psychometrika, 1981 (peer reviewed)
Limitations and extensions of Feldt's approach to testing the equality of Cronbach's alpha coefficients in independent and matched samples are discussed. In particular, this approach is used to test equality of intraclass correlation coefficients. (Author)
Descriptors: Analysis of Variance, Correlation, Hypothesis Testing, Mathematical Models
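As background for the coefficient this abstract refers to, here is a minimal sketch of the standard sample formula for Cronbach's alpha. It only computes alpha for one sample; it does not implement Feldt's test of equality or the intraclass-correlation extension discussed in the paper. The function name `cronbach_alpha` and the data layout (one row per respondent, one column per item) are assumptions made for illustration.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for item scores.

    rows: list of respondents, each a list of k item scores (assumed layout).
    Uses alpha = k/(k-1) * (1 - sum(item variances) / variance(total score)).
    """
    if len(rows) < 2:
        raise ValueError("need at least two respondents")
    k = len(rows[0])
    if k < 2 or any(len(r) != k for r in rows):
        raise ValueError("every respondent needs the same k >= 2 item scores")

    # Sample variance of each item (columns of the data matrix).
    item_vars = [variance(col) for col in zip(*rows)]

    # Sample variance of the unit-weighted total score.
    total_var = variance([sum(r) for r in rows])
    if total_var == 0:
        raise ValueError("alpha is undefined when total scores do not vary")

    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Illustrative data only: two items that rank respondents identically.
print(cronbach_alpha([[1, 2], [2, 3], [3, 4]]))
```

In the made-up data the two items are perfectly consistent, so alpha reaches its maximum of 1.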
Tolman, Richard R.; Bishop, Janet L. – Journal for Special Educators, 1980 (peer reviewed)
The Preschool-Primary Nowicki-Strickland Internal-External Control Scale (PPNSIE) was used to assess the locus of control in 359 mildly retarded secondary students. Unacceptably low reliability estimates for the PPNSIE were revealed, with IQ and age the strongest predictors of scores. (CL)
Descriptors: Locus of Control, Mild Mental Retardation, Psychological Testing, Secondary Education
Berk, Ronald A. – Journal for Special Educators, 1981 (peer reviewed)
The strengths of the scales were concentrated in the areas of domain and construct validity, weaknesses in the areas of discriminant validity, interobserver reliability, and decision-making reliability. (DB)
Descriptors: Adjustment (to Environment), Behavior Rating Scales, Mental Retardation, Test Reliability
Cureton, Kirk J. – Research Quarterly for Exercise and Sport, 1981 (peer reviewed)
The increasing use of various VO2 max expressions as test measures is a problem because the magnitude of sex difference varies considerably with each expression. A valid match of male and female test subjects would consider physical activity history and the amount of endurance exercise done in the previous year. (Author/FG)
Descriptors: Exercise Physiology, Performance Factors, Physical Characteristics, Sex Differences
Lubin, Bernard; And Others – Hispanic Journal of Behavioral Sciences, 1980 (peer reviewed)
The study aimed to develop an instrument that could be used for mental health research with Spanish-speaking populations. The Depression Adjective Check Lists (DACL) used to measure depressive mood was translated into Spanish and administered to 70 Hispanic subjects. Reliability determinations were high and close to those for the English version.…
Descriptors: Depression (Psychology), Mental Health, Psychological Testing, Spanish
Bentler, P. M.; Woodward, Arthur J. – Psychometrika, 1980 (peer reviewed)
A chain of lower bound inequalities leading to the greatest lower bound to reliability is established for the internal consistency of a composite of unit-weighted scores (such as a test). Algorithms for obtaining various reliability coefficients are presented. (Author/JKS)
Descriptors: Factor Analysis, Item Analysis, Measurement Techniques, Test Construction
Woehlke, Paula; Ohara, Takeshi – Educational and Psychological Measurement, 1980 (peer reviewed)
To assess the stability of the factor structure of the Instructional Improvement Questionnaire (IIQ), factor analyses were run for 1973, 1974, and 1975 results, partialling out three variables: expected grade, percent taking the course as an elective, and student's year. The factors were stable over the three years. (Author/BW)
Descriptors: Factor Analysis, Factor Structure, Questionnaires, Student Evaluation of Teacher Performance
O'Shea, Arthur J.; Harrington, Thomas F. – Measurement and Evaluation in Guidance, 1980
Describes the procedures the authors of the System for Career Decision-Making (CDM) followed in establishing client scoring reliability. Authors recommend that manuals of self-scored inventories provide data establishing scorer reliability, that scoring be supervised, and that APGA test standards deal directly with scorer reliability. (Author)
Descriptors: Career Choice, College Students, Decision Making, Interest Inventories
Krus, Patricia H.; And Others – Perceptual and Motor Skills, 1981 (peer reviewed)
The purpose of this study was to investigate the structure of motor proficiency in a sample of 765 children between the ages of 4 1/2 and 14 1/2 years. The study was conducted as one aspect of the standardization of a motor proficiency scale, the Bruininks-Oseretsky Test of Motor Proficiency. (Author/SJL)
Descriptors: Adolescents, Children, Factor Structure, Motor Development
Rudner, Lawrence M.; And Others – Journal of Educational Statistics, 1980 (peer reviewed)
Investigations of item bias provide an empirical basis for the identification and elimination of items appearing to measure different traits across population/culture groups. This paper reviews the psychometric rationales of six types of approaches to biased item identification. (Author/JKS)
Descriptors: Culture Fair Tests, Item Analysis, Latent Trait Theory, Test Bias
Nitko, Anthony J. – New Directions for Testing and Measurement, 1980
Criterion-referencing is a way to enhance the interpretation of test scores by referencing them to well-defined behavior domains. Behavior domains may be ordered or unordered; several varieties of criterion-referenced tests within each of these types are discussed. (Author/RL)
Descriptors: Classification, Criterion Referenced Tests, Scaling, Scores


