Publication Date
| In 2026 | 0 |
| Since 2025 | 433 |
| Since 2022 (last 5 years) | 1911 |
| Since 2017 (last 10 years) | 4483 |
| Since 2007 (last 20 years) | 6968 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 830 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 159 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Ebel, Robert L. – Educ Psychol Meas, 1969
Descriptors: Item Analysis, Multiple Choice Tests, Objective Tests, Test Reliability
Peer reviewedShapiro, Alexander – Psychometrika, 1982
Minimum trace factor analysis has been used to find the greatest lower bound to reliability. This technique, however, fails to be scale free. A solution to the scale problem is proposed through the maximization of the greatest lower bound as the function of weights. (Author/JKS)
Descriptors: Algorithms, Estimation (Mathematics), Factor Analysis, Psychometrics
Peer reviewedJuni, Samuel – Journal of Research in Personality, 1982
Reanalysis of previously published data suggests the Defense Mechanism Inventory can yield a composite measure of reaction to frustration by contrasting linearly the defenses Turning-against-object and Projection against those of Principalization and Reversal-of-affect. Factor-analytic and correlational data support the exclusion of…
Descriptors: Data Analysis, Measures (Individuals), Psychological Characteristics, Test Reliability
Peer reviewedGilmer, Jerry S.; Feldt, Leonard S. – Psychometrika, 1983
Estimating the reliability of measures derived from separate questions on essay tests or individual judges on a rater panel is considered. Cronbach's alpha is shown to underestimate reliability in these cases. Some alternative coefficients are presented. (JKS)
Descriptors: Essay Tests, Item Analysis, Measurement Techniques, Rating Scales
Peer reviewedTorabi-Parizi, Rosa; Campbell, Noma Jo – Elementary School Journal, 1982
Investigates the effects of varying the placement of blanks and the number of options available in multiple-choice items on the reliability of fifth-grade students' scores. Results indicate that scores on three-choice item tests were not less reliable than scores on four-choice item tests. A similar finding was found regarding the placement of…
Descriptors: Elementary Education, Elementary School Students, Scores, Test Format
Peer reviewedDillon, Ronna F.; Donow, Carolyn – Educational and Psychological Measurement, 1982
College undergraduates were given Zelniker and Jeffrey's modification of the Matching Familiar Figures Test to assess its psychometric credibility and construct validity for adult problem solvers. The modified test has improved internal consistency and stability over the original. The construct's possible correlation with general problem solving…
Descriptors: Cognitive Tests, Higher Education, Problem Solving, Test Reliability
Peer reviewedHolden, E. Wayne; And Others – Journal of Abnormal Child Psychology, 1982
PANESS total score was reliable and significantly correlated with relevant indices of the Wechsler Intelligence Scale for Children-Revised. (Author/CL)
Descriptors: Clinical Diagnosis, Disability Identification, Elementary Education, Neurological Impairments
Peer reviewedEvans, Charles S. – Journal of Moral Education, 1982
Describes a study investigating the comparative reliability of Form A and Form B of the Moral Judgment Interview when given in written version to high school students. Subjects were 49 juniors and seniors enrolled in a behavioral science class. Findings indicated that alternative forms of the interview are not highly correlated. (AM)
Descriptors: Educational Research, Ethical Instruction, High Schools, Test Reliability
Peer reviewedYule, William; Rigley, Leslie V. – Journal of Research in Reading, 1982
Findings suggest that modestly good predictions can be made between IQ as measured by the Wechsler intelligence scales for children at age five and one-half and scores on group reading tests administered at ages seven and eight years. (FL)
Descriptors: Intelligence Tests, Predictive Validity, Primary Education, Reading Tests
Peer reviewedGallagher, Dolores; And Others – Journal of Consulting and Clinical Psychology, 1982
Reports three reliability coefficients for the Beck Depression Inventory using samples of elderly community volunteers and depressed outpatients. All three indexes were reasonably high in the total sample and fall within the accepted range of reliability for a clinical screening instrument. (Author)
Descriptors: Depression (Psychology), Diagnostic Tests, Measures (Individuals), Older Adults
Peer reviewedvan den Wollenberg, Arnold L. – Psychometrika, 1982
Presently available test statistics for the Rasch model are shown to be insensitive to violations of the assumption of test unidimensionality. Two new statistics are presented. One is similar to available statistics, but with some improvements; the other addresses the problem of insensitivity to unidimensionality. (Author/JKS)
Descriptors: Item Analysis, Latent Trait Theory, Statistics, Test Reliability
Peer reviewedBrown, Hilary S. R.; May, Arthur E. – Journal of Consulting and Clinical Psychology, 1979
The test-retest IQs of 50 patients were correlated. The patients were included in the sample only because they had been given the Wechsler Adult Intelligence Scale before. The interval between test and retest averaged almost two years. All test-retest correlations were .90 or better. (Author)
Descriptors: Correlation, Followup Studies, Foreign Countries, Intelligence Tests
Peer reviewedBurton, Nancy W. – Educational and Psychological Measurement, 1981
This study was concerned with selecting a measure of scorer agreement for use with the National Assessment of Educational Progress. The simple percent of agreement and Cohen's kappa were compared. It was concluded that Cohen's kappa does not add sufficient information to make its calculation worthwhile. (Author/BW)
Descriptors: Educational Assessment, Elementary Secondary Education, Quality Control, Scoring
Peer reviewedRaju, Nambury S. – Psychometrika, 1979
An important relationship is given for two generalizations of coefficient alpha: (1) Rajaratnam, Cronbach, and Gleser's generalizability formula for stratified-parallel tests, and (2) Raju's coefficient beta. (Author/CTM)
Descriptors: Item Analysis, Mathematical Formulas, Test Construction, Test Items
Peer reviewedBrennan, Robert L.; Prediger, Dale J. – Educational and Psychological Measurement, 1981
This paper considers some appropriate and inappropriate uses of coefficient kappa and alternative kappa-like statistics. Discussion is restricted to the descriptive characteristics of these statistics for measuring agreement with categorical data in studies of reliability and validity. (Author)
Descriptors: Classification, Error of Measurement, Mathematical Models, Test Reliability


