Publication Date
| In 2026 | 2 |
| Since 2025 | 462 |
| Since 2022 (last 5 years) | 1941 |
| Since 2017 (last 10 years) | 4513 |
| Since 2007 (last 20 years) | 6998 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10004 |
| Test Construction | 4369 |
| Foreign Countries | 3831 |
| Psychometrics | 2428 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 838 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 162 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Peer reviewed. Mitchell, Karen; Anderson, Judy – Educational and Psychological Measurement, 1986
This study examined the reliability of holistic scoring for a sample of essays written during the Spring 1985 MCAT administration. Analysis of variance techniques were used to estimate the reliability of scoring and to partition score variance into that due to level differences between papers and that due to context-specific factors. (Author/LMO)
Descriptors: Analysis of Variance, Essay Tests, Holistic Evaluation, Medical Education
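As an illustration of how analysis of variance can yield a scoring-reliability estimate, the sketch below computes a one-way intraclass correlation from a papers-by-readers score table. It is not the design used in the study, which the abstract does not describe in detail; the function name `icc_oneway` and the scores are hypothetical.

```python
def icc_oneway(ratings):
    """One-way random-effects intraclass correlation, ICC(1).

    ratings: list of papers, each a list of k scores (one per reader).
    Hypothetical helper for illustration only.
    """
    n = len(ratings)          # number of papers
    k = len(ratings[0])       # readers per paper
    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    paper_means = [sum(row) / k for row in ratings]

    # Between-paper and within-paper sums of squares.
    ss_between = k * sum((m - grand_mean) ** 2 for m in paper_means)
    ss_within = sum(
        (x - m) ** 2 for row, m in zip(ratings, paper_means) for x in row
    )

    ms_between = ss_between / (n - 1)
    ms_within = ss_within / (n * (k - 1))

    # Share of score variance attributable to differences between papers.
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Made-up holistic scores: four essays, two readers each.
example_scores = [[4, 5], [2, 3], [6, 5], [1, 2]]
reliability = icc_oneway(example_scores)
```

A value near 1 would indicate that most score variance reflects differences between papers rather than disagreement among readers.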
Peer reviewed. Wiersma, Uco; Latham, Gary P. – Personnel Psychology, 1986
The practicality of three appraisal instruments was measured in terms of user preference, namely, behavioral observation scales (BOS), behavioral expectation scales (BES), and trait scales. In all instances, BOS were preferred to BES, and in all but two instances, BOS were viewed as superior to trait scales. (Author/ABB)
Descriptors: Administrators, Behavior Patterns, Behavior Rating Scales, Personnel Evaluation
Peer reviewed. Kinnier, Richard T. – Journal of Counseling Psychology, 1987
Describes the development of a Values Conflict Resolution Assessment (VCRA) and reports on validation and reliability. Items were constructed from theoretical criteria in values clarification and decision making with "ethical-emotional" and "rational-behavioral" components. VCRA scores correlated negatively with…
Descriptors: Anxiety, Conflict Resolution, Decision Making, Psychometrics
Peer reviewed. Lubin, Bernard; And Others – Hispanic Journal of Behavioral Sciences, 1986
Tested utility of Spanish (American) version of Depression Adjective Check Lists with 70 Mexican American and 66 Mexican college student samples. Found no significant differences on lists E, F, and G. Found significant concurrent validity in Mexican sample by means of correlations with the Center for Epidemiologic Studies Depression Scale. (NEC)
Descriptors: College Students, Comparative Analysis, Mexican Americans, Mexicans
An Evaluation of the Diagnostic Efficiency of the Wechsler Intelligence Scale for Children--Revised.
Peer reviewed. Mueller, Horst H.; And Others – Alberta Journal of Educational Research, 1984
Because diagnostic capability of the WISC-R has remained in doubt, its diagnostic suitability was assessed by applying Kelley's method of estimating the proportion of score differences in excess of chance to the original subscales, Bannatyne clusters, and Kaufman's three factor groupings. Caution should be used when applying WISC-R diagnostically.…
Descriptors: Clinical Diagnosis, Comparative Analysis, Evaluation Criteria, Tables (Data)
Peer reviewed. Zimmerman, Irla Lee; And Others – Psychology in the Schools, 1986
Assessed the degree of comparability between the tests over time for two samples of referred adolescents of borderline intelligence. Results indicated that the Wechsler Adult Intelligence Scale-Revised significantly overestimated the Wechsler Intelligence Scale for Children-Revised by three to five points. Differences were most marked at the lower…
Descriptors: Adolescents, Comparative Analysis, Intelligence Tests, Learning Disabilities
Peer reviewed. Mackie, Kerrie; Dermody, Phillip – Journal of Speech and Hearing Research, 1986
The monosyllabic adaptive speech test (MAST) procedures were found to be reliable with children as young as 3. The accuracy of the MAST estimate of 50% speech threshold was confirmed in 60 hearing impaired and normal children 3-7 years old. With 10 hearing impaired children, MAST threshold was significantly correlated with pure-tone loss. (CL)
Descriptors: Audiometric Tests, Elementary Education, Hearing Impairments, Preschool Education
Peer reviewed. Schmitt, Neal; Ostroff, Cheri – Personnel Psychology, 1986
Delineates a systematic procedure for operationalizing the "behavioral consistency" notion. The steps used in developing selection tests from a content-oriented strategy are illustrated, and the transformation of specific job behaviors into tests related to job content is demonstrated. Test reliability and content validity are presented.…
Descriptors: Job Application, Job Placement, Job Skills, Occupational Information
Peer reviewed. Kane, Michael T. – Journal of Educational Measurement, 1986
These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)
Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory
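The 0.5 threshold can be motivated with a short classical-test-theory argument. The derivation below is a sketch under stated assumptions, not a reproduction of the article's analysis: it assumes the "a priori procedure" assigns every student the same constant, here the population mean $\mu$, and that error $E$ has mean zero and is uncorrelated with the universe score $T$.

```latex
% Observed score model: X = T + E, with E[E] = 0 and Cov(T, E) = 0.
% Reliability: \rho = \sigma_T^2 / (\sigma_T^2 + \sigma_E^2).
\begin{align*}
\text{MSE of the observed score:} \quad
  & \mathbb{E}\big[(X - T)^2\big] = \mathbb{E}\big[E^2\big] = \sigma_E^2, \\
\text{MSE of the constant estimate } \mu\text{:} \quad
  & \mathbb{E}\big[(\mu - T)^2\big] = \sigma_T^2, \\
\text{constant estimate is better} \iff
  & \sigma_T^2 < \sigma_E^2
  \iff \rho = \frac{\sigma_T^2}{\sigma_T^2 + \sigma_E^2} < \frac{1}{2}.
\end{align*}
```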
Peer reviewed. Carey, Michael P.; And Others – Journal of Consulting and Clinical Psychology, 1986
Reports on the development of the Adolescent Activities Checklist (AAC), which comprises 100 items that assess pleasant and unpleasant activities. The AAC subscales demonstrated high internal consistency and homogeneity. Results suggest the AAC is a reliable index of the frequency of pleasant and unpleasant activities reported by adolescents.…
Descriptors: Adolescents, Depression (Psychology), Measurement Techniques, Physical Activity Level
Peer reviewed. Frisbie, David A.; Druva, Cynthia A. – Journal of Educational Measurement, 1986
This study was designed to examine the level of dependence within multiple true-false test-item clusters by computing sets of item correlations with data from a test composed of both multiple true-false and multiple-choice items. (Author/LMO)
Descriptors: Cluster Analysis, Correlation, Higher Education, Multiple Choice Tests
Peer reviewed. Kazdin, Alan E.; And Others – Journal of Consulting and Clinical Psychology, 1986
Evaluated psychometric features and correlates of the Hopelessness Scale for Children. Results indicated the scale was internally consistent, item-total score correlations and test-retest reliability were in the moderate range, and individual items discriminated high and low hopelessness children. Hopelessness correlated positively with depression…
Descriptors: Children, Depression (Psychology), Elementary Secondary Education, Self Esteem
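Internal consistency of the kind reported here is commonly summarized with Cronbach's alpha. The sketch below shows that calculation on made-up responses; the abstract does not state which coefficient the authors used, and the function name `cronbach_alpha` and the data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score table.

    scores: list of respondents, each a list of k item scores.
    Hypothetical helper for illustration only.
    """
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Made-up responses: five children, four items scored 0 or 1.
responses = [
    [1, 1, 1, 0],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 0, 1, 1],
]
alpha = cronbach_alpha(responses)
```

Alpha rises as items covary more strongly relative to their individual variances, which is why it is read as an index of internal consistency.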
Peer reviewed. Gutkin, Terry B.; And Others – Educational and Psychological Measurement, 1985
This study examined selected psychometric properties of the Health Locus of Control Scale (HLOC) (Wallston, Wallston, Kaplan, and Maides, 1976). Specifically, the HLOC factor structure, factor score reliabilities, and correlations with social desirability were investigated. (Author)
Descriptors: Attitude Measures, Factor Structure, Health, Higher Education
Peer reviewed. Graham, John W.; And Others – Journal of Drug Education, 1984
Describes an evaluation of a self-report questionnaire administered to seventh graders (N=396). Using the test-retest reliability matrix, eight of nine drug-use indices appeared to have acceptable to good reliability. The three measures included in the test-retest reliability matrix provide stronger evidence for good reliability than could any…
Descriptors: Drug Use, Junior High School Students, Junior High Schools, Measurement Techniques
Fuchs, Lynn S.; And Others – Diagnostique, 1983
Effects of aggregation on reliability of curriculum-based measures of academic performance were explored in two studies involving elementary students. Findings suggested that some academic behaviors initially are measured precisely (aggregation had minimal effect), while other behaviors, such as scores on the error passage reading measure, are not…
Descriptors: Academic Achievement, Diagnostic Teaching, Disabilities, Elementary Education
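One standard way to reason about the effect of aggregation on reliability is the Spearman-Brown prophecy formula, which assumes the aggregated measurements are parallel. The sketch below applies it to illustrative values; the abstract does not say that the authors used this formula, and the function name `spearman_brown` is hypothetical.

```python
def spearman_brown(reliability, k):
    """Predicted reliability of the mean of k parallel measurements.

    reliability: reliability of a single measurement (between 0 and 1).
    k: number of measurements aggregated.
    Hypothetical helper for illustration only.
    """
    return (k * reliability) / (1 + (k - 1) * reliability)


# Gain from averaging two measurements, for a weak and a strong measure.
gain_low = spearman_brown(0.5, 2) - 0.5    # about 0.167
gain_high = spearman_brown(0.9, 2) - 0.9   # about 0.047
```

Under this assumption, a measure that is already precise gains little from aggregation, while a less reliable measure gains considerably more.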


