Peer reviewed: Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1984
Three types of test were compared: a completion test, a matching test, and a multiple-choice test. The completion test was more reliable than the matching test, and the matching test was more reliable than the multiple-choice test. (Author/BW)
Descriptors: Comparative Analysis, Error of Measurement, Higher Education, Mathematical Models
Hess, Joseph W. – Journal of Medical Education, 1969
In a comparison of two systems used for evaluating the skills of medical students in relating to patients, the one utilizing interaction analysis yielded more reliable ratings and seems to have potential as an instructional method. The other system used traditional types of judgments registered on a 10-point continuum. Both systems were used with…
Descriptors: Behavior Rating Scales, Comparative Analysis, Evaluation Methods, Interaction Process Analysis
Peer reviewed: McConnell, Campbell – Journal of Economic Education, 1983
The appropriateness of the Flesch reading formula for evaluating college economics textbooks is questioned. The Dale-Chall, Modified Dale-Chall, Fry, and Flesch formulas were used to evaluate nine introductory textbooks. There was little or no consistency in either the absolute reading levels or the rank orderings. (Author/AM)
Descriptors: Comparative Analysis, Economics Education, Educational Research, Higher Education
Peer reviewed: Stiggins, Richard J. – Research in the Teaching of English, 1982
Compares direct and indirect writing assessment strategies and contrasts them in terms of the relationship each has to specific classroom decision-making situations, the components of writing assessed, practical testing matters, characteristics of test exercises, test scoring procedures, and procedures for determining test quality. (HOD)
Descriptors: Comparative Analysis, Decision Making, Educational Assessment, Test Format
Peer reviewed: Newmark, Charles S. – Journal of Clinical Psychology, 1981
Provides a brief synopsis of the utility of Minnesota Multiphasic Personality Inventory (MMPI) short forms with psychiatric, medical and normal samples. Strengths and limitations of each MMPI short form are discussed. (Author)
Descriptors: Clinical Psychology, Comparative Analysis, Diagnostic Tests, Personality Measures
Drummond, Robert J.; McIntire, Walter G. – Measurement and Evaluation in Guidance, 1977
This article examines the factor structure of two measures of self-concept routinely used with elementary school children: the Coopersmith Self-Esteem Inventory and the Self-Concept and Motivation Inventory. (Author)
Descriptors: Comparative Analysis, Factor Analysis, Factor Structure, Self Concept
Peer reviewed: Schinka, John A.; LaLone, Leif – Psychological Assessment, 1997
To study deviations from U.S. population demographics in the Minnesota Multiphasic Personality Inventory-2 restandardization sample, validity, clinical, content, and supplementary scale comparisons were performed between the restandardization sample and a census-matched subsample. Results indicate that deviations from population demographic…
Descriptors: Census Figures, Comparative Analysis, Demography, Norms
Peer reviewed: Engelhard, George, Jr.; And Others – Journal of Research and Development in Education, 1990
Results are reported from a study that investigated the correspondence between two methods used to assess differential item functioning (test item bias). The study also explored the influence of sample size on the two procedures. Although agreement between the two procedures was generally good, the Rasch procedure was more reliable. (IAH)
Descriptors: Comparative Analysis, Elementary Secondary Education, Item Bias, Racial Differences
Peer reviewed: Kramer, Gene A.; DeMarais, David R. – Journal of Dental Education, 1992
This study found that the restructured National Board Dental Examination Part II is a reliable test assessing a full range of cognitive behaviors, and a unidimensional test of comprehensive general dentistry, suggesting better testing of knowledge and problem-solving skills than on the traditional examination. Performance on the pilot and…
Descriptors: Comparative Analysis, Dentistry, Higher Education, Licensing Examinations (Professions)
Peer reviewed: McCabe, Patrick P.; And Others – Reading Research and Instruction, 1991
Compares reading-disabled students' instructional levels yielded by the Ekwall Reading Inventory and the Metropolitan Achievement Test. Finds that both instruments provide instructional-level designations to help place students in appropriate materials, but that there is little correspondence between the instructional levels of the two measures. Suggests that…
Descriptors: Comparative Analysis, Intermediate Grades, Reading Diagnosis, Reading Difficulties
Salthouse, Timothy A.; Schroeder, David H.; Ferrer, Emilio – Developmental Psychology, 2004
Several analyses were conducted on data from samples of adults between 18 and 58 years of age who completed the same cognitive tests after an interval ranging from less than 1 week to 35 years. Because the retest interval varied across individuals, it was possible to determine the length of time needed before the gains associated with a retest…
Descriptors: Cognitive Tests, Test Reliability, Cognitive Ability, Adults
Newman, Michelle G.; Holmes, Marilyn; Zuellig, Andrea R.; Kachin, Kevin E.; Behar, Evelyn – Psychological Assessment, 2006
This study examined the Panic Disorder Self-Report (PDSR), a new self-report diagnostic measure of panic disorder based on the 4th edition of the Diagnostic and Statistical Manual of Mental Disorders (American Psychiatric Association, 1994). PDSR diagnoses were compared with structured interview diagnoses of individuals with generalized anxiety…
Descriptors: Test Reliability, Validity, Diagnostic Tests, Clinical Diagnosis
Poteat, G. Michael; And Others – 1986
Six sociometric measures were evaluated on a sample of 85 four-year-olds from three preschool and day care centers. Stability, intercorrelations, and accuracy of classifying rejected children were compared for measures of social preference, social impact, peer ratings, alternative status, and positive and negative nominations. Test-retest…
Descriptors: Classification, Comparative Analysis, Construct Validity, Peer Relationship
Kapes, Jerome T. – 1975
Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus). As part of a ten-year longitudinal study of Vocational Development being conducted in the Department of Vocational Education at The…
Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores
Chandler, Theodore A.; Patterson, Richard G. – 1976
This study demonstrated the efficacy of a Likert format in contrast to two choice formats in eliciting a more normal distribution of internal-external locus of control responses in a highly variable lower-class university sample. The revised Likert format, in contrast to the original, corroborated other research evidence suggesting a multifactor…
Descriptors: Comparative Analysis, Factor Analysis, Individual Differences, Locus of Control

