Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 375 |
| Since 2007 (last 20 years) | 1130 |
Descriptor
| Comparative Analysis | 1943 |
| Reliability | 880 |
| Test Reliability | 792 |
| Foreign Countries | 554 |
| Test Validity | 443 |
| Correlation | 350 |
| Validity | 332 |
| Interrater Reliability | 327 |
| Statistical Analysis | 321 |
| Scores | 280 |
| Measures (Individuals) | 236 |
| More ▼ | |
Source
Author
| Reckase, Mark D. | 6 |
| Attali, Yigal | 5 |
| Coniam, David | 5 |
| Brennan, Robert L. | 4 |
| Crehan, Kevin D. | 4 |
| Feldt, Leonard S. | 4 |
| Hakstian, A. Ralph | 4 |
| Jones, Ian | 4 |
| Kolen, Michael J. | 4 |
| Lunz, Mary E. | 4 |
| August, Diane | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 35 |
| Practitioners | 29 |
| Teachers | 15 |
| Administrators | 9 |
| Policymakers | 6 |
| Counselors | 2 |
| Media Staff | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| Turkey | 59 |
| United States | 47 |
| Australia | 36 |
| China | 33 |
| Canada | 32 |
| United Kingdom (England) | 32 |
| United Kingdom | 28 |
| Germany | 25 |
| Netherlands | 24 |
| Taiwan | 22 |
| Hong Kong | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Kulikowich, Jonna M.; Mason, Linda H.; Brown, Scott W. – Reading and Writing: An Interdisciplinary Journal, 2008
Drawing from multiple theoretical frameworks representing cognitive and educational psychology, we present a writing task and scoring system for measurement of students' informative writing. Participants in this study were 72 fifth- and sixth-grade students who wrote compositions describing real-world problems and how mathematics, science, and…
Descriptors: World Problems, Expository Writing, Educational Psychology, Validity
Thrash, Susan K.; Porter, Andrew C. – 1974
The purpose of this paper is to prove that one currently recommended method of obtaining the reliability of an instrument defined on a population of aggregate units is invalid. This method randomly splits the aggregate into two halves, correlates the two half unit scores by a Pearson product moment correlation coefficient, and corrects the…
Descriptors: Comparative Analysis, Correlation, Measurement Techniques, Sampling
Thostenson, Marvin S. – 1966
This investigation dealt with the development and evaluation of both a music dictation test (PRM78 Dictation Test) and a sightsinging test (CSS76 Criterion Sightsinging Test). It was hoped that the dictation test could eventually be developed to serve as an adequate replacement for the latter. Thirteen samples participated in this project--7…
Descriptors: Auditory Training, Comparative Analysis, Music Reading, Statistical Analysis
Kleinke, David J. – 1976
Data from 200 college-level tests were used to compare three reliability approximations (two of Saupe and one of Cureton) to Kuder-Richardson Formula 20 (KR20). While the approximations correlated highly (about .9) with the reliability estimate, they tended to be underapproximations. The explanation lies in an apparent bias of Lord's approximation…
Descriptors: Comparative Analysis, Correlation, Error of Measurement, Statistical Analysis
Peer reviewedMcQuitty, Louis L.; Koch, Valerie L. – Educational and Psychological Measurement, 1975
A rapid method for hierarchically clustering the n objects of a matrix which portrays the interrelation of every object to every other object, where n equals any number up to 1,000 and even larger, is developed and discussed. Results compare favorably with those from other excellent methods. (Author/BJG)
Descriptors: Cluster Analysis, Comparative Analysis, Evaluation Methods, Matrices
McLaughlin, Thomas M.; And Others – Research Quarterly, 1977
Results of experimentation suggest that the cubic spline is a convenient and consistent method for providing an accurate description of displacement-time data and for obtaining the corresponding time derivatives. (MJB)
Descriptors: Comparative Analysis, Measurement Techniques, Physical Activities, Reliability
Peer reviewedSerlin, Ronald C.; Marascuilo, Leonard A. – Journal of Educational Statistics, 1983
Two alternatives to the problems of conducting planned and post hoc comparisons in tests of concordance and discordance for G groups of judges are examined. The two models are illustrated using existing data. (Author/JKS)
Descriptors: Attitude Measures, Comparative Analysis, Interrater Reliability, Mathematical Models
Peer reviewedConger, Anthony J.; Ward, David G. – Educational and Psychological Measurement, 1984
Sixteen measures of reliability for two-category nominal scales are compared. Upon correcting for chance agreement, there are only five distinct indices: Fleiss's modification of A-sub-1, the phi coefficient, Cohen's kappa, and two intraclass coefficients. Recommendations for choosing an agreement index are made based on definitions, magnitude,…
Descriptors: Comparative Analysis, Correlation, Data Analysis, Mathematical Models
Peer reviewedMartois, John S. – Educational and Psychological Measurement, 1973
Copies of this program may be obtained from the author at the University of Southern California, School of Pharmacy, University Park, Los Angeles 90007. (CB)
Descriptors: Comparative Analysis, Computer Programs, Input Output, Statistical Analysis
Peer reviewedMcConchie, Richard Duane; Rutschmann, Jacques – Perceptual and Motor Skills, 1971
Descriptors: College Students, Comparative Analysis, Males, Measurement
Peer reviewedTarling, Roger – Educational and Psychological Measurement, 1982
The Mean Cost Rating, P(A) from Signal Detection Theory, Kendall's rank correlation coefficient tau, and Goodman and Kruskal's gamma measures of predictive power are compared and shown to be different transformations of the statistic S. Gamma is generally preferred for hypothesis testing. Measures of association for ordered contingency tables are…
Descriptors: Comparative Analysis, Hypothesis Testing, Power (Statistics), Predictive Measurement
Peer reviewedWakefield, James A., Jr. – Educational and Psychological Measurement, 1980
Studies in applied behavior analysis have used two expressions of reliability for human observations: percentage agreement and correlational techniques (including the phi coefficient). Formulas for converting percentage agreement scores to phi coefficients and vice versa are presented. (Author/RL)
Descriptors: Behavioral Science Research, Comparative Analysis, Correlation, Mathematical Formulas
Peer reviewedBruck, Maggie; Ceci, Stephen J.; Hembrooke, Helene – Developmental Review, 2002
Used various suggestive techniques in repeated interviews with preschool children to elicit narratives about true and fictional events. Found that fictional narratives contained more spontaneous details, more elaborations, and more aggressive details than true narratives. Across retellings, false narratives were less consistent but contained more…
Descriptors: Comparative Analysis, Credibility, Honesty, Interviews
Peer reviewedBrutus, Stephane; Fleenor, John W.; London, Manuel – Journal of Management Development, 1998
Self, subordinate, peer, and supervisor ratings of 1,080 managers in education, military, government, manufacturing, finance, and health were analyzed for leniency, interrater agreement, and effectiveness. In the private sector, more poor performing managers tended to overestimate their performance. Interrater agreement was lowest in government…
Descriptors: Comparative Analysis, Feedback, Interrater Reliability, Job Performance
Frazier, Thomas W.; Naugle, Richard I.; Haggerty, Kathryn A. – Psychological Assessment, 2006
The 160-item short form of the Personality Assessment Inventory (PAI) was developed for situations in which respondents complete only the 1st half of the test. The present study evaluates the adequacy and comparability of the full and short forms of the PAI in terms of a wide range of psychometric characteristics. In all, 421 participants…
Descriptors: Psychometrics, Personality Assessment, Reliability, Evaluation Methods

Direct link
