Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 375 |
| Since 2007 (last 20 years) | 1130 |
Descriptor
| Comparative Analysis | 1943 |
| Reliability | 880 |
| Test Reliability | 792 |
| Foreign Countries | 554 |
| Test Validity | 443 |
| Correlation | 350 |
| Validity | 332 |
| Interrater Reliability | 327 |
| Statistical Analysis | 321 |
| Scores | 280 |
| Measures (Individuals) | 236 |
| More ▼ | |
Source
Author
| Reckase, Mark D. | 6 |
| Attali, Yigal | 5 |
| Coniam, David | 5 |
| Brennan, Robert L. | 4 |
| Crehan, Kevin D. | 4 |
| Feldt, Leonard S. | 4 |
| Hakstian, A. Ralph | 4 |
| Jones, Ian | 4 |
| Kolen, Michael J. | 4 |
| Lunz, Mary E. | 4 |
| August, Diane | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 35 |
| Practitioners | 29 |
| Teachers | 15 |
| Administrators | 9 |
| Policymakers | 6 |
| Counselors | 2 |
| Media Staff | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| Turkey | 59 |
| United States | 47 |
| Australia | 36 |
| China | 33 |
| Canada | 32 |
| United Kingdom (England) | 32 |
| United Kingdom | 28 |
| Germany | 25 |
| Netherlands | 24 |
| Taiwan | 22 |
| Hong Kong | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Peer reviewedBerry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1997
Describes a FORTRAN software program that calculates the probability of an observed difference between agreement measures obtained from two independent sets of raters. An example illustrates the use of the DIFFER program in evaluating undergraduate essays. (Author/SLD)
Descriptors: Comparative Analysis, Computer Software, Evaluation Methods, Higher Education
Peer reviewedSchroeder, Marsha L.; Hakstian, A. Ralph – Psychometrika, 1990
A 2-facet measurement model is identified, and its coefficient of generalizability (CG) is examined. Three other multifaceted measurement models and their CGs are identified. An empirical investigation of all four procedures is conducted using data from a study of the psychopathology of 71 prison inmates. (SLD)
Descriptors: Comparative Analysis, Equations (Mathematics), Generalizability Theory, Mathematical Models
Peer reviewedMcGee, Robin A.; And Others – Child Abuse & Neglect: The International Journal, 1995
This study examined the comparability and predictive validity of three approaches to measurement of child or adolescent maltreatment, involving the adolescents themselves (n=160), case file review by researchers, and protection agency social workers. Comparison of ratings across sources indicated considerable disagreement with respect to judgments…
Descriptors: Adolescents, Child Abuse, Comparative Analysis, Interrater Reliability
Peer reviewedSeifer, Ronald; And Others – Child Development, 1994
Observers and mothers rated infant behavior in the home on dimensions of temperament once a week for eight weeks. Although week-to-week correlations were modest, aggregates of the eight observations had high reliability for both observers and mothers. When direct observations were compared with mother reports, little evidence of mother-observer…
Descriptors: Comparative Analysis, Infant Behavior, Infants, Interrater Reliability
Peer reviewedLandrine, Hope; Klonoff, Elizabeth A. – Journal of Black Psychology, 1995
Studied African American culture, using a new, shortened, 33-item African American Acculturation Scale (AAAS-33) to assess the scale's validity and reliability. Comparisons between the original form and AAAS-33 reveal high correlations, however, the longer form may be sensitive to some beliefs, practices, and attitudes not assessed by the short…
Descriptors: Acculturation, Blacks, Comparative Analysis, Correlation
Peer reviewedStansfield, Charles W.; Kenyon, Dorry Mann – System, 1992
Reviews research that sheds light on the comparability of Oral Proficiency Interview and the Simulated Oral Proficiency Interview. Suggestions are provided for further research. (16 references) (VWL)
Descriptors: Comparative Analysis, Interviews, Language Proficiency, Language Tests
Peer reviewedPorter, Stephen; And Others – Child Abuse & Neglect: The International Journal, 1995
Fifteen deaf and 11 hearing children (ages 8-10) witnessed slides depicting a wallet theft and were interviewed using a free recall approach followed by increasingly directive questions. Although accuracy of the two groups did not differ in free recall, deaf children provided less accurate responses to directive questions, whereas accuracy of the…
Descriptors: Comparative Analysis, Deafness, Information Sources, Memory
Peer reviewedSherman, Thomas M. – Journal of General Education, 1991
Reviews commercially available instruments to assess students' study skills/habits/behavior, finding that none completely meets the standards set by the American Psychological Association or the American Educational Research Association. Offers guidance on selecting and applying these instruments in college settings. (DMM)
Descriptors: Comparative Analysis, Higher Education, Standardized Tests, Standards
Krippendorff, Klaus – Human Communication Research, 2004
In a recent article in this journal, Lombard, Snyder-Duch, and Bracken (2002) surveyed 200 content analyses for their reporting of reliability tests, compared the virtues and drawbacks of five popular reliability measures, and proposed guidelines and standards for their use. Their discussion revealed that numerous misconceptions circulate in the…
Descriptors: Misconceptions, Content Analysis, News Reporting, Measurement Techniques
Abdullah, Firdaus – Quality Assurance in Education: An International Perspective, 2005
Purpose: The purpose of this paper is to empirically test a new industry-specific scale, HEdPERF (Higher Education PERFormance) to capture the authentic determinants of service quality within higher education sector. Design/methodology/approach: The primary goal of this research was to test and compare the relative efficacy of HEdPERF against…
Descriptors: Higher Education, Measurement, Comparative Analysis, Educational Quality
Subramaniam, Selva Ranee; Cheong, Loh Sau – Journal of Science and Mathematics Education in Southeast Asia, 2008
This study sought to explore the emotional intelligence of Form One mathematics and science teachers. The emotional intelligence of the teachers was determined using the Emotional Intelligence for Mathematics and Science Teachers (EIMST) survey instrument. It was adapted and adopted from related instruments and then pilot tested for validity and…
Descriptors: Emotional Intelligence, Teaching Methods, Science Teachers, Mathematics Teachers
Guskey, Thomas R. – Educational Measurement: Issues and Practice, 2007
This study compared different stakeholders' perceived validity of various indicators of student learning used to judge the quality of students' academic performance. Data were gathered from the questionnaire responses of 314 educators in three states that have implemented comprehensive state-wide assessment programs with high-stakes consequences…
Descriptors: Academic Achievement, Educational Indicators, State Surveys, Participation
Mann, Rebecca – 1988
In response to growing concern about the lack of basic writing skills, this paper presents an overview of the issues involved in selecting a method for the assessment of students' writing skills. After general criteria for determining the appropriateness of a writing evaluation procedure are outlined, the merits and limitations of objective tests…
Descriptors: Comparative Analysis, Evaluation Criteria, Evaluation Methods, Holistic Evaluation
Huynh, Huynh – 1977
Three techniques for estimating Kuder Richardson reliability (KR20) coefficients for incomplete data are contrasted. The methods are: (1) Henderson's Method 1 (analysis of variance, or ANOVA); (2) Henderson's Method 3 (FITCO); and (3) Koch's method of symmetric sums (SYSUM). A Monte Carlo simulation was used to assess the precision of the three…
Descriptors: Analysis of Variance, Comparative Analysis, Mathematical Models, Monte Carlo Methods
Peer reviewedEbel, Robert L. – Journal of Educational Measurement, 1975
Descriptors: Comparative Analysis, Multiple Choice Tests, Objective Tests, Teachers

Direct link
