[Peer reviewed] Yang, June C.; Laube, Douglas W. – Journal of Medical Education, 1983
The intraclass reliability of an oral examination administered during a clinical clerkship was analyzed in three ways: by traditional estimation, by faculty evaluation of taped examinations, and with use of a new evaluation form. A noticeable increase in reliability was found with the new form's use. (MSE)
Descriptors: Clinical Experience, Evaluation Methods, Higher Education, Medical Education
Morris, Lynn Lyons; Fitz-Gibbon, Carol Taylor; Lindheim, Elaine – 1987
The "CSE Program Evaluation Kit" is a series of nine books intended to assist people conducting program evaluations. This volume, the seventh in the kit, provides an overview of a variety of approaches to measuring performance outcomes. It presents considerations in deciding what to measure and in selecting or developing instruments best suited to…
Descriptors: Evaluation Methods, Evaluation Utilization, Performance Tests, Program Evaluation
[Peer reviewed] Brown, William R. – Journal of Research in Science Teaching, 1973
Describes the development of two forms of the Checklist for Assessment of Science Teachers and their application to preservice teachers. Reveals that differences in classroom activities and student-teacher relationships are found between the treatment and nontreatment teacher groups, but not in teacher personal adjustment. (CC)
Descriptors: Educational Programs, Evaluation Criteria, Evaluation Methods, Measurement Instruments
Tocci, Ronald J. – Technical Education News, 1971
The problem session approach represents a restructuring of traditional testing and evaluation procedures aimed at assessing student achievement and ability. (JS)
Descriptors: Educational Testing, Evaluation Methods, Problem Solving, Self Evaluation
Hess, Joseph W. – J Med Educ, 1969
In a comparison of two systems used for evaluating the skills of medical students in relating to patients, the one utilizing interaction analysis yielded more reliable ratings and seems to have potential as an instructional method. The other system used traditional types of judgments registered on a 10-point continuum. Both systems were used with…
Descriptors: Behavior Rating Scales, Comparative Analysis, Evaluation Methods, Interaction Process Analysis
[Peer reviewed] Hesselbrock, Michie N.; And Others – Journal of Consulting and Clinical Psychology, 1983
Compared three instruments assessing depression in alcoholics: Diagnostic and Statistical Manual of Mental Disorders (DSM-II), the Minnesota Multiphasic Personality Inventory Depression scale (MMPI D), and the Beck Depression Inventory (BDI). The number of subjects who were diagnosed as "depressed" varied considerably according to the…
Descriptors: Alcoholism, Comparative Testing, Depression (Psychology), Diagnostic Tests
[Peer reviewed] Berk, Ronald A. – Journal of Educational Measurement, 1980
A dozen different approaches that yield 13 reliability indices for criterion-referenced tests were identified and grouped into three categories: threshold loss function, squared-error loss function, and domain score estimation. Indices were evaluated within each category. (Author/RL)
Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Evaluation Methods
The Visual Motor Integration Test: High Interjudge Reliability, High Potential For Diagnostic Error.
[Peer reviewed] Snyder, Peggy P.; And Others – Psychology in the Schools, 1981
Investigated scoring agreement among three different training levels of Visual Motor Integration Test (VMI) diagnosticians. Correlational data demonstrated high interexaminer reliabilities; however, there were gross errors in precision after raw scores had been converted into VMI age equivalent scores. (Author/RC)
Descriptors: Educational Diagnosis, Evaluation Methods, Grade Equivalent Scores, Motor Development
[Peer reviewed] Harris, Larry P.; Wolf, Steven R. – Learning Disability Quarterly, 1979
The article focuses on the controversy over norm-referenced vs. criterion-referenced measures (CRMs) in the assessment of learning disorders. The authors contend that while the reliability of CRMs is generally indisputable, the validity of measures designed from local curricula is still dependent on the intuitive judgments of teachers. (Author/SBH)
Descriptors: Criterion Referenced Tests, Evaluation Methods, Learning Disabilities, Norm Referenced Tests
Santa Maria, D. L.; And Others – Research Quarterly, 1976
The O.S.U. Step Test was administered to 68 male university students to determine the objectivity of three methods of monitoring heart rate--subjects' count, investigator's count, and ECG records--with results indicating that the investigator was significantly more accurate in heart rate determination than were the subjects. (MB)
Descriptors: Cardiovascular System, College Students, Evaluation Methods, Exercise (Physiology)
Ross, Linda J.; Gallagher, Patricia A. – New Outlook for the Blind, 1976
Descriptors: Behavior Patterns, Elementary Secondary Education, Evaluation Methods, Exceptional Child Research
[Peer reviewed] Schultz, Margaret – Journal of Special Education, 1997
This study examined effects on 62 elementary students with learning disabilities undergoing triennial reevaluation of the change from the Wechsler Intelligence Scale for Children--Revised to the Wechsler Intelligence Scale for Children--Third Edition. Results indicate changes in the correlation with Woodcock-Johnson--Revised Tests of Achievement…
Descriptors: Disability Identification, Elementary Education, Evaluation Methods, Intelligence Tests
[Peer reviewed] Hayes, Jeffrey A. – Journal of Counseling Psychology, 1997
Examines the reliability and validity of the Brief Symptom Inventory (BSI). Results based on 2,078 clients at 31 university counseling centers who completed the BSI at intake indicate that, although internal consistency for the subscales was high, factor analyses yielded a six-factor solution rather than a nine-factor one. (RJM)
Descriptors: College Students, Diagnostic Tests, Evaluation Methods, Evaluation Problems
[Peer reviewed] Muris, Peter; Steerneman, Pim; Ratering, Elise – Journal of Autism and Developmental Disorders, 1997
A study of 10 children (ages 3-6) with pervasive developmental disorders investigated the interrater reliability of the Psychoeducational Profile (PEP). Results show good interrater reliability for the developmental items, indicating that the PEP can be used to evaluate progress in development of children with pervasive developmental disorders.…
Descriptors: Child Development, Children, Evaluation Methods, Foreign Countries
[Peer reviewed] Kim, Bryan S. K.; Cartwright, Brenda Y.; Asay, Penelope A.; D'Andrea, Michael J. – Measurement and Evaluation in Counseling and Development, 2003
On the basis of data from 2 studies with counseling graduate students, the Multicultural Awareness, Knowledge, and Skills Survey--Counselor Edition was revised. The new 33-item instrument consists of 10-item Awareness, 13-item Knowledge, and 10-item Skills subscales. Evidence of reliability and validity is described. (Contains 35 references and 3…
Descriptors: Counselor Training, Cultural Awareness, Cultural Pluralism, Evaluation Methods


