Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 117 |
| Since 2017 (last 10 years) | 228 |
| Since 2007 (last 20 years) | 561 |
Descriptor
| Evaluation Methods | 1408 |
| Test Reliability | 1408 |
| Test Validity | 954 |
| Student Evaluation | 339 |
| Test Construction | 305 |
| Foreign Countries | 217 |
| Higher Education | 183 |
| Measurement Techniques | 170 |
| Psychometrics | 168 |
| Elementary Secondary Education | 147 |
| Evaluation Criteria | 122 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 74 |
| Practitioners | 72 |
| Teachers | 29 |
| Administrators | 18 |
| Policymakers | 11 |
| Students | 4 |
| Counselors | 3 |
| Support Staff | 3 |
| Community | 1 |
| Parents | 1 |
Location
| Australia | 24 |
| United Kingdom | 22 |
| Canada | 18 |
| Turkey | 16 |
| China | 14 |
| United States | 14 |
| California | 11 |
| Netherlands | 10 |
| Florida | 9 |
| Texas | 8 |
| United Kingdom (England) | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Crehan, Kevin D. – 1997
Writing fits well within the realm of outcomes suitable for observation by performance assessments. Studies of the reliability of performance assessments have suggested that interrater reliability can be consistently high. Scoring consistency, however, is only one aspect of quality in decisions based on assessment results. Another is…
Descriptors: Evaluation Methods, Feedback, Generalizability Theory, Interrater Reliability
Peer reviewedRaven, Ronald J. – Science Education, 1973
Discusses the development and analysis of Raven's Test of Logical Operations which uses the same problem solving rules as Piaget's. Indicates that the test is sensitive to the determination of students' difficulties with specific types of reasoning patterns. (CC)
Descriptors: Developmental Psychology, Educational Research, Elementary Education, Evaluation Methods
McLeod, John; Anderson, Jonathan – J Reading Behav, 1970
Descriptors: Cloze Procedure, Evaluation Methods, Information Theory, Language Patterns
Peer reviewedAskegaard, Lewis D.; Umila, Benwardo V. – Journal of Educational Measurement, 1982
Multiple matrix sampling of items and examinees was applied to an 18-item rank order instrument administered to a randomly assigned group and compared to the ordering and ranking of all items by control subjects. High correlations between ranks suggest the methodology may viably reduce respondent effort on long rank ordering tasks. (Author/CM)
Descriptors: Evaluation Methods, Item Sampling, Junior High Schools, Student Reaction
Peer reviewedHardesty, Larry; And Others – College and Research Libraries, 1979
Focuses primarily on summative evaluation of library-use instruction programs, using the development and results of a systematic assessment of such a program at DePauw University. Tabulated statistics and the evaluation instrument are provided. (Author/JD)
Descriptors: Academic Libraries, College Freshmen, Evaluation Methods, Library Instruction
Peer reviewedKitchin, R. M.; Jacobson, R. D. – Journal of Visual Impairment & Blindness, 1997
Assesses techniques used by researchers to collect and analyze data on how people with visual impairments or blindness learn, understand, and think about geographic space. Recommendations are made for increasing the validity of studies, including the use of multiple, mutually supportive tests; larger samples; and real-world environments.…
Descriptors: Blindness, Cognitive Tests, Data Collection, Data Interpretation
Peer reviewedBarthelemy, C.; And Others – Journal of Autism and Developmental Disorders, 1997
A French study of 136 children (ages 20-139 months) with developmental disabilities investigated the reliability and validity of the Revised Behavior Summarized Evaluation Scale (BSE-R) in evaluating autistic behavior in children with developmental delays. The BSE-R was found to be useful for progressive recording of the evolution of patients…
Descriptors: Autism, Children, Developmental Delays, Disability Identification
Peer reviewedBrantley, Ashley; Huebner, E. Scott; Nagle, Richard J. – Mental Retardation, 2002
The Multidimensional Students' Life Satisfaction Scale was used to compare life satisfaction reports of 80 high school students with mild mental disabilities with 80 typical students. Students with disabilities reported comparable positive levels with two exceptions: lower satisfaction with their friendships and higher satisfaction with school…
Descriptors: Evaluation Methods, Friendship, Life Satisfaction, Mild Mental Retardation
Peer reviewedBarthelemy, C.; And Others – Journal of Autism and Developmental Disorders, 1990
The Behavior Summarized Evaluation (BSE) measures changes in behavioral parameters in autistic children over time and treatments. Tests of reliability and validity suggest that the BSE is an acceptable tool for the assessment of autistic behaviors, is easy to handle, and is accessible to both professional and paraprofessional medicoeducative…
Descriptors: Autism, Behavior Change, Behavior Rating Scales, Elementary Secondary Education
Peer reviewedCahan, Sorel – Educational and Psychological Measurement, 1989
Statistical significance and "abnormality" have been used as criteria for the evaluation of intra-individual subtest score differences. Shortcomings of these criteria are identified, and improved estimates of the true score differences are suggested. The applicability of the abnormality criterion to these improved estimates is reviewed.…
Descriptors: Estimation (Mathematics), Evaluation Methods, Individual Differences, Mathematical Models
Peer reviewedSchriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1989
Three studies explored the effects of grouping versus randomized items in questionnaires on internal consistency and test-retest reliability with samples of 80, 80, and 100, respectively, university students and undergraduates. The 2 correlational and 1 experimental studies were reasonably consistent in demonstrating that neither format was…
Descriptors: Classification, College Students, Evaluation Methods, Higher Education
Bradley, Robert H.; And Others – American Journal on Mental Retardation, 1989
The usefulness and validity of the 3 versions (Infant-Toddler, Early Childhood, and Middle Childhood) of the HOME Inventory were studied with 261 children with cognitive, hearing, vision, or orthopedic handicaps. The Inventory in its original form and a modified form was subjected to analysis of reliability, construct validity, and criterion…
Descriptors: Concurrent Validity, Construct Validity, Disabilities, Elementary Education
Cooper, Terence H. – Journal of Agronomic Education (JAE), 1988
Describes a study used to determine differences in exam reliability, difficulty, and student evaluations. Indicates that when a fourth option was added to the three-option items, the exams became more difficult. Includes methods, results discussion, and tables on student characteristics, whole test analyses, and selected items. (RT)
Descriptors: Agronomy, College Science, Error of Measurement, Evaluation Methods
Peer reviewedSloan, R. L.; And Others – International Journal of Rehabilitation Research, 1992
This study tested the interrater reliability of the Modified Ashworth Scale in measuring upper and lower limb spasticity in 34 hemiplegic adult patients examined by 2 physiotherapists and 2 doctors. Findings indicated satisfactory reliability for upper limb spasticity but less satisfactory results for lower limb spasticity. (DB)
Descriptors: Adults, Behavior Rating Scales, Evaluation Methods, Interrater Reliability
Peer reviewedWeems, Richard A.; And Others – Journal of Dental Education, 1992
A procedure for testing the ability of dental students to detect presence and depth of dental caries was evaluated. Students (n=40) from four experience groups examined radiographs obtained from a model. Results indicated that this method of assessing student competence in radiographic interpretation is valid. (MSE)
Descriptors: Clinical Diagnosis, Dental Schools, Evaluation Methods, Higher Education


