Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 117 |
| Since 2017 (last 10 years) | 228 |
| Since 2007 (last 20 years) | 561 |
Descriptor
| Evaluation Methods | 1408 |
| Test Reliability | 1408 |
| Test Validity | 954 |
| Student Evaluation | 339 |
| Test Construction | 305 |
| Foreign Countries | 217 |
| Higher Education | 183 |
| Measurement Techniques | 170 |
| Psychometrics | 168 |
| Elementary Secondary Education | 147 |
| Evaluation Criteria | 122 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 74 |
| Practitioners | 72 |
| Teachers | 29 |
| Administrators | 18 |
| Policymakers | 11 |
| Students | 4 |
| Counselors | 3 |
| Support Staff | 3 |
| Community | 1 |
| Parents | 1 |
Location
| Australia | 24 |
| United Kingdom | 22 |
| Canada | 18 |
| Turkey | 16 |
| China | 14 |
| United States | 14 |
| California | 11 |
| Netherlands | 10 |
| Florida | 9 |
| Texas | 8 |
| United Kingdom (England) | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Peer reviewedDimes, R. E. – Physics Education, 1973
Summarizes classifications of educational objectives, types of test items, and principles underlying the construction of tests. Indicates that a combination of essay and objective tests is preferable in the overall evaluation of a course. (CC)
Descriptors: Behavioral Objectives, Evaluation Methods, Item Analysis, Objective Tests
Wright, Logan; Dunn, Thomas – Educ Psychol Meas, 1970
Descriptors: Evaluation Methods, Factor Analysis, Factor Structure, Measurement Techniques
Peer reviewedNisonger, Thomas E. – Library Resources and Technical Services, 1983
Using random selection of citations from journal articles, two specific permutations of the citation checking approach to university library collection evaluation are tested on political science collections in five university libraries in the Washington, D.C. area. The history of the citation checking approach is reviewed. Forty-three references…
Descriptors: Academic Libraries, Citations (References), Evaluation Methods, Higher Education
Peer reviewedWebb, Melvin W., II – Journal of Reading, 1983
Uses a scale to analyze three tests used for assessing community college students' reading levels: The Stanford Diagnostic Reading Test (SDRT), the Nelson-Denny Reading Test (NDRT), and the Iowa Silent Reading Tests (ISRT). Judges the ISRT to be the best and the NDRT to be unacceptable according to the criteria of the scale. (FL)
Descriptors: Community Colleges, Evaluation Methods, Higher Education, Measures (Individuals)
Peer reviewedLange, Bob – Reading Teacher, 1982
Examines the arguments against the indiscriminate use of readability formulas. (FL)
Descriptors: Elementary Education, Evaluation Methods, Readability Formulas, Reading Diagnosis
Peer reviewedEpstein, Howard R.; Weber, Donald B. – Mental Retardation, 1980
Evaluations were made of 46 mentally retarded residents (mean age=18.32 years) in a central Ohio state institution using part one of the American Association on Mental Deficiency's Adaptive Behavior Scale. A significant overall setting effect was not revealed in the multivariate analysis of variance for repeated measures. (Author)
Descriptors: Adjustment (to Environment), Adults, Behavior Rating Scales, Evaluation Methods
Stamm, Carol Lee; Moore, Joyce E. – Research Quarterly, 1980
Generalizability theory provides the teacher and the researcher with a flexible method for establishing reliability coefficients in tests. This theory is effective in estimating reliability for a set of motor performance test scores. (CJ)
Descriptors: Educational Research, Evaluation Methods, Motor Development, Performance Tests
Peer reviewedMaxon, Antonia Brancia; White, Karl R.; Culpepper, Brandt; Vohr, Betty R. – Journal of Communication Disorders, 1997
Describes factors that can affect the referral rate for otoacoustic emission-based newborn hearing screening and discusses the screening results of 1,328 newborns screened with transient evoked otoaoustic emissions prior to hospital discharge. The youngest infants were as likely to pass as infants who were 24-27 hours old. (Author/CR)
Descriptors: Age Differences, Auditory Tests, Evaluation Methods, Hearing Impairments
Peer reviewedPlucker, Jonathan A. – Journal of Secondary Gifted Education, 1997
This study used a sample (n=967) of academically gifted adolescent students attending summer enrichment programs and participating in urban school districts' gifted programs to evaluate the reliability and validity of the Adolescent Coping Scale. Results suggest the instrument is sufficiently reliable for group administration and research purposes…
Descriptors: Academically Gifted, Adolescents, Coping, Elementary Secondary Education
Glascoe, Frances Page – Diagnostique, 1997
In order to locate optimal cutoff scores for detecting developmental delays, the Brigance Screens were administered to 408 children (ages 21-48 months) along with a criterion battery measuring achievement and intelligence. The Receiver Operating Characteristic analyses were then used to locate optical cutoff scores for each form of the Brigance.…
Descriptors: Developmental Delays, Disability Identification, Early Identification, Evaluation Methods
Peer reviewedCohen, Ira L.; Schmidt-Lackner, Susan; Romanczyk, Raymond; Sudhalter, Vicki – Journal of Autism and Developmental Disorders, 2003
Two studies evaluated the PDD Behavior Inventory, (PDDBI), a rating scale designed to assess adaptive and maladaptive behaviors of children having a pervasive developmental disorder (PDD). It was concluded that the PDDBI is both reliable and valid and is useful in providing information not typically available in most instruments used to assess…
Descriptors: Behavior Rating Scales, Children, Elementary Education, Evaluation Methods
Peer reviewedMurray, Bruce A.; Smith, Kimberly A.; Murray, Geralyn G. – Journal of Literacy Research, 2000
Tests the validity of the Test of Phoneme Identities (TPI). Finds the TPI to be reliable and comparable to other phoneme awareness measures in predicting decoding ability; and to be more effective than a nursery rhyme and alphabet measures in predicting the number of lessons required for a student to learn to distinguish phonetic cues. (RS)
Descriptors: Decoding (Reading), Evaluation Methods, Kindergarten, Phonemes
Brems, Christiane; And Others – American Journal on Mental Retardation, 1990
Developmental Record protocols were obtained from archival files for 1,069 institutionalized mentally retarded adults and children. Analyses revealed high internal reliabilities, but questionable validity. The Developmental Record did not adequately discriminate the five areas of functioning it purportedly assesses (self-care, perceptual-motor,…
Descriptors: Adults, Child Development, Children, Evaluation Methods
Peer reviewedStreufert, Siegfried; And Others – Personnel Psychology, 1988
Evaluated quasi-experimental simulation technique designed to measure impact of individual differences in managerial styles on executive performance. Tested 20 simulation-based measures for reliability and validity. Data from two samples suggest that this quasi-experimental simulation technology may be useful in assessing managerial styles not…
Descriptors: Administrator Qualifications, Competence, Evaluation Methods, Individual Differences
Peer reviewedRyser, Gail R. – Journal of Secondary Gifted Education, 1994
The meanings of reliability and validity as they apply to standardized measures are used as a framework for applying the concepts of reliability and validity to authentic assessments. This article sees reliability as scorability and stability, whereas validity is seen as students' ability to use knowledge authentically in the field. (DB)
Descriptors: Elementary Secondary Education, Evaluation Methods, Performance Based Assessment, Reliability


