Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 27 |
| Since 2007 (last 20 years) | 46 |
Descriptor
| Test Reliability | 418 |
| Test Use | 418 |
| Test Validity | 297 |
| Test Construction | 143 |
| Elementary Secondary Education | 77 |
| Higher Education | 66 |
| Evaluation Methods | 60 |
| Psychometrics | 56 |
| Foreign Countries | 52 |
| Scoring | 49 |
| Standardized Tests | 49 |
| More ▼ | |
Source
Author
| Stansfield, Charles W. | 4 |
| Straus, Murray A. | 4 |
| Thompson, Bruce | 4 |
| Baker, Eva L. | 3 |
| Alsalam, Nabeel | 2 |
| Anderson, Stephen A. | 2 |
| Axelrod, Bradley N. | 2 |
| Boesel, David | 2 |
| Bricker, Diane | 2 |
| Burrell, Brenda | 2 |
| Clark, Duncan B. | 2 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 11 |
| Postsecondary Education | 11 |
| Elementary Education | 10 |
| Early Childhood Education | 7 |
| Elementary Secondary Education | 5 |
| Primary Education | 5 |
| Secondary Education | 5 |
| Grade 3 | 4 |
| Grade 4 | 4 |
| Grade 5 | 4 |
| Grade 6 | 4 |
| More ▼ | |
Audience
| Practitioners | 43 |
| Teachers | 17 |
| Researchers | 9 |
| Students | 8 |
| Administrators | 7 |
| Parents | 5 |
| Policymakers | 3 |
| Community | 2 |
| Counselors | 2 |
| Support Staff | 1 |
Location
| Australia | 10 |
| Canada | 6 |
| New York | 6 |
| Hong Kong | 3 |
| Finland | 2 |
| Georgia | 2 |
| Ireland | 2 |
| Israel | 2 |
| Massachusetts | 2 |
| Michigan | 2 |
| Netherlands | 2 |
| More ▼ | |
Laws, Policies, & Programs
| Education Consolidation… | 2 |
| Elementary and Secondary… | 1 |
| Every Student Succeeds Act… | 1 |
| Individuals with Disabilities… | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedJohnson, Mark E.; Fisher, Dennis G.; Rhodes, Fen; Booth, Robert – Assessment, 1996
The Wide Range Achievement Test-Revised and the Woodcock Reading Mastery Tests-Revised were administered twice to 269 current drug abusers over an average time interval of 204.2 days. Overall, the study demonstrates that the two instruments have strong psychometric properties and that results from current drug abusers are reliable. (SLD)
Descriptors: Adults, Concurrent Validity, Drug Abuse, Psychometrics
Peer reviewedAxelrod, Bradley N.; And Others – Psychological Assessment, 1996
The underlying structure of the Postconcussion Syndrome Questionnaire (PCS) was evaluated in a large sample of 1,116 medical and psychiatric patients. Balancing internal consistency, confirmatory factor analysis, and parsimony results in endorsement of the four-factor solution for the PCS for this sample. (SLD)
Descriptors: Adults, Evaluation Methods, Factor Structure, Head Injuries
Lock, Robin H.; Layton, Carol A. – College and University, 2002
Examined the ability of the Learning Disabilities Diagnostic Inventory (LDDI) to differentiate between postsecondary populations with and without learning disabilities. Found that the LDDI is a reliable method for identifying the possibility of a learning disability in postsecondary students. (EV)
Descriptors: Diagnostic Tests, Educational Diagnosis, Learning Disabilities, Postsecondary Education
Peer reviewedClark, Duncan B.; And Others – Psychological Assessment, 1994
Reliability and validity of the Social Phobia and Anxiety Scale, a measure developed for adults, were studied for adolescents using a sample of 223 adolescents ages 12 to 18, from community and clinical sources. Results demonstrate that the instrument is a reliable and valid measure of social phobia for adolescents. (SLD)
Descriptors: Adolescents, Anxiety, Attitude Measures, Social Attitudes
Peer reviewedMeier, Augustine; Boivin, Micheline – Journal of Consulting and Clinical Psychology, 1986
The Client Verbal Response Category System classifies client responses into Temporal, Directional and Experiential categories. The categories with their subcategories are defined, interjudge reliability data is presented, and the instrument's utility in psychotherapy process research is demonstrated. Initial results indicate that the instrument is…
Descriptors: Client Characteristics (Human Services), Interrater Reliability, Psychotherapy, Research Tools
Peer reviewedNickel, Elizabeth J.; And Others – Adolescence, 1986
Reports the reliability and concurrent validity of both the trait and state forms of the Multiple Affect Adjective Check List (MAACL) with a high school population, age range of 14-16 years and educational range of 9th and 10th grades (N=403). Findings indicate the MAACL is sufficiently reliable and valid to warrant additional use with an…
Descriptors: High School Students, Secondary Education, Sex Differences, Test Reliability
Peer reviewedTillinghast, B. S., Jr.; And Others – Journal of Educational Research, 1983
A study using the Peabody Picture Vocabulary Test (Revised) was conducted to determine whether the increase in reliability when both Forms L and M were employed justified the increase in time required for the longer procedure. Children in grades four, five, and six were involved in the project. (PP)
Descriptors: Intermediate Grades, Test Reliability, Test Results, Test Use
Peer reviewedCallahan, Carolyn M.; And Others – Gifted Child Quarterly, 1993
This article describes the Scale for the Evaluation of Gifted Identification Instruments, developed for use by school decision makers. Development of the scale is reviewed in terms of five areas of assessment: validity, reliability, propriety, respondent appropriateness, and utility. Specific guidelines and cautions in using the scale are also…
Descriptors: Ability Identification, Gifted, Screening Tests, Test Reliability
Peer reviewedUtsey, Shawn O. – Journal of Black Psychology, 1998
Reviews six instruments developed to assess the psychological processes associated with experiences of racism among African Americans. Recommendations for modifications, revisions, and additional reliability and validity evidence are presented for each of these relatively new measures. (SLD)
Descriptors: Blacks, Racial Bias, Stress Variables, Test Construction
Lehr, Camilla A.; And Others – 1986
Information about current assessment practices was obtained from 54 surveys completed by Handicapped Children's Early Education Program (HCEEP) demonstration projects across the United States. Information about factors influencing the selection and continued use of tests also was provided. Results indicated that 19 tests were used by five or more…
Descriptors: Demonstration Programs, Disabilities, National Surveys, Preschool Education
Reuter, Jeanette; And Others – 1982
Of the 15 substantive papers in this report, 12 focus on the use of the Kent Infant Development (KID) Scale with severely handicapped children. The KID Scale measures 252 behaviors usually developed during the first year of life in five domains (cognitive, motor, language, self-help, and social). It was successfully adapted to elicit reliable…
Descriptors: Infants, Severe Disabilities, Student Evaluation, Test Reliability
Peer reviewedLindley, Celeste M.; And Others – American Journal of Pharmaceutical Education, 1986
Student perceptions of the preparation for and fairness, value as a learning experience, and logistics of a newly implemented oral examination in a course preparing senior pharmacy students for community pharmacy practice are summarized. (MSE)
Descriptors: Higher Education, Pharmaceutical Education, Student Attitudes, Test Reliability
Peer reviewedWiersma, Uco; Latham, Gary P. – Personnel Psychology, 1986
The practicality of three appraisal instruments was measured in terms of user preference, namely, behavioral observation scales (BOS), behavioral expectation scales (BES), and trait scales. In all instances, BOS were preferred to BES, and in all but two instances, BOS were viewed as superior to trait scales. (Author/ABB)
Descriptors: Administrators, Behavior Patterns, Behavior Rating Scales, Personnel Evaluation
An Evaluation of the Diagnostic Efficiency of the Wechsler Intelligence Scale for Children--Revised.
Peer reviewedMueller, Horst H.; And Others – Alberta Journal of Educational Research, 1984
Because diagnostic capability of the WISC-R has remained in doubt, its diagnostic suitability was assessed by applying Kelley's method of estimating the proportion of score differences in excess of chance to the original subscales, Bannatyne clusters, and Kaufman's three factor groupings. Caution should be used when applying WISC-R diagnostically.…
Descriptors: Clinical Diagnosis, Comparative Analysis, Evaluation Criteria, Tables (Data)
Peer reviewedMackie, Kerrie; Dermody, Phillip – Journal of Speech and Hearing Research, 1986
The monosyllabic adaptive speech test (MAST) procedures were found to be reliable with children as young as 3. The accuracy of the MAST estimate of 50% speech threshold was confirmed in 60 hearing imparied and normal children 3-7 years old. With 10 hearing impaired children, MAST threshold was significantly correlated with pure-tone loss. (CL)
Descriptors: Audiometric Tests, Elementary Education, Hearing Impairments, Preschool Education

Direct link
