Gustafsson, Jan-Eric; Undheim, Johan Olav – Journal of Educational Psychology, 1992
The stability of some dimensions of ability between the ages of 12 and 15 years was investigated for 225 boys and 242 girls in Sweden. Testing in grades 6, 8, and 9 indicated high stability for the general intelligence factor and for the residual of the General Visual factor. (SLD)
Descriptors: Ability, Adolescents, Age Differences, Comparative Testing

Lindblad, Torsten – System, 1992
Looks at the large-scale experiments on the testing of oral proficiency in English, French, and German that have been carried out over the last five years in the Swedish gymnasium. Various kinds of tasks and different grading criteria have been used, and the practical problems of scheduling and of teacher training have been discussed. (nine…
Descriptors: English (Second Language), Foreign Countries, French, German

Isaacson, Stephen L. – Learning Disabilities Research and Practice, 1992
This review of the Test of Early Written Language concludes that the test succeeds in identifying students who are below their peers in writing and in measuring long-term gains in written language achievement, but its format makes it difficult to document specific strengths and weaknesses, and its reliability and validity have not been…
Descriptors: Early Childhood Education, Evaluation Methods, Student Evaluation, Test Reliability

Stoskopf, Carleen H.; And Others – Evaluation Review, 1992
Data are presented that demonstrate the reliability and construct validity of a 27-item behaviorally anchored rating scale (BARS) used to rate the performance of 757 nursing assistants in South Carolina. Results support the reliability and construct validity of the BARS and the usefulness of the BARS approach for evaluation. (SLD)
Descriptors: Construct Validity, Evaluation Methods, Long Term Care, Measurement Techniques

Greenan, James P.; Jarwan, Fathi A. – Career Development for Exceptional Individuals, 1992
This study focused on the validation of Generalizable Reasoning Skills assessment instruments with students with disabilities in secondary vocational programs. Results indicated that student self-ratings, teacher ratings, and a performance test were internally consistent and precise measures of reasoning skills for some uses but that most…
Descriptors: Abstract Reasoning, Disabilities, Evaluation Methods, Generalization

Smith-Sebasto, N. J. – Journal of Environmental Education, 1992
A study reveals the need for extensive refinement of the Revised Perceived Environmental Control Measure purported in the past to be a reliable and valid instrument to measure the relationship between the psychological construct, "locus of control," and environmental action or environmentally responsible behavior. (MCO)
Descriptors: Behavior, Behavioral Science Research, Concurrent Validity, Construct Validity

Johnson, William L.; And Others – Teacher Education and Practice, 1992
This article briefly reviews findings from more than 250 research studies on instructional leadership and productive schools and discusses development and field testing of a needs assessment instrument for assessment of the continuing education needs of principals. (IAH)
Descriptors: Administrator Education, Educational Needs, Educational Research, Elementary Secondary Education

Abu-Hilal, Maher M.; Salameh, Kayed M. – Educational and Psychological Measurement, 1992
To assess the reliability and validity of the Maslach Burnout Inventory (MBI) in a non-Western setting, the instrument was administered to 223 teachers in Jordan. Results indicate an acceptable reliability for the MBI and suggest that it has promise for use in non-Western countries. (SLD)
Descriptors: Construct Validity, Cross Cultural Studies, Developing Nations, Elementary School Teachers

Lindsey, Pam – Education and Training in Mental Retardation and Developmental Disabilities, 1994
The Consent Screening Interview was developed to enable consumers with mental retardation to express views and preferences about community residential placements and indicate to service providers their ability to give informed consent. Analysis of content and construct validity and interrater reliability, involving 69 subjects, revealed that the…
Descriptors: Adults, Cognitive Ability, Comprehension, Evaluation Methods

Baer, John – Roeper Review, 1994
Two studies are reported that measure the long-term stability of performance assessments involving story-writing and poetry-writing (involving grade four and five students) and story-telling (involving grade two students). The long-term stability of these assessments compares favorably with stability figures for other creativity tests. (Author/JDD)
Descriptors: Creative Thinking, Creativity, Creativity Tests, Elementary Education

Wadsworth, John S.; Harper, Dennis C. – Journal of the Association for Persons with Severe Handicaps (JASH), 1991
Subscales of the Sheltered Care Environmental Scale dealing with conflict, cohesion, and independence were administered to 47 adults with moderate mental retardation on 4 occasions using either a verbal format or picture-cued format. Results indicated that the use of pictures enhanced the test-retest reliability of the instrument. (Author/JDD)
Descriptors: Adults, Conflict, Group Unity, Institutionalized Persons

Walker, Hill M.; And Others – School Psychology Review, 1991
Psychometric characteristics and the replicability of the factor structure of the adolescent version (grades 7-12) of the Walker-McConnell Scale of Social Competence and School Adjustment were studied in an initial wave (n=266) of the national normative sample. The version studied has substantial utility in assessing adolescent social…
Descriptors: Adolescents, Age Differences, Factor Analysis, Factor Structure

Sevin, Jay A.; And Others – Journal of Autism and Developmental Disorders, 1991
This study, involving 24 children or adolescents with pervasive developmental disorders, assessed 3 autism scales: Autism Behavior Checklist, Real Life Rating Scale, and Childhood Autism Rating Scale. The study analyzed interrater reliability, correlations between pairs of the three scales, diagnostic classification cutoff scores, and…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Check Lists, Educational Diagnosis

Nelson, Jack K.; And Others – Research Quarterly for Exercise and Sport, 1991
Researchers studied the reliability of the modified push-up test in measuring upper body strength and endurance in elementary through college students and examined the accuracy of partner scoring. The test proved much easier to administer than the regular floor push-up. It was valid and reliable for all students and suitable for partner…
Descriptors: College Students, Elementary School Students, Elementary Secondary Education, High School Students

Shatzer, John H.; And Others – Academic Medicine, 1993
A study compared the generalizability of 36 medical students' performance scores under systematically varied station times in 2 surgery end-of-clerkship performance-based examinations. Results indicated longer station length decreased generalizability of scores by decreasing variability among students' performances. Testing time was also affected.…
Descriptors: Academic Achievement, Clinical Experience, Competency Based Education, Higher Education