Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 27 |
| Since 2007 (last 20 years) | 46 |
Descriptor
| Test Reliability | 418 |
| Test Use | 418 |
| Test Validity | 297 |
| Test Construction | 143 |
| Elementary Secondary Education | 77 |
| Higher Education | 66 |
| Evaluation Methods | 60 |
| Psychometrics | 56 |
| Foreign Countries | 52 |
| Scoring | 49 |
| Standardized Tests | 49 |
| More ▼ | |
Source
Author
| Stansfield, Charles W. | 4 |
| Straus, Murray A. | 4 |
| Thompson, Bruce | 4 |
| Baker, Eva L. | 3 |
| Alsalam, Nabeel | 2 |
| Anderson, Stephen A. | 2 |
| Axelrod, Bradley N. | 2 |
| Boesel, David | 2 |
| Bricker, Diane | 2 |
| Burrell, Brenda | 2 |
| Clark, Duncan B. | 2 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 11 |
| Postsecondary Education | 11 |
| Elementary Education | 10 |
| Early Childhood Education | 7 |
| Elementary Secondary Education | 5 |
| Primary Education | 5 |
| Secondary Education | 5 |
| Grade 3 | 4 |
| Grade 4 | 4 |
| Grade 5 | 4 |
| Grade 6 | 4 |
| More ▼ | |
Audience
| Practitioners | 43 |
| Teachers | 17 |
| Researchers | 9 |
| Students | 8 |
| Administrators | 7 |
| Parents | 5 |
| Policymakers | 3 |
| Community | 2 |
| Counselors | 2 |
| Support Staff | 1 |
Location
| Australia | 10 |
| Canada | 6 |
| New York | 6 |
| Hong Kong | 3 |
| Finland | 2 |
| Georgia | 2 |
| Ireland | 2 |
| Israel | 2 |
| Massachusetts | 2 |
| Michigan | 2 |
| Netherlands | 2 |
| More ▼ | |
Laws, Policies, & Programs
| Education Consolidation… | 2 |
| Elementary and Secondary… | 1 |
| Every Student Succeeds Act… | 1 |
| Individuals with Disabilities… | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedWoodburn, Jim; Sutcliffe, Nick – Assessment & Evaluation in Higher Education, 1996
The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Descriptors: Clinical Teaching (Health Professions), Higher Education, Medical Education, Podiatry
Williams, Janice E.; Coombs, William T. – 1996
The reliability of A. Bandura's Multidimensional Scales of Perceived Self-Efficacy (MSPSE) was studied using the Cronbach alpha measure of internal consistency. The divergent validity of the MSPSE was also examined using subscale correlations, and the construct validity of the measure was studied through application of principal axes factor…
Descriptors: College Bound Students, Construct Validity, Factor Analysis, Factor Structure
Gaffney, Patrick V.; Byrd-Gaffney, Sharon – 1996
The Pupil Control Ideology Form (PCI) is one of the major instruments used by researchers interested in the study of school climate. Pupil control is a central feature of the organizational life of schools, and each school appears to have a prevailing ideology of pupil control. The PCI is a self-report instrument used to measure an educator's…
Descriptors: Construct Validity, Cross Cultural Studies, Discipline, Educational Environment
Reckase, Mark D. – 1997
This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…
Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability
Rodriguez-Aragon, Graciela; And Others – 1993
The predictive power of the Split-Half version of the Wechsler Intelligence Scale for Children--Revised (WISC-R) Object Assembly (OA) subtest was compared to that of the full administration of the OA subtest. A cohort of 218 male and 49 female adolescent offenders detained in a Texas juvenile detention facility between 1990 and 1992 was used. The…
Descriptors: Adolescents, Cohort Analysis, Comparative Testing, Correlation
Reuter, Jeanette; And Others – 1982
This panel presentation presents results of an assessment study of the reliability, validity, and utility of caregivers' reports on: (1) the behavioral competencies of severely handicapped children, and (2) the adaptive and intellectual behaviors of moderately handicapped children. The Kent Infant Development (KID) Scale (used with severely and…
Descriptors: Behavior Rating Scales, Developmental Stages, Evaluation Methods, Moderate Mental Retardation
Hughes, Kevin R.; And Others – 1989
The purpose of this study was to provide evidence for adapting and generalizing the use of the Children's Academic Motivation Inventory (CAMI) to high school students. The instrument was originally developed to provide a reliable, valid, theory-based measure of academic achievement motivation; it was suitable for use with children aged 12-14…
Descriptors: Academic Achievement, Achievement Tests, Generalizability Theory, Grade Point Average
Aiken, Lewis R. – 1979
The research literature on oral achievement testing is reviewed, and advantages and disadvantages of oral tests are described. A number of suggestions are made for improving the objectivity, reliability, and validity of oral tests. The results of a survey of the attitudes and experiences of a selected sample of college students with regard to…
Descriptors: Achievement Tests, Evaluation Methods, Interpretive Skills, Speech Skills
Peer reviewedPierson, Dorothy; And Others – Educational and Psychological Measurement, 1985
The construct validity and reliability of the Porter Needs Satisfaction Questionnaire (adapted) for educators were examined. Results did not support its use as suggested by Porter. Suggestions for its revision and alternate use are presented. (Author/GDC)
Descriptors: Attitude Measures, Elementary Secondary Education, Factor Structure, Job Satisfaction
Peer reviewedOnore, Cynthia S. – Journal of Reading, 1986
Reviews the Stanford Writing Assessment Program that has three intended uses: district-wide survey of students' writing ability, diagnosis of classroom or district-wide instructional strengths and weaknesses, and staff development through training for administering and scoring of writing samples. Notes that of these uses, none is necessarily best…
Descriptors: Educational Assessment, Educational Diagnosis, Evaluation Methods, Staff Development
Hayhoe, Mike – Highway One, 1985
Stresses the importance of devising accurate methods of evaluation rather than teaching material only because it can be easily evaluated. (DF)
Descriptors: Accountability, English Instruction, Evaluation Criteria, Evaluation Methods
Mittag, Kathleen – 1998
Measures of normal variations in personality, called "psychological type," are frequently used in education (e.g., to identify learning styles) and counseling (e.g., in career counseling). However, the most frequently used measure of types has been criticized on various psychometric grounds. The present study investigated the…
Descriptors: Counseling, Factor Analysis, High School Students, High Schools
Peer reviewedHamid, P. Nicholas; Cheng, Sheung-Tak – Educational and Psychological Measurement, 1996
A short measure of trait and state negative and positive affect, the Chinese Affect Scale, was developed for Chinese-speaking people and tested in Hong Kong with 306 community adults and 314 college students. Scores had reasonable internal and retest reliabilities and high convergent and discriminant validity. (SLD)
Descriptors: Adults, Affective Behavior, Chinese, College Students
Peer reviewedWodtke, Kenneth H.; And Others – Educational Evaluation and Policy Analysis, 1989
A qualitative observational study of standardized group testing in 10 kindergartens revealed variations in testing conditions, discrepancies from standardized administration procedures, and variations in children's behavior that contributed to difficulties in maintaining a uniform testing process. High-stakes group testing in kindergarten should…
Descriptors: Classroom Observation Techniques, Group Testing, Kindergarten, Primary Education
Peer reviewedCicchetti, Domenic V. – Psychological Assessment, 1994
In the context of developing assessment instruments in psychology, issues of standardization, norming procedures, and test reliability and validity are discussed. Criteria, guidelines, and rules of thumb are provided to help the clinician with instrument selection for a given psychological assessment. (SLD)
Descriptors: Clinical Diagnosis, Criteria, Evaluation Methods, Guides


