Publication Date
| In 2026 | 0 |
| Since 2025 | 861 |
| Since 2022 (last 5 years) | 4466 |
| Since 2017 (last 10 years) | 10399 |
| Since 2007 (last 20 years) | 21862 |
Descriptor
| Test Validity | 21718 |
| Validity | 13766 |
| Test Reliability | 10818 |
| Foreign Countries | 9837 |
| Test Construction | 6862 |
| Factor Analysis | 5754 |
| Measures (Individuals) | 5614 |
| Predictive Validity | 5018 |
| Psychometrics | 4798 |
| Reliability | 4632 |
| Correlation | 4368 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1387 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 385 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedBlack, Paul – Studies in Educational Evaluation, 1995
The role of assessment in science education is explored, focusing on summative assessment in British public certificate examinations. Examples of test items are presented to illustrate difficulties in making valid and reliable assessments, and issues with implications for formative assessment are discussed. (SLD)
Descriptors: Educational Assessment, Feedback, Foreign Countries, Formative Evaluation
Peer reviewedStanovich, Keith E. – Learning Disability Quarterly, 1991
This paper argues that using intelligence as an aptitude benchmark in defining reading disability conceals unsupported assumptions about educational potential and makes it difficult to differentiate the cognitive characteristics of dyslexic children from those of other poor readers. The use of a more educationally relevant aptitude measure such as…
Descriptors: Academic Ability, Academic Achievement, Academic Aptitude, Construct Validity
Peer reviewedVu, Nu Viet; And Others – Academic Medicine, 1992
The use of a performance-based assessment of senior medical students' clinical skills utilizing standardized patients was evaluated, with 6,804 student-patient encounters involving 405 students over 6 years. Results provide evidence for test security, content validity, construct validity, reliability, and test ability to discriminate a wide range…
Descriptors: Clinical Experience, Evaluation Methods, Higher Education, Medical Education
Peer reviewedSpillane, Stephen A.; And Others – Journal of Learning Disabilities, 1992
This survey of 40 northeast state universities and colleges found that criteria used by admission personnel to determine the eligibility of undergraduate applicants with learning disabilities are similar to those employed for applicants without learning disabilities. Criteria differ significantly among institutions, and admission personnel do not…
Descriptors: Admission Criteria, Admissions Officers, College Admission, College Applicants
Peer reviewedDornyei, Zoltan; Katona, Lucy – Language Testing, 1992
A total of 102 university English majors were administered 4 different language tests to form a General Language Proficiency measure against which the C-test was evaluated. Results confirmed its reliability and validity and also provided data on text difficulty/appropriateness, word structure, content, and different scoring methods. (13…
Descriptors: College Students, English (Second Language), Higher Education, Language Proficiency
Peer reviewedTremblay, R. E.; And Others – International Journal of Behavioral Development, 1992
Mother and peer assessments of preschoolers' behavior were compared to teachers' responses on a preschool behavior questionnaire with three components. The disruptive component was highly correlated with peer assessments, and moderately correlated with mother assessments; the prosocial and anxious components were moderately correlated with mother…
Descriptors: Antisocial Behavior, Anxiety, Factor Analysis, Foreign Countries
Peer reviewedPennington, Bruce F.; And Others – Journal of Learning Disabilities, 1992
This study of 640 twins with reading disability and 436 controls (mean age 12) examined external validity of the distinction between specific reading retardation and reading backwardness, in 3 domains: genetic etiology, sex ratio and clinical correlates, and neuropsychological profiles. There was no evidence of differential genetic etiology of the…
Descriptors: Age Differences, Definitions, Educational Diagnosis, Elementary Secondary Education
Peer reviewedKline, Rex B.; Lachar, David – Psychological Assessment, 1992
Whether the external validity of the Personality Inventory for Children (PIC) was moderated by age, sex, or race was studied using 1,333 children and adolescents referred for mental health services. Race and sex generally did not moderate the relation of PIC scales to symptom checklists. Some relationships were age modified. (SLD)
Descriptors: Ability, Adolescents, Age Differences, Check Lists
Peer reviewedMittenberg, Wiley; And Others – Psychological Assessment, 1992
Normative data for the Wechsler Memory Scale-Revised were derived empirically using a sample of 50 volunteers between 25 and 34 years of age, who matched U.S. Census data on demographic characteristics. Differences between these empirical norms and published norms that were estimated statistically appear clinically significant. (SLD)
Descriptors: Adults, Census Figures, Demography, Diagnostic Tests
Peer reviewedBullis, Michael; Reiman, John – Exceptional Children, 1992
The Transition Competence Battery for Deaf Adolescents and Young Adults (TCB) measures employment and independent living skills. The TCB was standardized on students (N from 180 to 230 for the different subtests) from both mainstreamed and residential settings. Item statistics and subtest reliabilities were adequate; evidence of construct validity…
Descriptors: Adolescents, Competence, Deafness, Education Work Relationship
Peer reviewedMarshburn, Elaine C.; Aman, Michael G. – Journal of Autism and Developmental Disorders, 1992
Teacher ratings on the Aberrant Behavior Checklist were collected on 666 students with mental retardation attending special classes. Classroom placement and age had significant effects on subscale scores, whereas sex failed to affect ratings. The study concludes that the original scoring method, developed for individuals in residential facilities,…
Descriptors: Age Differences, Behavior Problems, Behavior Rating Scales, Children
Peer reviewedPike, Gary R. – Journal of General Education, 1992
Presents a study comparing two measures of general education outcomes, focusing on the validity of these instruments as measures of program effectiveness. Challenges the perceived advantages of standardized testing as measures of general education outcomes, recommending qualitative, institution-specific approaches relying on student and faculty…
Descriptors: College Outcomes Assessment, Comparative Analysis, Construct Validity, Evaluation Methods
Peer reviewedRomberg, Thomas A.; Wilson, Linda D. – Arithmetic Teacher, 1992
Examined 6 widely used grade-8 standardized tests for content, required processes, and level to determine their alignment with the 5-8 NCTM "Curriculum and Evaluation Standards." Concluded that these tests inadequately covered the 5-8 standards. A follow-up study examined items from newly developed and foreign tests to demonstrate the…
Descriptors: Achievement Tests, Educational Change, Elementary Education, Mathematics Achievement
Peer reviewedAllen, Linda; And Others – Journal of Educational Psychology, 1992
Different methods of measuring print experience were studied for 63 fifth graders who completed daily activity diaries. Book reading time estimate was correlated with new measures of reading time that use a checklist. Results support recognition checklist measures as convenient proxy indicators of children's print exposure. (SLD)
Descriptors: Check Lists, Children, Cognitive Processes, Construct Validity
Peer reviewedCohen, Robert; And Others – Academic Medicine, 1991
A study evaluated the feasibility of an objective structured clinical examination to assess the competence of foreign medical school graduates, clinical clerks, and interns to address clinical ethical situations. The University of Toronto's experience with the measure found it useful but in need of improvement. (MSE)
Descriptors: Clinical Experience, Ethics, Evaluation Criteria, Evaluation Methods


