Publication Date
| In 2026 | 1 |
| Since 2025 | 899 |
| Since 2022 (last 5 years) | 4508 |
| Since 2017 (last 10 years) | 10441 |
| Since 2007 (last 20 years) | 21904 |
Descriptor
| Test Validity | 21743 |
| Validity | 13779 |
| Test Reliability | 10839 |
| Foreign Countries | 9859 |
| Test Construction | 6878 |
| Factor Analysis | 5756 |
| Measures (Individuals) | 5619 |
| Predictive Validity | 5019 |
| Psychometrics | 4806 |
| Reliability | 4634 |
| Correlation | 4373 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1393 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedKennedy, Craig H. – Education and Treatment of Children, 2002
This article presents criteria researchers should consider when seeking to establish a socially valid understanding of problem behavior. Criteria include determining if behavior change is demonstrated in typical settings and whether the intervention promotes movement in the least restrictive environment, is conducted by families and/or school and…
Descriptors: Adults, Behavior Change, Behavior Disorders, Behavior Modification
Peer reviewedRojahn, Johannes; Aman, Michael G.; Matson, Johnny L.; Mayville, Erik – Research in Developmental Disabilities, 2003
A study compared the Aberrant Behavior Checklist (ABC) and the Behavior Problems Inventory (BPI) for assessing the maladaptive behavior of 226 adults, mostly with severe or profound mental retardation. Individuals with elevated BPI scores generally had higher ABC scores, however, the extent of covariation differed across subscales. (Contains…
Descriptors: Adult Education, Adults, Aggression, Behavior Problems
Peer reviewedHoffman, James V.; Roser, Nancy L.; Salas, Rachel; Patterson, Elizabeth; Pennington, Julie – Journal of Literacy Research, 2001
Investigates reliability of two approaches for estimating text difficulty at the first-grade level: the Scale for Text Accessibility and Support and the Fountas/Pinnell system. Supports the predictive validity of the two rating scales with performance data. Suggests potential benchmarks for first-grade performance: 95% accuracy; 80 words per…
Descriptors: Beginning Reading, Difficulty Level, Grade 1, Measurement Techniques
Peer reviewedHelwig, Robert; Tindal, Gerald – Assessment for Effective Intervention, 2002
Four alternate versions of a 15-item general outcome measure (GOM) of mathematics conceptual understanding and applications were developed and administered to 117 eighth-graders. Results were correlated with scores on state multiple-choice mathematics achievement tests. Correlations ranged from .81 to .87 with no significant differences, offering…
Descriptors: Educational Assessment, Evaluation Methods, Grade 8, Learning Disabilities
Peer reviewedGoldstein, Sam – Journal of Autism and Developmental Disorders, 2002
The reliability, validity, and clinical utility of the Asperger Syndrome Diagnostic Scale in the diagnosis of pervasive developmental disorders are reviewed. While the measure holds promise as a research tool, there appears little evidence that it can distinguish among the variety of types of pervasive developmental disorders, or diagnose Asperger…
Descriptors: Asperger Syndrome, Autism, Behavior Rating Scales, Classification
Peer reviewedKolstad, Rosemarie K.; Kolstad, Robert A. – Journal of Dental Education, 1991
A study evaluated the use of a "none-of-these" option on multiple-choice achievement tests in undergraduate dental education. Results indicated this option neither enhanced nor diminished examinee performance stability but did reduce the examinee's opportunity to select correct choices by means unrelated to course objectives, thereby enhancing…
Descriptors: Achievement Tests, Dental Schools, Difficulty Level, Higher Education
Peer reviewedNorris, Stephen P. – Journal of Educational Measurement, 1990
The relevance of verbal reports of thinking for validating multiple-choice critical thinking tests was examined. Results from 342 senior high school students in Newfoundland (Canada) indicate that verbal reports can meet a necessary condition of validation data and collecting data does not alter thinking and performance. (SLD)
Descriptors: Cognitive Tests, Critical Thinking, Foreign Countries, High School Students
Peer reviewedPecheone, Raymond L.; Carey, Neil B. – Journal of Personnel Evaluation in Education, 1990
The Connecticut Teacher Assessment Center Project has, since 1986, been developing a semistructured interview in the area of mathematics to evaluate beginning teacher competence. The strategy for validation of the project's performance tests, Connecticut's reform initiatives, and implications of systematic validity for traditional psychometric…
Descriptors: Beginning Teachers, Higher Education, Interviews, Licensing Examinations (Professions)
Peer reviewedEndler, Norman S.; Parker, James D. A. – Educational and Psychological Measurement, 1990
C. Davis and M. Cowles (1989) analyzed a total trait anxiety score on the Endler Multidimensional Anxiety Scales (EMAS)--a unidimensional construct that this multidimensional measure does not assess. Data are reanalyzed using the appropriate scoring procedure for the EMAS. Subjects included 145 undergraduates in 1 of 4 testing conditions. (SLD)
Descriptors: Anxiety, Comparative Testing, Computer Assisted Testing, Construct Validity
Peer reviewedYoung, John W. – Journal of Educational Measurement, 1990
A new measure of academic performance was developed through a new application of item response theory (IRT). This new criterion, an IRT-based grade point average (GPA), was used to determine the predictive validity of certain preadmissions measures for 1,564 students admitted to Stanford University in 1982. (SLD)
Descriptors: Academic Achievement, Admission Criteria, College Entrance Examinations, College Students
Peer reviewedWatkins, C. Edward, Jr.; Campbell, Vicki L. – Counseling Psychologist, 1990
Introduces this issue of "The Counseling Psychologist," which considers several contemporary developments and issues in the areas of testing and assessment and their relevance for counseling psychologists. Topics examined by the papers are identified, along with five additional current issues in testing and assessment. (Author/TE)
Descriptors: Career Counseling, Computer Assisted Testing, Counseling, Evaluation Methods
Peer reviewedHarper, Dennis C.; Wadsworth, John S. – Research in Developmental Disabilities, 1990
This article investigates cognitive decline and depressive symptomatology among older adults with mental retardation. A pilot study of assessment instruments is reported. Findings reveal that decreasing cognitive ability is associated with higher rates of observed depression and reported behavioral problems. Cognitive decline was associated with…
Descriptors: Aging (Individuals), Behavior Problems, Clinical Diagnosis, Cognitive Ability
Peer reviewedRead, John – English for Specific Purposes, 1990
Considers the question of how best to elicit samples of writing for assessment in an English-for-academic-purposes proficiency test and assure that every test taker has something to write about. Three types of writing tasks are defined and analyzed, and examples are given. (25 references) (GLR)
Descriptors: English for Academic Purposes, Higher Education, Language Proficiency, Prior Learning
Peer reviewedNist, Sherrie L.; And Others – Reading Research and Instruction, 1990
Investigates the utility and predictive validity of the Learning and Study Strategies Inventory (LASSI) as a means of measuring college students' cognitive and affective growth following a study strategies course. Finds cognitive and affective growth in both regularly admitted and developmental studies students. Finds that LASSI cannot yet be used…
Descriptors: Affective Measures, Cognitive Measurement, College Students, Developmental Studies Programs
Peer reviewedByrne, Barbara M. – Multivariate Behavioral Research, 1989
Construct validity findings from Campbell-Fiske and LISREL confirmatory factor analyses of a multitrait-multimethod matrix were compared for 252 low-track and 588 high-track eleventh- and twelfth-grade students. The manner in which construct validity varies across groups is also discussed. (SLD)
Descriptors: Comparative Analysis, Construct Validity, Factor Analysis, Grade 11


