Publication Date
| In 2026 | 1 |
| Since 2025 | 899 |
| Since 2022 (last 5 years) | 4508 |
| Since 2017 (last 10 years) | 10441 |
| Since 2007 (last 20 years) | 21904 |
Descriptor
| Test Validity | 21743 |
| Validity | 13779 |
| Test Reliability | 10839 |
| Foreign Countries | 9859 |
| Test Construction | 6878 |
| Factor Analysis | 5756 |
| Measures (Individuals) | 5619 |
| Predictive Validity | 5019 |
| Psychometrics | 4806 |
| Reliability | 4634 |
| Correlation | 4373 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1393 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedHelwig, Robert; Tindal, Gerald – Assessment for Effective Intervention, 2002
Four alternate versions of a 15-item general outcome measure (GOM) of mathematics conceptual understanding and applications were developed and administered to 117 eighth-graders. Results were correlated with scores on state multiple-choice mathematics achievement tests. Correlations ranged from .81 to .87 with no significant differences, offering…
Descriptors: Educational Assessment, Evaluation Methods, Grade 8, Learning Disabilities
Peer reviewedGoldstein, Sam – Journal of Autism and Developmental Disorders, 2002
The reliability, validity, and clinical utility of the Asperger Syndrome Diagnostic Scale in the diagnosis of pervasive developmental disorders are reviewed. While the measure holds promise as a research tool, there appears little evidence that it can distinguish among the variety of types of pervasive developmental disorders, or diagnose Asperger…
Descriptors: Asperger Syndrome, Autism, Behavior Rating Scales, Classification
Peer reviewedKolstad, Rosemarie K.; Kolstad, Robert A. – Journal of Dental Education, 1991
A study evaluated the use of a "none-of-these" option on multiple-choice achievement tests in undergraduate dental education. Results indicated this option neither enhanced nor diminished examinee performance stability but did reduce the examinee's opportunity to select correct choices by means unrelated to course objectives, thereby enhancing…
Descriptors: Achievement Tests, Dental Schools, Difficulty Level, Higher Education
Peer reviewedNorris, Stephen P. – Journal of Educational Measurement, 1990
The relevance of verbal reports of thinking for validating multiple-choice critical thinking tests was examined. Results from 342 senior high school students in Newfoundland (Canada) indicate that verbal reports can meet a necessary condition of validation data and collecting data does not alter thinking and performance. (SLD)
Descriptors: Cognitive Tests, Critical Thinking, Foreign Countries, High School Students
Peer reviewedPecheone, Raymond L.; Carey, Neil B. – Journal of Personnel Evaluation in Education, 1990
The Connecticut Teacher Assessment Center Project has, since 1986, been developing a semistructured interview in the area of mathematics to evaluate beginning teacher competence. The strategy for validation of the project's performance tests, Connecticut's reform initiatives, and implications of systematic validity for traditional psychometric…
Descriptors: Beginning Teachers, Higher Education, Interviews, Licensing Examinations (Professions)
Peer reviewedEndler, Norman S.; Parker, James D. A. – Educational and Psychological Measurement, 1990
C. Davis and M. Cowles (1989) analyzed a total trait anxiety score on the Endler Multidimensional Anxiety Scales (EMAS)--a unidimensional construct that this multidimensional measure does not assess. Data are reanalyzed using the appropriate scoring procedure for the EMAS. Subjects included 145 undergraduates in 1 of 4 testing conditions. (SLD)
Descriptors: Anxiety, Comparative Testing, Computer Assisted Testing, Construct Validity
Peer reviewedYoung, John W. – Journal of Educational Measurement, 1990
A new measure of academic performance was developed through a new application of item response theory (IRT). This new criterion, an IRT-based grade point average (GPA), was used to determine the predictive validity of certain preadmissions measures for 1,564 students admitted to Stanford University in 1982. (SLD)
Descriptors: Academic Achievement, Admission Criteria, College Entrance Examinations, College Students
Peer reviewedWatkins, C. Edward, Jr.; Campbell, Vicki L. – Counseling Psychologist, 1990
Introduces this issue of "The Counseling Psychologist," which considers several contemporary developments and issues in the areas of testing and assessment and their relevance for counseling psychologists. Topics examined by the papers are identified, along with five additional current issues in testing and assessment. (Author/TE)
Descriptors: Career Counseling, Computer Assisted Testing, Counseling, Evaluation Methods
Peer reviewedHarper, Dennis C.; Wadsworth, John S. – Research in Developmental Disabilities, 1990
This article investigates cognitive decline and depressive symptomatology among older adults with mental retardation. A pilot study of assessment instruments is reported. Findings reveal that decreasing cognitive ability is associated with higher rates of observed depression and reported behavioral problems. Cognitive decline was associated with…
Descriptors: Aging (Individuals), Behavior Problems, Clinical Diagnosis, Cognitive Ability
Peer reviewedRead, John – English for Specific Purposes, 1990
Considers the question of how best to elicit samples of writing for assessment in an English-for-academic-purposes proficiency test and assure that every test taker has something to write about. Three types of writing tasks are defined and analyzed, and examples are given. (25 references) (GLR)
Descriptors: English for Academic Purposes, Higher Education, Language Proficiency, Prior Learning
Peer reviewedNist, Sherrie L.; And Others – Reading Research and Instruction, 1990
Investigates the utility and predictive validity of the Learning and Study Strategies Inventory (LASSI) as a means of measuring college students' cognitive and affective growth following a study strategies course. Finds cognitive and affective growth in both regularly admitted and developmental studies students. Finds that LASSI cannot yet be used…
Descriptors: Affective Measures, Cognitive Measurement, College Students, Developmental Studies Programs
Peer reviewedByrne, Barbara M. – Multivariate Behavioral Research, 1989
Construct validity findings from Campbell-Fiske and LISREL confirmatory factor analyses of a multitrait-multimethod matrix were compared for 252 low-track and 588 high-track eleventh- and twelfth-grade students. The manner in which construct validity varies across groups is also discussed. (SLD)
Descriptors: Comparative Analysis, Construct Validity, Factor Analysis, Grade 11
Peer reviewedNixon, Jon – Peabody Journal of Education, 1987
The role of the teacher as researcher is related to key themes in the literature on action research. Current issues facing teachers involved in research into their own practice are highlighted. The pressure of accountability placed on teachers is discussed in the context of teacher research. (IAH)
Descriptors: Accountability, Action Research, Curriculum Research, Elementary Secondary Education
Peer reviewedChletsos, Peter N.; And Others – Journal of Research and Development in Education, 1989
This article presents evidence of the reliability and validity of a new paper-and-pencil test of proportional reasoning, Paper-and-Pencil Balance Beam Test. A Total of 627 individuals, aged 8-47, participated in the 3 studies discussed. Results support previous research which correlates performance on proportional reasoning problems with…
Descriptors: Age Differences, Cognitive Development, Elementary Secondary Education, Formal Operations
Peer reviewedMatthews, Margaret – ELT Journal, 1990
Discusses problems with the current trend in using behavior trait-based criteria to assess English-as-a-Second-Language productivity skills, and describes alternatives to such testing that involve the matching of linguistic tasks against nonlinguistic criteria. (Author/CB)
Descriptors: Communicative Competence (Languages), English (Second Language), Evaluation Criteria, Language Proficiency


