Publication Date
| In 2026 | 1 |
| Since 2025 | 899 |
| Since 2022 (last 5 years) | 4508 |
| Since 2017 (last 10 years) | 10441 |
| Since 2007 (last 20 years) | 21904 |
Descriptor
| Test Validity | 21743 |
| Validity | 13779 |
| Test Reliability | 10839 |
| Foreign Countries | 9859 |
| Test Construction | 6878 |
| Factor Analysis | 5756 |
| Measures (Individuals) | 5619 |
| Predictive Validity | 5019 |
| Psychometrics | 4806 |
| Reliability | 4634 |
| Correlation | 4373 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1393 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedMcNamara, T. F. – Language Testing, 1990
Discusses the role of the Rasch model IRT in evaluating two subtests of the Occupational English test and argues for its use in exploring test constructs and in considering the implications of the empirical analysis presented for the validity of communicative language tests involving speaking and writing skills. (39 references) (Author/JL)
Descriptors: Construct Validity, English for Special Purposes, Evaluation, Health Occupations
Peer reviewedPapa, Frank; And Others – Academic Medicine, 1990
In this study an artificial intelligence assessment tool used disease-by-feature frequency estimates to create disease prototypes for nine common causes of acute chest pain. The tool then used each subject's prototypes and a pattern-recognition-based decision-making mechanism to diagnose 18 myocardial infarction cases. (MLW)
Descriptors: Artificial Intelligence, Clinical Diagnosis, Construct Validity, Decision Making
Ford, Jerry; Gaylord-Ross, Robert – American Journal on Mental Retardation, 1991
This study examined 40 articles published in the "American Journal on Mental Retardation" or the "Journal of the Association for Persons with Severe Handicaps" (JASH) from 1976-78 and 1986-88. Both journals published low numbers of articles with ecological validity in the late 1970s, but JASH subsequently increased…
Descriptors: Behavior Change, Ecological Factors, Generalization, Intervention
Peer reviewedHawkins, Robert P. – Journal of Applied Behavior Analysis, 1991
This paper argues that many social validity processes in applied behavior analysis are actually measuring consumer satisfaction and not the habilitative validity of goals, procedures, or outcomes. The term "habilitative validation" is proposed to replace social validity, and use of more objective assessment methods is encouraged.…
Descriptors: Behavior Problems, Behavioral Science Research, Consumer Economics, Evaluation Methods
Brassard, Marla R.; And Others – Child Abuse and Neglect: The International Journal, 1993
The Psychological Maltreatment Rating Scales (PMRS) were developed for assessing psychological maltreatment in the mother-child interaction, and were used to rate the videotaped interaction of 49 high-risk mother-child dyads and predict child protective service involvements. The PMRS was found to be a moderately reliable and valid measure.…
Descriptors: Behavior Rating Scales, Child Abuse, Child Neglect, Child Welfare
Peer reviewedAllen, Bryce – Information Processing and Management, 1994
Describes two experiments that were conducted at the University of Illinois to determine how cognitive abilities of users of information retrieval systems and specific design features combine to create system usability. Logical reasoning and perceptual speed are examined, and reliability and validity are discussed. (Contains 12 references.) (LRW)
Descriptors: Analysis of Variance, Cognitive Ability, Higher Education, Hypothesis Testing
Peer reviewedPharr, Steven; And Others – Journal of Education for Business, 1993
College entrance examination scores, sophomore grade point average (GPA), and GPA for prerequisite courses were compared for 483 accounting, management, and economics students. All correlated with academic success, but test scores were less significant than GPAs. Only GPAs demonstrated significant results in predicting academic difficulty. (SK)
Descriptors: Academic Achievement, Academic Failure, Admission Criteria, Business Administration Education
Peer reviewedScriven, Michael – New Directions for Program Evaluation, 1993
Seven chapters present 31 propositions challenging traditional ideas about the nature and practice of program evaluation. Methods to improve evaluation models and approaches and ways to address intermediate and advanced evaluation issues are explored. The discussion also serves as an introduction to the most analytical and comprehensive of the…
Descriptors: Bias, Evaluation Methods, Evaluation Problems, Evaluation Utilization
Peer reviewedLines, Christi – Middle School Journal, 1994
Today's educational goals are too varied to be adequately evaluated only by conventional tests and measures. The wide variability of adolescents' development requires assessment devices that transcend the limitations of traditional paper-and-pencil assessments. Assessment should be continuous, comprehensive, multidimensional, collaborative, and…
Descriptors: Adolescents, Developmental Stages, Evaluation Criteria, Intermediate Grades
Peer reviewedDoyle, Eva I.; Chng, Chwee Lye – Health Values: The Journal of Health Behavior, Education & Promotion, 1994
Describes the development, validation, and testing of the Mexican American Attitude and Knowledge Scale (MAAKS) which measures knowledge of and attitudes toward traditional, less acculturated Mexican American culture among university students. The MAAKS was established as a valid, reliable instrument for use with college students. (SM)
Descriptors: Attitude Measures, College Students, Cultural Awareness, Evaluation Methods
Peer reviewedWhite, William – Teacher Education and Practice, 1992
In this interview, an Educational Testing Service (ETS) staff person outlines the three program components, as well as program development, pilot testing, and implementation plans for "The Praxis Series: Professional Assessments for Beginning Teachers." Issues related to equity and computer-assisted testing are also examined. (IAH)
Descriptors: Beginning Teachers, Computer Assisted Testing, Elementary Secondary Education, Higher Education
Peer reviewedSharpton, William R.; Sexton, David; Luster, Jane Nell; Lang, Margaret – Educational and Psychological Measurement, 1998
The purpose of this study was to obtain validity and reliability indexes for scores on a 12-item scale measuring family perspectives on extended school year (ESY) programs for students in special education. Responses from 128 families with a child potentially eligible for ESY programs indicated that a two-component model was both valid and…
Descriptors: Elementary Secondary Education, Extended School Year, Factor Analysis, Factor Structure
Stubbings, Vicki; Martin, Garry L. – American Journal on Mental Retardation, 1998
A study compared the accuracy of experienced staff with a learning test in predicting the ability of 18 persons with mental retardation to learn 12 training tasks. Results found the Assessment of Basic Learning Abilities test was more accurate for predicting client performance than the assessments of experienced staff. (Author/CR)
Descriptors: Attitudes, Cognitive Ability, Competence, Evaluation Methods
Peer reviewedGray, Shelley; Plante, Elena; Vance, Rebecca; Henrichsen, Mary – Language, Speech, and Hearing Services in Schools, 1999
This study compared four commonly used vocabulary tests to screen or identify preschool children for specific language impairment (SLI). Four- and five-year olds with (N=31) and without (N=31) SLI were compared on the tests. Despite moderate to strong inter-test correlations, no test was a strong identifier of SLI. (Author/DB)
Descriptors: Clinical Diagnosis, Delayed Speech, Disability Identification, Language Acquisition
Peer reviewedBusier, Holly-Lynn; Clark, Kelly A.; Esch, Rebecca A.; Glesne, Corrine; Pigeon, Yvette; Tarule, Jill M. – International Journal of Qualitative Studies in Education, 1997
Discusses issues of intimacy in qualitative research raised by Harry Wolcott's Brad trilogy in connection with Reba Page's use of the trilogy to teach about validity in interpretive research methods. Focuses on intimacy in research, relational reflexivity, power in relationships, relational ethics, evolutionary understandings of relationships, and…
Descriptors: Ethics, Ethnography, Graduate Study, Higher Education


