Publication Date
| In 2026 | 1 |
| Since 2025 | 899 |
| Since 2022 (last 5 years) | 4508 |
| Since 2017 (last 10 years) | 10441 |
| Since 2007 (last 20 years) | 21904 |
Descriptor
| Test Validity | 21743 |
| Validity | 13779 |
| Test Reliability | 10839 |
| Foreign Countries | 9859 |
| Test Construction | 6878 |
| Factor Analysis | 5756 |
| Measures (Individuals) | 5619 |
| Predictive Validity | 5019 |
| Psychometrics | 4806 |
| Reliability | 4634 |
| Correlation | 4373 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1393 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedMitchell, Karen J.; Molidor, John B. – Educational and Psychological Measurement, 1986
Research reported in this paper considered the construct validity of a trial essay administered in 1985-87 Medical College Admission Test (MCAT). The addition of the essay caused the non-science factor observed in previous MCAT research to be more strongly defined. (Author/LMO)
Descriptors: College Entrance Examinations, Construct Validity, Correlation, Essay Tests
Peer reviewedGreenan, James P. – Career Development for Exceptional Individuals, 1986
Results of field testing for reliability and validity the Generalizable Mathematics Skills Student Self-Ratings (SSR), Teacher Ratings (TR), and Performance Test (PT) assessment instruments with 138 handicapped secondary students in vocational programs and their vocational teachers (N=5) found sufficient content and face validity and relatively…
Descriptors: Disabilities, Mathematics Tests, Performance Tests, Secondary Education
Peer reviewedMagnan, Sally Sieloff – Foreign Language Annals, 1986
Presents research rating the speaking proficiency on the American Council on the Teaching of Foreign Language Oral Proficiency Interview of 40 college French students at four different levels of study. The data show a significant, positive relationship between level of proficiency and level of study. (Author/CB)
Descriptors: Academic Achievement, Audiolingual Skills, College Students, French
Peer reviewedOwston, Ronald D. – Evaluation and Program Planning, 1986
This article outlines an approach designed to strengthen the validity of the case study method of evaluation. Five aspects of this approach are identified: evaluator selection procedure, method of setting out the evaluation expectations, use of on-going feedback, use of multiple perspectives, and wide involvement of various audiences. (Author/JAZ)
Descriptors: American Indian Education, Audience Participation, Canada Natives, Elementary Secondary Education
Peer reviewedMehrens, William A. – Educational Measurement: Issues and Practice, 1986
The President of the National Council on Measurement in Education replies to his critics. He argues that the concept of measurement error should not be used to make cut-scores more valid and that grade point averages have not been demonstrated to be valid indicators of teachers' subject matter competence. (LMO)
Descriptors: Cutting Scores, Grade Point Average, Licensing Examinations (Professions), Measurement Objectives
Peer reviewedBracken, Bruce A. – School Psychology Review, 1985
Discrepancies between the K-ABC and its theoretical base of simultaneous and sequential mental processing; technical and design problems related to disproportionate subtest contributions of the Simultaneous Scale to the Mental Processing Composite; the method of subtest-specific variance computation and use in interpretation; and utility with…
Descriptors: Cognitive Processes, Elementary Education, Individual Testing, Intelligence Tests
Peer reviewedCurry, Lynn; Purkis, Ian E. – Journal of Medical Education, 1986
An evaluation procedure designed to measure the effects of university-organized continuing medical education (CME) courses on participants' prescribing behavior was examined. Copies of prescriptions were analyzed to establish real behavior compared with the physicians' self-reports. (Author/MLW)
Descriptors: Behavior Change, Comparative Analysis, Higher Education, Medical Education
Harrison, Alton, Jr.; Musial, Diann – Journal of Computer-Based Instruction, 1985
Describes the three stages of development of a national computer-based teaching package for school board members: identification of critical school board competencies; development and programing of the computer teaching program; and field and validity testing. (Author/MBR)
Descriptors: Boards of Education, Computer Assisted Instruction, Field Tests, Instructional Design
Peer reviewedWolfe, Joseph – Simulation and Games, 1985
This 10-year literature update of Greenlaw and Wyman's review of teaching effectiveness of collegiate-level business games reviews research on top-management or functionally integrated games and functional games which emphasize specific business operations areas. Conclusions are drawn on business gaming's teaching effectiveness, and observations…
Descriptors: Academic Achievement, Business Administration Education, Collective Bargaining, Economics
Peer reviewedBaldauf, Richard B., Jr.; And Others – Educational and Psychological Measurement, 1985
The reliability and factorial validity of the Self Concept as a Learner Scale was studied, using 12-year-old Anglo-Australians. Reliability was acceptable for total scale and three subscales (task orientation, problem solving, and class membership), but not motivation. The validity of the factorial subscales was not confirmed. (GDC)
Descriptors: Factor Structure, Foreign Countries, Junior High Schools, Learning
Peer reviewedPierson, Dorothy; And Others – Educational and Psychological Measurement, 1985
The construct validity and reliability of the Porter Needs Satisfaction Questionnaire (adapted) for educators were examined. Results did not support its use as suggested by Porter. Suggestions for its revision and alternate use are presented. (Author/GDC)
Descriptors: Attitude Measures, Elementary Secondary Education, Factor Structure, Job Satisfaction
Peer reviewedKrus, David J.; Stanley, Maureen A. – Educational and Psychological Measurement, 1985
The Attitudes toward Bilingual Education scale was shown to be valid with respect to its ability to discriminate between proponents of bilingual education and a general population. (The 23-item scale is included). (Author/GDC)
Descriptors: Attitude Measures, Bilingual Education Programs, Elementary Secondary Education, Family Attitudes
Peer reviewedMoshman, David; Franks, Bridget A. – Child Development, 1986
Tested hypothesis that understanding validity of inference is a relatively late development by asking fourth and seventh graders and college students to sort sets of deductive arguments. None of fourth graders, 45 percent of seventh graders, and 85 percent of college students used validity as basis for distinguishing arguments. Experiments…
Descriptors: Abstract Reasoning, Age Differences, College Students, Deduction
Peer reviewedDiserens, Deborah; And Others – Journal of Medical Education, 1986
A computer program developed at the University of Pennsylvania School of Medicine presents simulated patient cases and then scores participants' clinical problem-solving in the cases by comparing their performances with those of faculty members. The validity and reliability of this evaluation system was investigated. (Author/MLW)
Descriptors: Clinical Diagnosis, Evaluation Methods, Graduate Medical Students, Higher Education
Peer reviewedEaves, Ronald C.; Simpson, Robert G. – Psychology in the Schools, 1986
Contends that erroneous conclusions concerning intraindividual strengths may result when comparing scaled scores on subtests of The Test of Reading Comprehension. Examination of scaled scores may seem to indicate that a given student has performed better on one subtest than on another when the difference between the two scores is not statistically…
Descriptors: Academic Ability, Comparative Analysis, Elementary Education, Elementary School Students


