Publication Date
| In 2026 | 2 |
| Since 2025 | 462 |
| Since 2022 (last 5 years) | 1941 |
| Since 2017 (last 10 years) | 4513 |
| Since 2007 (last 20 years) | 6998 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10004 |
| Test Construction | 4369 |
| Foreign Countries | 3831 |
| Psychometrics | 2428 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 838 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 162 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Ekstrom, Ruth B.; And Others – 1974
This report is part of a general study of Reference Measures for Cognitive and Noncognitive Factors. The specific activity that is being reported is the development of "factor-referenced" tests or "marker" tests for several cognitive factors related to divergent production (i.e., ability to produce a variety of words, phrases,…
Descriptors: Cognitive Ability, Cognitive Processes, Creativity, Divergent Thinking
Schwartz, Howard P. – 1974
Distinction between norm referenced and criterion referenced tests are explored in relationship to underlying philosophy and intent. In considering the use of a criterion referenced test for instructional purposes, consideration is given to: specification of objectives, item content and selection, reliability, and needs assessment. (Author)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Educational Assessment, Educational Needs
Diederich, Paul B. – 1973
Written by an ex-Latin teacher, short-cuts to analyzing test results for the non-mathematical teacher are provided. Discussions are given of item analysis (item analysis by a show of hands, standards for test items: success, standards for test items: discrimination, and the second stage of item analysis. The standard error is then presented (the…
Descriptors: Correlation, Error of Measurement, Guides, Item Analysis
PDF pending restorationTaylor, Anne P.; Helmstadter, G. C. – 1971
A pair comparison scale for measuring aesthetic judgment which could be used with four and five year old children was developed by having art experts independently judge for "aesthetic quality" color slides representing a variety of stimuli on an eleven-point successive category scale. The scale was administered to forty children on two…
Descriptors: Art Appreciation, Childhood Attitudes, Childhood Interests, Pictorial Stimuli
Koos, Eugenia M. – 1970
Problems in assessing the validity and reliability of the Mid-Continent Regional Educational Laboratory (McREL) tests of the inquiry skills of biology students are discussed by reference to the first trial version of the first Explorations in Biology (EIB) booklets. Since students learn during the two parts of the test, coefficients of stability…
Descriptors: Biology, Evaluation, Inquiry, Measurement
Thomas, Charles R. – 1971
One hundred and fifteen first graders were randomly assigned to experimental and control groups. Experimental pupils used the Visual Tracking program, and the control pupils participated in directed listening activities in separate rooms. The teachers followed a weekly rotating schedule in supervising the groups. After 12 weeks of training,…
Descriptors: Eye Movements, Grade 1, Sex Differences, Silent Reading
Schmeiser, Cynthia Board; Whitney, Douglas R. – 1973
Violations of four selected principles of writing multiple-choice items were introduced into an undergraduate religion course mid-term examination. Three of the flaws significantly increased test difficulty. KR-sub-20 values were lower for all of the tests containing the flawed items than for the "good" versions of the items but significantly so…
Descriptors: Item Analysis, Multiple Choice Tests, Research Reports, Test Construction
Young, Jon I. – 1972
Some theoretical concerns for competency-based evaluation instruments are discussed, and means of examining these instruments for validity and reliability are presented. The areas of concern include descriptions of the behavior, the level of response, and the nature of the evaluation. Two different types of instruments are examined to determine…
Descriptors: Evaluation Methods, Measurement Instruments, Models, Performance Tests
Modu, Christopher C. – 1972
The contribution of a 20-minute essay question, given as part of the one-hour achievement test in American History and Social Studies, to the pool of information available on a candidate from an all-objective examination of the College Board Admissions Testing Program is presented in this report. The study limits itself to a consideration of the…
Descriptors: Academic Achievement, American History, Essay Tests, Objective Tests
Wilmoth, Gregory H.; McFarland, Sam G. – 1976
Kohlberg's Moral Judgment Scale, Gilligan, et al.'s Sexual Moral Judgment Scale, Maitland and Goldman's Objective Moral Judgment Scale, and Hogan's Maturity of Moral Judgment Scale were examined for reliability and inter-scale relationships. All measures except the Objective Moral Judgment Scale had good reliabilities. The obtained relations…
Descriptors: Adults, Comparative Analysis, Correlation, Moral Development
Schlenker, Richard M.
Sixty-nine students in grades 9, 10, and 11 were tested with three of Viktor Lowenfeld's visual-haptic tests in an attempt to ascertain whether students at these levels segregated in a fashion similar to Lowenfeld's sample. Respondents were spread over the visual-haptic continuum as Lowenfeld suggested they should be. However, a large and…
Descriptors: Aptitude Tests, Perception Tests, Scoring, Secondary Education
1976
Over forty articles dealing with studies of affective variables were analyzed in this report of the Elementary School Subcommittee of the National Council on Measurement in Education's Task Force on Measurement of Affective Outcomes. The measurement problems were of both a theoretical and practical nature. The greatest practical problem is…
Descriptors: Affective Behavior, Affective Measures, Affective Objectives, Evaluation Methods
Peer reviewedZimmerman, Donald W. – Educational and Psychological Measurement, 1976
Using the concepts of conditional probability, conditional expectation, and conditional independence, the main results of the classical test theory model can be derived in a very few steps with minimal assumptions. The present effort explores the possibility that present classical test theories can be further condensed. (Author/RC)
Descriptors: Career Development, Correlation, Mathematical Models, Measurement
Rathus, Spencer A.; Siegel, Larry J. – Journal of Family Counseling, 1976
Self-concept questionnaire was shown to have high test-retest reliability, but only fair to moderate split-half (odd-even) reliability. Validity was adequate. The scale will serve as a heuristic device for family counselors who require a rapid assessment of a child's self-esteem. (Author)
Descriptors: Personality Measures, Rating Scales, Research Projects, Self Concept Measures
Peer reviewedHakstian, A. Ralph; Bennet, Richard W. – Educational and Psychological Measurement, 1978
Three multiple-abilities batteries--the Comprehensive Ability Battery, Differential Aptitude Tests, and General Aptitude Test Battery--are briefly discussed, and results of a study involving cross-correlations of the tests in these batteries are presented. (Author/JKS)
Descriptors: Achievement Tests, Aptitude Tests, Correlation, Foreign Countries


