Publication Date
| In 2026 | 1 |
| Since 2025 | 51 |
| Since 2022 (last 5 years) | 213 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 986 |
Descriptor
| Test Validity | 3910 |
| Test Reliability | 1518 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 618 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Reynolds, Cecil R. – 1981
The cultural test bias hypothesis represents the contention that all ethnic or racial group differences on mental tests are due to inherent, artifactual biases produced within the tests through flawed psychometric methodology. This address focuses on an empirical evaluation of the cultural test bias hypothesis, especially emphasizing the construct…
Descriptors: Elementary Secondary Education, Intelligence Tests, Personality Measures, Test Bias
Tarver, Linda K.; And Others – 1980
The purpose of this study is to describe and analyze the performance of Louisiana's potential teachers on the Common Examinations of the National Teacher Examinations (NTE). Scores from 1352 examinees who took the Common Examinations of the NTE in February, 1979, were analyzed. The performance of Louisiana's potential teachers compares favorably…
Descriptors: Beginning Teachers, Difficulty Level, Scores, State Standards
Diamond, Esther E. – 1981
As test standards and research literature in general indicate, definitions of test bias and item bias vary considerably, as do the results of existing methods of identifying biased items. The situation is further complicated by issues of content, context, construct, and criterion. In achievement tests, for example, content validity may impose…
Descriptors: Achievement Tests, Aptitude Tests, Psychometrics, Test Bias
McGue, Matthew; And Others – 1979
The validity of the Woodcock-Johnson Psycho-Educational Battery was examined using test results of 50 learning disabled fourth graders. The appropriateness of the developmental strategy and the evidence for the external validity of the cluster measures contained in the battery were considered. Results indicated that the factor and scholastic…
Descriptors: Elementary Education, Exceptional Child Research, Learning Disabilities, Standardized Tests
Livingston, Samuel A. – 1975
A measure of the usefulness of a pass/fail testing decision procedure is the ratio of the utility of the given procedure to the utility of a procedure based on knowledge of scores on a criterion measure. It is computed from scores for a representative sample of persons tested. Utility functions may be specified by the test user or set by…
Descriptors: Cutting Scores, Decision Making, Mathematical Models, Measurement Techniques
Bloomer, Corinne – Teacher, 1975
Article discussed the disadvantages of student testing as a means of evaluating student progress in the classroom and suggested the use of a new model of assessment. Three steps intended for classroom diagnosis of students were described. (RK)
Descriptors: Academic Achievement, Educational Testing, Models, Standardized Tests
Peer reviewedModjeski, Richard B.; Michael, William B. – Educational and Psychological Measurement, 1978
The General Education Performance Index (GEPI) is a comparatively short test covering the same content as the General Educational Development Test (GED), which takes ten hours to administer. Correlations of the subtests of the GEPI with the GED ranged from .28 to .57. (JKS)
Descriptors: Correlation, Equivalency Tests, Military Personnel, Statistical Data
Peer reviewedDeffenbacher, Jerry L.; Deitz, Sheila R. – Psychology in the Schools, 1978
Test performance and reported anxiety levels of high and low test-anxious subjects taking either a regular exam or an exam containing brief, written relaxation instructions were compared. High test-anxious subjects performed more poorly and reported greater worry and emotionality. Results provide greater external validity for Test Anxiety Scale.…
Descriptors: Anxiety, College Students, Higher Education, Research Projects
Peer reviewedHull, Marc; Halloran, William – Educational and Psychological Measurement, 1976
Results show that the mean number of Occupational Aptitude Patterns (OAP's) generated for a sample of mentally retarded and boarderline intelligence students is significantly greater for the Nonreading Aptitude Test Battery (NATB) than for the General Aptitude Test Battery (GATB). (DEP)
Descriptors: Comparative Testing, Intelligence Tests, Low Ability Students, Mental Retardation
The Invalidity of Partitioned-U Tests in Canonical Correlation and Multivariate Analysis of Variance
Peer reviewedHarris, Richard J. – Multivariate Behavioral Research, 1976
The partitioned-U procedure is outlined, a fundamental logical flaw in this procedure's avoidance of any direct test of the significance of the first discriminant function or largest coefficient of canonical correlation is pointed out, and two alternatives to the partitioned-U procedure are discussed. (Author/DEP)
Descriptors: Analysis of Variance, Correlation, Hypothesis Testing, Multivariate Analysis
Peer reviewedDudley, Harold K.; And Others – Journal of Youth and Adolescence, 1976
Indicates that IQ ranking is the most significnat factor affecting Draw A Person test performance by male subjects. IQ rankings were not found to significantly influence drawings by females. (Author/DEP)
Descriptors: Adolescents, Background, Institutionalized Persons, Intelligence
Peer reviewedMcGovern, Francis J.; Nevid, Jeffrey S. – Journal of Consulting and Clinical Psychology, 1986
Psychological inventories were administered to incarcerated offenders, either without prior cuing or following exposure to experimental cues that identified psychological health and growth with either positive or negative self-disclosure. Results showed that self-disclosure of deviant and symptomatic responses could be enhanced by associating…
Descriptors: Correctional Institutions, Personality Measures, Prisoners, Prompting
Peer reviewedHays, Ron D.; Huba, George J. – Journal of Consulting and Clinical Psychology, 1988
Considered techniques to assess self-reported drug use. Evaluated the effects of different response options on the distribution, reliability, and validity of scores on drug-use items. Suggests that more quantitative measures are not necessarily more reliable or valid than less quantitative measures of drug use. (Author/KS)
Descriptors: Drug Use, Item Analysis, Psychological Testing, Psychometrics
Peer reviewedNelson, Linda D. – Journal of Consulting and Clinical Psychology, 1987
Administered Minnesota Multiphasic Personality Inventory (MMPI) to clinically depressed and nondepressed inpatients, and compared scores from its Depression scale with scores from the Beck Depression Inventory. Demonstrated a positive linear relationship between the two measures and their ability to discriminate between depressed and nondepressed…
Descriptors: Concurrent Validity, Depression (Psychology), Item Analysis, Patients
Peer reviewedBritton, Warner H.; Eaves, Ronald C. – American Journal of Mental Deficiency, 1986
The relationship between the Vineland Adaptive Behavior Scales-Classroom Edition and its predecessor, the Vineland Social Maturity Scale was examined with 54 educable and trainable mentally retarded children. The concurrent validity of the two scales was moderate. The mean scores from the newer instrument were significantly lower. (Author/CL)
Descriptors: Elementary Secondary Education, Mild Mental Retardation, Moderate Mental Retardation, Test Validity


