Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 25 |
| Since 2017 (last 10 years) | 67 |
| Since 2007 (last 20 years) | 120 |
Descriptor
| Test Use | 771 |
| Test Validity | 771 |
| Test Reliability | 297 |
| Test Construction | 239 |
| Elementary Secondary Education | 150 |
| Higher Education | 123 |
| Scores | 105 |
| Foreign Countries | 101 |
| Standardized Tests | 98 |
| Test Interpretation | 95 |
| Evaluation Methods | 93 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 55 |
| Researchers | 26 |
| Teachers | 23 |
| Administrators | 13 |
| Parents | 9 |
| Students | 8 |
| Policymakers | 5 |
| Community | 3 |
| Counselors | 2 |
| Support Staff | 1 |
Location
| Australia | 13 |
| Canada | 11 |
| New York | 7 |
| United Kingdom (England) | 7 |
| Tennessee | 6 |
| United States | 6 |
| South Korea | 5 |
| Turkey | 5 |
| China | 4 |
| Japan | 4 |
| New Jersey | 4 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedRoyer, James M.; Lynch, Douglas J. – Reading Psychology, 1982
Reviews four uses for norm-referenced tests of reading comprehension. Concludes that three of the uses are actually misuses and that the only appropriate use is for predicting future student performance. (FL)
Descriptors: Norm Referenced Tests, Reading Comprehension, Reading Instruction, Reading Research
Bond, Lloyd – New Directions for Testing and Measurement, 1981
Three important issues related to the testing debate, particularly in the context of college and professional school admissions, are reviewed and evaluated. (Author/AL)
Descriptors: Admission Criteria, College Entrance Examinations, Equal Protection, Test Bias
Peer reviewedGunning, Thomas G. – Reading Teacher, 1982
Argues that giving students a reading test that is above or below their levels of achievement will yield inaccurate information. (FL)
Descriptors: Elementary Education, Reading Difficulties, Reading Instruction, Reading Tests
Peer reviewedJongsma, Eugene A. – Reading Teacher, 1982
Reviews the ANSER System, a set of questionnaires designed to collect background information from parents, teachers, and students that would be useful in conducting indepth evaluations of learning and behavioral problems. Concludes that because it relies on subjective interpretations, its use should be limited to specialists. (FL)
Descriptors: Behavior Problems, Diagnostic Tests, Elementary Education, Learning Disabilities
Peer reviewedShepard, Lorrie A. – Educational Measurement: Issues and Practice, 1997
It is argued that consequences are a logical part of the evaluation of test use, which has been an accepted part of test validity for several decades. The examination of effects following from test use is essential in evaluating test validity and not merely the domain of policymakers and politicians. (SLD)
Descriptors: Educational Assessment, Educational Policy, Educational Testing, Elementary Secondary Education
Peer reviewedLinn, Robert L. – Educational Measurement: Issues and Practice, 1997
It is argued that consequential validity is a concept worth considering. The solution to defining "validity" is not to narrow the concept, but to allow for the differential prediction provided by tests in different circumstances. Consequences of the uses and interpretations of test scores are central to their evaluation. (SLD)
Descriptors: Educational Assessment, Educational Testing, Elementary Secondary Education, Evaluation Methods
Peer reviewedHoekje, Barbara; Linnell, Kimberly – TESOL Quarterly, 1994
Bachman's framework of language testing and standard of authenticity for language testing instruments were used to evaluate three instruments--the SPEAK (Spoken Proficiency English Assessment Kit) test, OPI (Oral Proficiency Interview), and a performance test--as language tests for nonnative-English-speaking teaching assistants. (Contains 53…
Descriptors: English (Second Language), Foreign Students, Language Proficiency, Language Tests
Peer reviewedMcCusker, Paul J. – Psychological Assessment, 1994
Three short forms of the Wechsler Adult Intelligence Scale-Revised (WAIS-R), developed in 1991, were cross-validated on 207 male and 133 female adolescent psychiatric inpatients and outpatients. Results show psychometric properties for the short forms that are comparable to those of the WAIS-R standardization sample. (SLD)
Descriptors: Adolescents, Clinical Diagnosis, Comparative Analysis, Intelligence Tests
Peer reviewedRabiner, Donna J.; And Others – Evaluation and Program Planning, 1994
A 14-item instrument, the Dentist Satisfaction Survey-14, a form of a previously validated instrument, is described. Use with 522 dentists, and 29 in a follow-up, indicates that the short form is a parsimonious tool for general evaluation of dentists' job satisfaction. (SLD)
Descriptors: Attitude Measures, Dentists, Evaluation Methods, Followup Studies
Poteet, James A. – Diagnostique, 1990
A framework is presented for implementing standardized achievement testing. Fundamental concepts and formats of the tests are reviewed, and useful references are listed. Standardized tests are considered and categorized in terms of administration format, functions, validity, and type (cognitive versus noncognitive). Thirteen basic recommendations…
Descriptors: Achievement Tests, Elementary Secondary Education, Learning Problems, Standardized Tests
Peer reviewedThomas, Volker; Olson, David H. – Journal of Marital and Family Therapy, 1993
Examined validity, reliability, and curvilinearity of Clinical Rating Scale and tested scale's ability to discriminate between clinical families and nonclinical families on family cohesion, adaptability, and communication. Data from two groups of problem families and two control groups supported curvilinear hypothesis that problem families are…
Descriptors: Adjustment (to Environment), Family Counseling, Family Problems, Family Relationship
Peer reviewedDozois, David J. A.; Ahnberg, Jamie L.; Dobson, Keith S. – Psychological Assessment, 1998
Provides psychometric information on the second edition of the Beck Depression Inventory (BDI-II) (A. Beck, R. Steer, and G. Brown, 1996) for internal consistency, factorial validity, and gender differences. Results indicate that the BDI-II is a stronger instrument than its predecessor in terms of factor structure. (SLD)
Descriptors: Depression (Psychology), Factor Analysis, Factor Structure, Psychometrics
Kyriakides, Leonidas – Assessment in Education Principles Policy and Practice, 2004
This paper argues for an expanded conception of test validity, in which teachers, as end-users of tests, contribute a distinctive perspective on validity, referred to as inferential validity. It also offers a methodology that could be adopted in order to subject this dimension of validity to scrutiny. An investigation conducted into the meanings…
Descriptors: Test Validity, Foreign Countries, Emergent Literacy, Achievement Tests
Kirisci, Levent; Clark, Duncan B. – 1996
The reliability and validity of the State-Trait Anxiety Inventory for Children (STAIC) was studied with 675 adolescents aged 12 to 18 recruited from clinical and community sources. The STAIC is a self-report measure that has been widely used to assess state and trait anxiety of children. It has been suggested that the child version may be more…
Descriptors: Adolescents, Anxiety, Children, Factor Structure
Reimer, Judy – 1996
Work Keys provides a metric that translates skill requirements for individual jobs into levels of proficiency. It has been developed as a multifunctional program with the interactive components of Job Profiling, Instructional Support, Reporting, and Assessment. The assessment component contains applied mathematics, applied technology, teamwork,…
Descriptors: Career Planning, Job Skills, National Norms, Occupational Tests

Direct link
