Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 27 |
| Since 2007 (last 20 years) | 46 |
Descriptor
| Test Reliability | 418 |
| Test Use | 418 |
| Test Validity | 297 |
| Test Construction | 143 |
| Elementary Secondary Education | 77 |
| Higher Education | 66 |
| Evaluation Methods | 60 |
| Psychometrics | 56 |
| Foreign Countries | 52 |
| Scoring | 49 |
| Standardized Tests | 49 |
| More ▼ | |
Source
Author
| Stansfield, Charles W. | 4 |
| Straus, Murray A. | 4 |
| Thompson, Bruce | 4 |
| Baker, Eva L. | 3 |
| Alsalam, Nabeel | 2 |
| Anderson, Stephen A. | 2 |
| Axelrod, Bradley N. | 2 |
| Boesel, David | 2 |
| Bricker, Diane | 2 |
| Burrell, Brenda | 2 |
| Clark, Duncan B. | 2 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 11 |
| Postsecondary Education | 11 |
| Elementary Education | 10 |
| Early Childhood Education | 7 |
| Elementary Secondary Education | 5 |
| Primary Education | 5 |
| Secondary Education | 5 |
| Grade 3 | 4 |
| Grade 4 | 4 |
| Grade 5 | 4 |
| Grade 6 | 4 |
| More ▼ | |
Audience
| Practitioners | 43 |
| Teachers | 17 |
| Researchers | 9 |
| Students | 8 |
| Administrators | 7 |
| Parents | 5 |
| Policymakers | 3 |
| Community | 2 |
| Counselors | 2 |
| Support Staff | 1 |
Location
| Australia | 10 |
| Canada | 6 |
| New York | 6 |
| Hong Kong | 3 |
| Finland | 2 |
| Georgia | 2 |
| Ireland | 2 |
| Israel | 2 |
| Massachusetts | 2 |
| Michigan | 2 |
| Netherlands | 2 |
| More ▼ | |
Laws, Policies, & Programs
| Education Consolidation… | 2 |
| Elementary and Secondary… | 1 |
| Every Student Succeeds Act… | 1 |
| Individuals with Disabilities… | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedBritton, Gwyneth; Lumpkin, Margarte – Reading Psychology, 1982
Subjecting the comprehension passages of the Gates-McGinitie Reading Test to readability analysis using a multiformula computer program revealed that the instrument can best be used for making comparisons between groups using the same test forms and levels in the same sequence. (FL)
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Readability, Reading Comprehension
Peer reviewedGoodwin, David A. J.; And Others – Psychological Assessment, 1994
Development of a parent report measure for assessing the quality of life of children with cancer is described. The Pediatric Oncology Quality of Life Scale assesses physical function and role restriction, emotional distress, and reaction to current medical treatment. Reliability and validity assessments provide preliminary support for the…
Descriptors: Cancer, Children, Emotional Problems, Evaluation Methods
Peer reviewedKlein, Stephen P.; And Others – Applied Measurement in Education, 1995
Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment
Peer reviewedRusson, Craig; Koehly, Laura M. – Evaluation and Program Planning, 1995
A scale was developed for measuring the persuasive impact of qualitative and quantitative evaluation reports on decision makers. Using two exploratory (n=192 graduate and undergraduate students) and two confirmatory (n=200 administrators) samples, researchers developed a 28-item Likert-type scale that demonstrated high reliability and validity.…
Descriptors: Administrators, Attention, College Students, Comprehension
Peer reviewedFrisbie, David A. – Educational Measurement: Issues and Practice, 1992
Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)
Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests
Peer reviewedNeto, Felix – Journal of Youth and Adolescence, 1993
The applicability of the Satisfaction With Life Scale (SWLS), developed in the United States, to another culture was assessed by investigating reliability and validity of the SWLS with 99 boys and 118 girls from Portugal. The cross-national validity of the scale and its utility with different age groups are supported. (SLD)
Descriptors: Adolescents, Age Differences, Attitude Measures, Comparative Testing
Peer reviewedReeb, Roger N.; Katsuyama, Ronald M.; Sammon, Julie A.; Yoder, David S. – Michigan Journal of Community Service Learning, 1998
Presents three studies examining the psychometric properties of the Community Service Self-Efficacy Scale developed for program evaluation of service learning in college. The scale was constructed to assess an individual's confidence in his or her ability to make clinically significant contributions to the community through service. Reliability,…
Descriptors: College Students, Construct Validity, Higher Education, Public Service
Beetham, James – American Language Review, 1997
The International English Language Testing System is described, including the test's underlying principles, design, administration, scoring, reliability, and interpretation. Some criticisms of the program are briefly discussed. (MSE)
Descriptors: English (Second Language), Foreign Students, Language Tests, Program Design
Peer reviewedMcBee, Maridyth M.; Barnes, Laura L. B. – Applied Measurement in Education, 1998
The temporal stability and intertask consistency of an eighth-grade mathematics performance assessment and how task similarity affects the ability to generalize results of the assessments were studied with results from 101 eighth graders. Results support the suggestion that large-scale performance assessments be used with considerable caution…
Descriptors: Academic Achievement, Grade 8, Junior High School Students, Junior High Schools
Peer reviewedGreve, Kevin W.; And Others – Assessment, 1995
Interset and set-to-total correlations for the California Card Sorting Test were studied for 135 college students and 63 elementary school students. Although subsets were not equivalent, the generally good correlations between subset and total test scores indicated that use of the individual subsets as short forms is justified. (SLD)
Descriptors: Cognitive Processes, College Students, Correlation, Elementary Education
Peer reviewedIngram, Rick E.; And Others – Psychological Assessment, 1995
Original data and other studies using the Positive Automatic Thoughts Questionnaire (ATP-Q) show that the reliability and norms of the instrument appear stable and that the ATP-Q is inversely associated with negative affective states but unrelated to conditions such as medical condition not accompanied by psychological distress. (SLD)
Descriptors: Affective Behavior, Affective Measures, Cognitive Processes, Literature Reviews
Huang, Chi-yu; And Others – 1995
Generalizability theory is used to examine the sources of variability present in a teacher and course evaluation instrument. Two studies were conducted. In the first study, four different forms commonly used by one specific college of a large midwestern university were examined using responses of 915 students. The analysis of variance performed on…
Descriptors: Analysis of Variance, College Students, Course Evaluation, Evaluation Methods
Leyva, Collette – 1997
The Test of Pragmatic Language (TOPL) is an individually administered instrument designed to assess pragmatic language skills that can be used with students in kindergarten through high school. It is more specifically intended for use with children, adolescents, and adults with learning disabilities, language delays, reading difficulties, or…
Descriptors: Adolescents, Adults, Children, Communication Skills
Kaufman, Alan S.; And Others – 1994
The reliability and validity of three short forms of the Wechsler Intelligence Scale for Children III (WISC-III) were compared. Each of the short forms was a tetrad composed of two verbal and two performance subtests. The first tetrad was selected based primarily on practical considerations, particularly its brevity to administer and score. The…
Descriptors: Adolescents, Age Differences, Children, Clinical Diagnosis
Bartel, Kathleen – 1991
Literacy Volunteers of America (LVA) affiliates were surveyed regarding standardized and informal assessment devices they currently used and their frequency of use and effectiveness. A literature review focused on assessment tools and their limitations. The survey was sent to 39 LVA affiliates in Illinois, Indiana, Michigan, Missouri, Ohio, and…
Descriptors: Adult Basic Education, Adult Literacy, Evaluation Utilization, Informal Assessment


