Peer reviewed. Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991
Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)
Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques
Koretz, Daniel M.; And Others – 1991
Detailed evidence is presented about the extent of generalization from high-stakes tests to other tests and about the instructional effects of high-stakes testing. Data are from grade 3 of a large, high-poverty urban district with large numbers of Black and Hispanic American students. The district's results in 1990 for two tests, designated Test B…
Descriptors: Academic Achievement, Accountability, Achievement Tests, Black Students


