Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006
Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…
Descriptors: Item Sampling, Tests, Test Length, Test Reliability
Wainer, Howard; And Others – 1990
The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…
Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing

Peer reviewed
