Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 1 |
Descriptor
| Test Reliability | 6 |
| Advanced Placement | 4 |
| Scoring | 4 |
| High School Students | 3 |
| Higher Education | 3 |
| Interrater Reliability | 3 |
| Scores | 3 |
| College Entrance Examinations | 2 |
| College Instruction | 2 |
| Essays | 2 |
| Evaluation Criteria | 2 |
| More ▼ | |
Source
| College Teaching | 2 |
| College Board | 1 |
Author
| Braun, Henry I. | 1 |
| Bridgeman, Brent | 1 |
| Mazzeo, John | 1 |
| McLauchlan, William | 1 |
| Melican, Gerald J. | 1 |
| Miller, Jeff | 1 |
| Reshetar, Rosemary | 1 |
Publication Type
| Reports - Evaluative | 3 |
| Journal Articles | 2 |
| Opinion Papers | 2 |
| Reports - Research | 2 |
| Speeches/Meeting Papers | 1 |
Education Level
| High Schools | 1 |
| Secondary Education | 1 |
Audience
| Practitioners | 2 |
| Teachers | 2 |
Location
Laws, Policies, & Programs
Assessments and Surveys
| Advanced Placement… | 6 |
What Works Clearinghouse Rating
Reshetar, Rosemary; Melican, Gerald J. – College Board, 2010
This paper discusses issues related to the design and psychometric work for mixed-format tests --tests containing both multiple-choice (MC) and constructed-response (CR) items. The issues of validity, fairness, reliability and score consistency can be addressed but for mixed-format tests there are many decisions to be made and no examination or…
Descriptors: Psychometrics, Test Construction, Multiple Choice Tests, Test Items
Braun, Henry I. – 1986
This report describes a statistically designed experiment that was carried out in an operational setting to determine the contributions of different sources of variation to the unreliability of scoring. The experiment made novel use of partially balanced incomplete block designs that facilitated the unbiased estimation of certain main effects…
Descriptors: Essay Tests, Estimation (Mathematics), Mathematical Models, Research Design
Bridgeman, Brent; And Others – 1996
The various methods for computing the reliability of scores on Advanced Placement (AP) examinations are summarized. For the free response portion of the examinations, raters can contribute to score unreliability through both systematic severity errors (in which some raters consistently rate more severely than other raters) and through…
Descriptors: Advanced Placement, College Entrance Examinations, Error of Measurement, High School Students
Mazzeo, John; And Others – 1993
This report describes three exploratory studies of the performance of males and females on the multiple-choice and constructed-response sections of four Advanced Placement Examinations: United States History, Biology, Chemistry, and English Language and Composition. Analyses were carried out for each racial or ethnic group with a sample size of at…
Descriptors: Advanced Placement, College Entrance Examinations, Constructed Response, Ethnic Groups
Peer reviewedMiller, Jeff – College Teaching, 1999
A college faculty member who has graded Advanced Placement exam essays on U.S. government and politics, taken mostly by high school juniors and seniors, suggests that high school teachers and college faculty who assess the essays are not the best qualified persons to do so and that despite efforts to ensure consistency, the resulting scores are…
Descriptors: Advanced Placement, College Instruction, Essays, Evaluation Criteria
Peer reviewedMcLauchlan, William – College Teaching, 1999
A faculty consultant to the Educational Testing Service for advanced placement (AP) test reading in U.S. government and politics responds to an article criticizing essay evaluation methods and criteria, finding in it a fundamental misunderstanding of the AP reading process and explaining why the essays are subject to less scrutiny for style,…
Descriptors: Advanced Placement, College Instruction, Essays, Evaluation Criteria


