Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 12 |
Descriptor
| Advanced Placement Programs | 20 |
| Test Items | 20 |
| Test Construction | 7 |
| High School Students | 6 |
| Achievement Tests | 5 |
| Comparative Analysis | 5 |
| Difficulty Level | 5 |
| Item Response Theory | 5 |
| Models | 5 |
| Multiple Choice Tests | 5 |
| Science Tests | 5 |
| More ▼ | |
Source
Author
| Hendrickson, Amy | 4 |
| Huff, Kristen | 3 |
| Kaliski, Pamela | 3 |
| Engelhard, George, Jr. | 2 |
| Ewing, Maureen | 2 |
| Patterson, Brian | 2 |
| Reshetar, Rosemary | 2 |
| Wind, Stefanie A. | 2 |
| Cleary, Vincent J. | 1 |
| France, Megan | 1 |
| Kaliski, Pamela K. | 1 |
| More ▼ | |
Publication Type
Education Level
| High Schools | 8 |
| Secondary Education | 8 |
| Higher Education | 5 |
| Postsecondary Education | 5 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 10 | 1 |
| Grade 11 | 1 |
| Grade 12 | 1 |
| Grade 4 | 1 |
| Intermediate Grades | 1 |
| More ▼ | |
Audience
| Practitioners | 3 |
| Policymakers | 1 |
| Researchers | 1 |
| Teachers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013
Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…
Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories
Kaliski, Pamela; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna; Plake, Barbara; Reshetar, Rosemary – College Board, 2012
The Many-Facet Rasch (MFR) Model is traditionally used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR Model by examining the quality of ratings obtained from a…
Descriptors: Advanced Placement Programs, Achievement Tests, Item Response Theory, Models
Perrett, Jamis J. – Journal of Statistics Education, 2012
This article demonstrates how textbooks differ in their description of the term "experimental unit". Advanced Placement Statistics teachers and students are often limited in their statistical knowledge by the information presented in their classroom textbook. Definitions and descriptions differ among textbooks as well as among different…
Descriptors: Statistics, Advanced Placement Programs, Textbooks, Mathematics Instruction
Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A. – Educational and Psychological Measurement, 2013
The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…
Descriptors: Item Response Theory, Models, Standard Setting (Scoring), Science Tests
Kaliski, Pamela; France, Megan; Huff, Kristen; Thurber, Allison – College Board, 2011
Developing a cognitive model of task performance is an important and often overlooked phase in assessment design; failing to establish such a model can threaten the validity of the inferences made from the scores produced by an assessment (e.g., Leighton, 2004). Conducting think aloud interviews (TAIs), where students think aloud while completing…
Descriptors: World History, Advanced Placement Programs, Achievement Tests, Protocol Analysis
Yuan, Kun; Le, Vi-Nhuan – RAND Corporation, 2014
In 2010, the William and Flora Hewlett Foundation's Education Program has established the Deeper Learning Initiative, which focuses on students' development of deeper learning skills (i.e., the mastery of core academic content, critical-thinking, problem-solving, collaboration, communication, and "learn-how-to-learn" skills). Two test…
Descriptors: Test Items, Cognitive Processes, Difficulty Level, Skill Development
Hendrickson, Amy; Huff, Kristen; Luecht, Richard – Applied Measurement in Education, 2010
Evidence-centered assessment design (ECD) explicates a transparent evidentiary argument to warrant the inferences we make from student test performance. This article describes how the vehicles for gathering student evidence--task models and test specifications--are developed. Task models, which are the basis for item development, flow directly…
Descriptors: Evidence, Test Construction, Measurement, Classification
Reshetar, Rosemary; Melican, Gerald J. – College Board, 2010
This paper discusses issues related to the design and psychometric work for mixed-format tests --tests containing both multiple-choice (MC) and constructed-response (CR) items. The issues of validity, fairness, reliability and score consistency can be addressed but for mixed-format tests there are many decisions to be made and no examination or…
Descriptors: Psychometrics, Test Construction, Multiple Choice Tests, Test Items
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity
Siko, Jason Paul – International Journal of E-Learning & Distance Education, 2014
In this study, the perceptions of parents (n = 14) and students (n = 47) enrolled in a blended learning course, the first of its kind at their school, were examined. Student performance in the blended and in the traditional portion of the course was examined, and the Educational Success Prediction Instrument (ESPRI) was administered to predict…
Descriptors: Blended Learning, Educational Technology, Distance Education, Electronic Learning
College Board, 2011
This catalog lists research reports, research notes, and other publications available from the College Board's website. The catalog briefly describes research publications available free of charge. Introduced in 1981, the Research Report series includes studies and reviews in areas such as college admission, special populations, subgroup…
Descriptors: Research Reports, Publications, Educational Research, College Students
Peer reviewedStovel, John E. – Social Science Record, 1987
Discusses the Document Based Question (DBQ) as a way of assessing critical thinking skills. Examines several weaknesses with this type of test item and offers suggestions for overcoming them. Advocates the method continue to be used. (JDH)
Descriptors: Advanced Placement Programs, Critical Thinking, Educational Testing, Essay Tests
Peer reviewedCleary, Vincent J. – Classical Outlook, 1986
Analyzes several questions and student answers, and the graders' evaluations of student responses on each of two advanced placement examinations--the one which tests Vergil and the one which tests Catallus and Horace. The percentages of participants scoring at each grade level of the exam are also presented. (SED)
Descriptors: Advanced Placement Programs, Grading, Language Tests, Latin
von Davier, Alina A.; Wilson, Christine – ETS Research Report Series, 2005
This paper discusses the assumptions required by the item response theory (IRT) true-score equating method (with Stocking & Lord, 1983; scaling approach), which is commonly used in the nonequivalent groups with an anchor data-collection design. More precisely, this paper investigates the assumptions made at each step by the IRT approach to…
Descriptors: Item Response Theory, True Scores, Equated Scores, Test Items
von Davier, Alina A., Ed.; Liu, Mei, Ed. – ETS Research Report Series, 2006
This report builds on and extends existent research on population invariance to new tests and issues. The authors lay the foundation for a deeper understanding of the use of population invariance measures in a wide variety of practical contexts. The invariance of linear, equipercentile and IRT equating methods are examined using data from five…
Descriptors: Equated Scores, Statistical Analysis, Data Collection, Test Format
Previous Page | Next Page »
Pages: 1 | 2
Direct link
