Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 7 |
Source
| College Board | 4 |
| ETS Research Report Series | 2 |
| College Entrance Examination… | 1 |
| Educational and Psychological… | 1 |
| Journal of Applied Testing… | 1 |
| Journal of Statistics… | 1 |
Author
| Kaliski, Pamela | 3 |
| Engelhard, George, Jr. | 2 |
| Huff, Kristen | 2 |
| Reshetar, Rosemary | 2 |
| Wind, Stefanie A. | 2 |
| Ewing, Maureen | 1 |
| France, Megan | 1 |
| Hendrickson, Amy | 1 |
| Kaliski, Pamela K. | 1 |
| Liu, Mei, Ed. | 1 |
| Mazzeo, John | 1 |
Publication Type
| Reports - Research | 6 |
| Journal Articles | 5 |
| Reports - Evaluative | 3 |
| Non-Print Media | 2 |
| Reference Materials - General | 2 |
| Collected Works - General | 1 |
| Reference Materials -… | 1 |
| Speeches/Meeting Papers | 1 |
| Tests/Questionnaires | 1 |
Education Level
| High Schools | 5 |
| Secondary Education | 5 |
| Higher Education | 4 |
| Postsecondary Education | 4 |
Assessments and Surveys
| Advanced Placement… | 11 |
| College Level Examination… | 2 |
| SAT (College Admission Test) | 2 |
| ACT Assessment | 1 |
| Law School Admission Test | 1 |
| National Merit Scholarship… | 1 |
| Preliminary Scholastic… | 1 |
Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013
Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…
Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories
Kaliski, Pamela; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna; Plake, Barbara; Reshetar, Rosemary – College Board, 2012
The Many-Facet Rasch (MFR) Model is traditionally used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR Model by examining the quality of ratings obtained from a…
Descriptors: Advanced Placement Programs, Achievement Tests, Item Response Theory, Models
Perrett, Jamis J. – Journal of Statistics Education, 2012
This article demonstrates how textbooks differ in their description of the term "experimental unit". Advanced Placement Statistics teachers and students are often limited in their statistical knowledge by the information presented in their classroom textbook. Definitions and descriptions differ among textbooks as well as among different…
Descriptors: Statistics, Advanced Placement Programs, Textbooks, Mathematics Instruction
Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A. – Educational and Psychological Measurement, 2013
The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…
Descriptors: Item Response Theory, Models, Standard Setting (Scoring), Science Tests
Kaliski, Pamela; France, Megan; Huff, Kristen; Thurber, Allison – College Board, 2011
Developing a cognitive model of task performance is an important and often overlooked phase in assessment design; failing to establish such a model can threaten the validity of the inferences made from the scores produced by an assessment (e.g., Leighton, 2004). Conducting think aloud interviews (TAIs), where students think aloud while completing…
Descriptors: World History, Advanced Placement Programs, Achievement Tests, Protocol Analysis
Reshetar, Rosemary; Melican, Gerald J. – College Board, 2010
This paper discusses issues related to the design and psychometric work for mixed-format tests, that is, tests containing both multiple-choice (MC) and constructed-response (CR) items. The issues of validity, fairness, reliability, and score consistency can be addressed, but for mixed-format tests there are many decisions to be made and no examination or…
Descriptors: Psychometrics, Test Construction, Multiple Choice Tests, Test Items
College Board, 2011
This catalog lists research reports, research notes, and other publications available from the College Board's website. The catalog briefly describes research publications available free of charge. Introduced in 1981, the Research Report series includes studies and reviews in areas such as college admission, special populations, subgroup…
Descriptors: Research Reports, Publications, Educational Research, College Students
von Davier, Alina A.; Wilson, Christine – ETS Research Report Series, 2005
This paper discusses the assumptions required by the item response theory (IRT) true-score equating method (with the Stocking & Lord, 1983, scaling approach), which is commonly used in the nonequivalent groups with an anchor data-collection design. More precisely, this paper investigates the assumptions made at each step by the IRT approach to…
Descriptors: Item Response Theory, True Scores, Equated Scores, Test Items
von Davier, Alina A., Ed.; Liu, Mei, Ed. – ETS Research Report Series, 2006
This report builds on and extends existing research on population invariance to new tests and issues. The authors lay the foundation for a deeper understanding of the use of population invariance measures in a wide variety of practical contexts. The invariance of linear, equipercentile, and IRT equating methods is examined using data from five…
Descriptors: Equated Scores, Statistical Analysis, Data Collection, Test Format
Morgan, Rick; Mazzeo, John – 1988
The dimensional structure of the 1987 Advanced Placement (AP) French language examination was tested in four populations using a series of confirmatory linear factor analysis models. To mitigate problems with the linear factor analysis of multiple choice items, the linear factor analysis of item parcel scores, made of small mutually exclusive…
Descriptors: Advanced Placement Programs, College Students, Comparative Analysis, Error of Measurement
Stricker, Lawrence J. – College Entrance Examination Board, 1998
Steele and Aronson (1995) found that the performance of African-American subjects on test items portrayed as a problem-solving task, in a laboratory experiment, was adversely affected when they were asked about their ethnicity. This outcome was attributed to "stereotype threat". Performance was disrupted by the subjects' concerns about…
Descriptors: Ethnicity, Ethnic Groups, Test Items, Problem Solving

