Showing all 10 results
Peer reviewed
Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2019
One common phenomenon in Angoff standard setting is that panelists regress their ratings toward the middle of the probability scale. This study describes two indices, based on ratios of standard deviations, that can be used with a scatterplot of item ratings versus expected probabilities of success to identify whether ratings are…
Descriptors: Item Analysis, Standard Setting, Probability, Feedback (Response)
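
A minimal sketch of the kind of check the Wyse and Babcock abstract describes, using hypothetical numbers: compare the spread of a panelist's Angoff ratings to the spread of the items' expected probabilities of success. A ratio well below 1 is the pattern the abstract calls ratings regressed toward the middle of the probability scale; the article's actual indices are defined there and may differ from this simplification.

    import numpy as np

    # Hypothetical data: one panelist's Angoff ratings and the items'
    # expected probabilities of success for borderline examinees.
    ratings = np.array([0.55, 0.60, 0.58, 0.62, 0.57, 0.61])
    expected = np.array([0.30, 0.75, 0.45, 0.90, 0.35, 0.80])

    # Ratio of standard deviations: a value well below 1 means the ratings
    # vary far less than the expected probabilities, i.e. they look
    # regressed toward the middle of the probability scale.
    sd_ratio = ratings.std(ddof=1) / expected.std(ddof=1)
    print(f"SD ratio: {sd_ratio:.2f}")  # about 0.10 for these made-up numbers
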
Peer reviewed
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
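
The Clauser, Kane, and Clauser abstract turns on how judge (and panel) variability feeds error into the Angoff cut score. A minimal sketch of the simplest version of that idea, using hypothetical ratings in an unnested judges-by-items design and ignoring the item-variance subtleties the abstract alludes to: treat each judge's summed ratings as that judge's cut score and estimate the cut score's standard error from the spread across judges.

    import numpy as np

    # Hypothetical Angoff ratings: rows = judges, columns = items
    # (judged probability of success for the borderline examinee).
    ratings = np.array([
        [0.6, 0.7, 0.5, 0.8, 0.6],
        [0.5, 0.6, 0.4, 0.7, 0.5],
        [0.7, 0.8, 0.6, 0.9, 0.7],
    ])

    judge_cuts = ratings.sum(axis=1)   # each judge's implied raw cut score
    cut_score = judge_cuts.mean()      # panel cut score
    # Error attributable to judge variability (items treated as fixed here).
    se_judges = judge_cuts.std(ddof=1) / np.sqrt(len(judge_cuts))
    print(f"cut = {cut_score:.2f}, SE from judges = {se_judges:.2f}")
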
Northwest Evaluation Association, 2015
Recently, the Smarter Balanced Assessment Consortium (Smarter Balanced) released a document that established initial performance levels and the associated threshold scale scores for the Smarter Balanced assessment. The report included estimated percentages of students expected to perform at each of the four performance levels, reported by grade…
Descriptors: Standard Setting, Standard Setting (Scoring), Pretesting, Cutting Scores
Allen, Jeff; Radunzel, Justine; Moore, Joann – ACT, Inc., 2017
The ACT College Readiness Benchmarks are the ACT scores associated with a 50% chance of earning a B or higher grade in selected first-year credit-bearing courses at a typical postsecondary institution. The Benchmarks were established by linking ACT test scores with grades in first-year college courses from the same subject area. Benchmarks were…
Descriptors: Standard Setting, Probability, Success, Benchmarking
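
The ACT Benchmarks are defined as the scores tied to a 50% chance of a B or higher. A minimal sketch of one way such a score can be located, assuming a logistic regression of course success on test score with hypothetical data (the report's actual linking procedure may differ): the 50% point is the score at which the fitted log-odds equal zero.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    # Hypothetical data: test scores and whether the student earned a B or higher.
    scores = np.array([14, 16, 18, 20, 22, 24, 26, 28, 30, 32]).reshape(-1, 1)
    b_or_higher = np.array([0, 0, 0, 0, 1, 0, 1, 1, 1, 1])

    model = LogisticRegression().fit(scores, b_or_higher)

    # P(B or higher) = 0.5 where intercept + slope * score = 0.
    benchmark = -model.intercept_[0] / model.coef_[0, 0]
    print(f"score with roughly a 50% chance of a B or higher: {benchmark:.1f}")
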
Kobrin, Jennifer L.; Patterson, Brian F.; Wiley, Andrew; Mattern, Krista D. – College Board, 2012
In 2011, the College Board released its SAT college and career readiness benchmark, which represents the level of academic preparedness associated with a high likelihood of college success and completion. The goal of this study, which was conducted in 2008, was to establish college success criteria to inform the development of the benchmark. The…
Descriptors: College Entrance Examinations, Standard Setting, College Readiness, Career Readiness
Northwest Evaluation Association, 2014
Recently, the Northwest Evaluation Association (NWEA) completed a study to connect the scale of the North Carolina State End of Grade (EOG) Testing Program used for North Carolina's mathematics and reading assessments with NWEA's Rasch Interval Unit (RIT) scale. Information from the state assessments was used in a study to establish…
Descriptors: Alignment (Education), Testing Programs, Equated Scores, Standard Setting
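
The NWEA study links the North Carolina EOG scale to the RIT scale. As a hypothetical illustration of one common linking approach, equipercentile concordance (the abstract does not say which method the study actually used): map an EOG score to the RIT score that sits at the same percentile rank in an equivalent group of students.

    import numpy as np

    # Hypothetical score distributions from comparable groups of students.
    rng = np.random.default_rng(0)
    eog_scores = rng.normal(450, 10, 5000)
    rit_scores = rng.normal(220, 15, 5000)

    def equipercentile_link(x, from_scores, to_scores):
        """Map score x on the 'from' scale to the 'to' scale by matching percentile ranks."""
        pct = (from_scores < x).mean() * 100       # percentile rank of x
        return np.percentile(to_scores, pct)       # same percentile on the other scale

    print(equipercentile_link(455.0, eog_scores, rit_scores))
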
Munyofu, Paul – Performance Improvement Quarterly, 2010
The state of Pennsylvania, like many organizations interested in performance improvement, routinely engages in professional development activities. In this hands-on activity, educators set meaningful criterion-referenced cut scores for career and technical education assessments using two methods. The main purposes of this study were…
Descriptors: Standard Setting, Cutting Scores, Professional Development, Vocational Education
van der Linden, Wim J.; Vos, Hans J.; Chang, Lei – 2000
In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of performance. Methods to check standard setting…
Descriptors: Interrater Reliability, Judges, Probability, Standard Setting
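
The van der Linden, Vos, and Chang paper is about checking whether judges' probabilities all refer to the same borderline performance level. A minimal sketch of the underlying idea with hypothetical Rasch difficulties (the paper's own consistency checks are more elaborate): invert each specified probability to the ability it implies and look at how much those implied abilities disagree across items.

    import numpy as np

    # Hypothetical Rasch item difficulties and one judge's specified
    # probabilities of success for the borderline examinee.
    b = np.array([-1.0, -0.3, 0.2, 0.8, 1.5])
    p = np.array([0.80, 0.70, 0.65, 0.60, 0.55])

    # Under the Rasch model P = 1 / (1 + exp(-(theta - b))), so each rating
    # implies a borderline ability theta = b + logit(p).
    implied_theta = b + np.log(p / (1 - p))

    # Ratings consistent with a single borderline level would give (nearly)
    # equal values; a large spread flags inconsistency.
    print(implied_theta, implied_theta.std(ddof=1))
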
Peer reviewed
Huynh, Huynh – Educational Measurement: Issues and Practice, 2006
By analyzing the Fisher information allotted to the correct response of a Rasch binary item, Huynh (1994) established the response probability criterion 0.67 (RP67) for standard settings based on bookmarks and item mapping. The purpose of this note is to help clarify the conceptual and psychometric framework of the RP criterion.
Descriptors: Probability, Standard Setting, Item Response Theory, Psychometrics
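
As a small numerical companion to the Huynh note (the Fisher-information derivation itself is in Huynh, 1994, and is not reproduced here): under the Rasch model, the RP67 criterion places the mapping location ln 2, about 0.69 logits, above the item difficulty, since that is where the probability of a correct response reaches 2/3.

    import numpy as np

    def rasch_p(theta, b):
        """Probability of a correct response to a Rasch item with difficulty b."""
        return 1.0 / (1.0 + np.exp(-(theta - b)))

    b = 0.4                      # hypothetical item difficulty
    theta_rp67 = b + np.log(2)   # solves rasch_p(theta, b) = 2/3 for theta

    print(theta_rp67, rasch_p(theta_rp67, b))  # b + 0.693..., probability 0.666...
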
Peer reviewed
Beretvas, S. Natasha – Applied Psychological Measurement, 2004
In the bookmark standard-setting procedure, judges place "bookmarks" in a reordered test booklet containing items presented in order of increasing difficulty. Traditionally, the bookmark difficulty location (BDL) is on the trait continuum where, for dichotomous items, there is a two-thirds probability of a correct response and, for a score of "k"…
Descriptors: Probability, Standard Setting, Item Response Theory, Test Items
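
For the Beretvas article, a minimal sketch of the traditional BDL computation for dichotomous items, assuming 2PL parameters and hypothetical values (the article itself extends the idea to polytomous items, which the truncated abstract only begins to describe): each item's BDL is the ability at which the probability of a correct response is two-thirds, and the ordered item booklet sorts items by these locations.

    import numpy as np

    # Hypothetical 2PL item parameters: discrimination a and difficulty b.
    a = np.array([1.2, 0.8, 1.5, 1.0])
    b = np.array([-0.5, 0.3, 0.9, 1.6])

    # Solve 1 / (1 + exp(-a * (theta - b))) = 2/3 for theta:
    # theta = b + ln(2) / a, the bookmark difficulty location (BDL).
    bdl = b + np.log(2) / a

    # The ordered item booklet presents items sorted by BDL; a judge's
    # bookmark corresponds to the BDL of the item at that page.
    order = np.argsort(bdl)
    print(bdl[order])
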