ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	19

Descriptor

Probability	27
Standard Setting (Scoring)	27
Cutting Scores	13
Test Items	10
Item Response Theory	6
Scoring	6
Judges	5
Mathematics Tests	5
Reading Tests	5
Difficulty Level	4
Error of Measurement	4
Scores	4
Comparative Analysis	3
Criterion Referenced Tests	3
Educational Assessment	3
Foreign Countries	3
Generalizability Theory	3
Interrater Reliability	3
Multiple Choice Tests	3
Ability	2
Achievement Rating	2
Alignment (Education)	2
Benchmarking	2
Career Readiness	2
College Readiness	2
More ▼

Source

Educational and Psychological…	6
Applied Measurement in…	4
Educational Measurement:…	2
Northwest Evaluation…	2
Advances in Health Sciences…	1
Assessment & Evaluation in…	1
Educational Sciences: Theory…	1
Higher Education Studies	1
International Journal of…	1
Measurement:…	1
Practical Assessment,…	1
ProQuest LLC	1
More ▼

Publication Type

Journal Articles	19
Reports - Research	17
Reports - Evaluative	7
Speeches/Meeting Papers	3
Numerical/Quantitative Data	2
Collected Works - Serials	1
Dissertations/Theses -…	1
Reports - Descriptive	1

Education Level

Elementary Education	3
Higher Education	3
Postsecondary Education	3
Grade 8	2
Grade 11	1
Grade 5	1
Grade 7	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Australia	1
Illinois	1
Kentucky	1
Michigan	1
Minnesota	1
New York	1
United Kingdom	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	1
United States Medical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

The Choice of Response Probability in Bookmark Standard Setting: An Experimental Study

Peer reviewed

Direct link

Baldwin, Peter; Margolis, Melissa J.; Clauser, Brian E.; Mee, Janet; Winward, Marcia – Educational Measurement: Issues and Practice, 2020

Evidence of the internal consistency of standard-setting judgments is a critical part of the validity argument for tests used to make classification decisions. The bookmark standard-setting procedure is a popular approach to establishing performance standards, but there is relatively little research that reflects on the internal consistency of the…

Descriptors: Standard Setting (Scoring), Probability, Cutting Scores, Evaluation Methods

Equating Angoff Standard-Setting Ratings with the Rasch Model

Peer reviewed

Direct link

Wyse, Adam E. – Measurement: Interdisciplinary Research and Perspectives, 2018

A key part of determining cut-scores when performing Angoff standard setting is utilizing equating methods to place standard-setting ratings onto the scale used to report scores to examinees. This article describes three equating methods that can be employed to place Angoff ratings onto the scale used to report scores to examinees when applying…

Descriptors: Standard Setting (Scoring), Equated Scores, Probability, Regression (Statistics)

Relative Diagnostic Profile: A Subscore Reporting Framework

Peer reviewed

Direct link

Liu, Ren; Qian, Hong; Luo, Xiao; Woo, Ada – Educational and Psychological Measurement, 2018

Subscore reporting under item response theory models has always been a challenge partly because the test length of each subdomain is limited for precisely locating individuals on multiple continua. Diagnostic classification models (DCMs), providing a pass/fail decision and associated probability of pass on each subdomain, are promising…

Descriptors: Classification, Probability, Pass Fail Grading, Scores

The Objective Borderline Method: A Probabilistic Method for Standard Setting

Peer reviewed

Direct link

Shulruf, Boaz; Poole, Phillippa; Jones, Philip; Wilkinson, Tim – Assessment & Evaluation in Higher Education, 2015

A new probability-based standard setting technique, the Objective Borderline Method (OBM), was introduced recently. This was based on a mathematical model of how test scores relate to student ability. The present study refined the model and tested it using 2500 simulated data-sets. The OBM was feasible to use. On average, the OBM performed well…

Descriptors: Probability, Methods, Standard Setting (Scoring), Scores

Smarter Balanced Preliminary Performance Levels: Estimated MAP Scores Corresponding to the Preliminary Performance Levels of the Smarter Balanced Assessment Consortium (Smarter Balanced)

Download full text

Northwest Evaluation Association, 2015

Recently, the Smarter Balanced Assessment Consortium (Smarter Balanced) released a document that established initial performance levels and the associated threshold scale scores for the Smarter Balanced assessment. The report included estimated percentages of students expected to perform at each of the four performance levels, reported by grade…

Descriptors: Standard Setting, Standard Setting (Scoring), Pretesting, Cutting Scores

Getting Lucky: How Guessing Threatens the Validity of Performance Classifications

Peer reviewed
PDF on ERIC

Download full text

Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016

There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…

Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction

The Issue of Range Restriction in Bookmark Standard Setting

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2015

This article uses data from a large-scale assessment program to illustrate the potential issue of range restriction with the Bookmark method in the context of trying to set cut scores to closely align with a set of college and career readiness benchmarks. Analyses indicated that range restriction issues existed across different response…

Descriptors: Cutting Scores, Alignment (Education), College Readiness, Career Readiness

Increasing the Validity of Angoff Standards through Analysis of Judge-Level Internal Consistency

Peer reviewed

Direct link

Clauser, Jerome C.; Clauser, Brian E.; Hambleton, Ronald K. – Applied Measurement in Education, 2014

The purpose of the present study was to extend past work with the Angoff method for setting standards by examining judgments at the judge level rather than the panel level. The focus was on investigating the relationship between observed Angoff standard setting judgments and empirical conditional probabilities. This relationship has been used as a…

Descriptors: Standard Setting (Scoring), Validity, Reliability, Correlation

Evaluating the Consistency of Angoff-Based Cut Scores Using Subsets of Items within a Generalizability Theory Framework

Peer reviewed

Direct link

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015

The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine if a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…

Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items

Using Student Ability and Item Difficulty for Making Defensible Pass/Fail Decisions for Borderline Grades

Peer reviewed
PDF on ERIC

Download full text

Shulruf, Boaz; Jones, Phil; Turner, Rolf – Higher Education Studies, 2015

The determination of Pass/Fail decisions over Borderline grades, (i.e., grades which do not clearly distinguish between the competent and incompetent examinees) has been an ongoing challenge for academic institutions. This study utilises the Objective Borderline Method (OBM) to determine examinee ability and item difficulty, and from that…

Descriptors: Undergraduate Students, Pass Fail Grading, Decision Making, Probability

The Objective Borderline Method (OBM): A Probability-Based Model for Setting up an Objective Pass/Fail Cut-Off Score in Medical Programme Assessments

Peer reviewed

Direct link

Shulruf, Boaz; Turner, Rolf; Poole, Phillippa; Wilkinson, Tim – Advances in Health Sciences Education, 2013

The decision to pass or fail a medical student is a "high stakes" one. The aim of this study is to introduce and demonstrate the feasibility and practicality of a new objective standard-setting method for determining the pass/fail cut-off score from borderline grades. Three methods for setting up pass/fail cut-off scores were compared: the…

Descriptors: Standard Setting (Scoring), Probability, Medical Schools, Medical Students

Requiring a Consistent Unit of Scale between the Responses of Students and Judges in Standard Setting

Peer reviewed

Direct link

Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014

One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When scored dichotomously, judges estimate the probability that a benchmark student has of answering each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…

Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory

A Comparison of Bookmark and Angoff Standard Setting Methods

Peer reviewed
PDF on ERIC

Download full text

Çetin, Sevda; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2013

In this research, the cut score of a foundation university was re-calculated with bookmark method and with Angoff method, each of which is a standard setting method; and the cut scores found were compared with the current proficiency score. Thus, the final cut score was found to be 27.87 with the cooperative work of 17 experts through the Angoff…

Descriptors: Standard Setting (Scoring), Comparative Analysis, Cutting Scores, Correlation

Minnesota Linking Study: A Study of the Alignment of the NWEA RIT Scale with the Minnesota Comprehensive Assessments (MCA) Testing Program

Download full text

Northwest Evaluation Association, 2014

Recently, Northwest Evaluation Association (NWEA) completed a study to connect the scale of the Minnesota Comprehensive Assessments (MCA) Testing Program used for Minnesota's mathematics and reading assessments with NWEA's RIT (Rasch Unit) scale. Information from the state assessments was used in a study to establish performance-level scores on…

Descriptors: Alignment (Education), Testing Programs, State Programs, Mathematics Tests

Re-Conceptualization of Modified Angoff Standard Setting: Unified Statistical, Measurement, Cognitive, and Social Psychological Theories

Direct link

Iyioke, Ifeoma Chika – ProQuest LLC, 2013

This dissertation describes a design for training, in accordance with probability judgment heuristics principles, for the Angoff standard setting method. The new training with instruction, practice, and feedback tailored to the probability judgment heuristics principles was called the Heuristic training and the prevailing Angoff method training…

Descriptors: Standard Setting (Scoring), Probability, Heuristics, Training

Previous Page | Next Page »

Pages: 1 | 2

Plake, Barbara S.	4
Clauser, Brian E.	3
Ferdous, Abdullah A.	3
Shulruf, Boaz	3
Wyse, Adam E.	3
Margolis, Melissa J.	2
Poole, Phillippa	2
Turner, Rolf	2
Wilkinson, Tim	2
van der Linden, Wim J.	2
Andrich, David	1
Baldwin, Peter	1
Chis, Liliana	1
Clauser, Jerome C.	1
Davey, Tim	1
Fehrmann, Melinda L.	1
Fisher, William P., Jr., Ed.	1
Foley, Brett P.	1
Gelbal, Selahattin	1
Giraud, Gerald	1
Hambleton, Ronald K.	1
Harik, Polina	1
Heldsinger, Sandra	1
Hertzog, Melody	1
More ▼