ERIC - Search Results

Showing all 14 results

Setting and Validating Multiple Standards on a Multistage-Adaptive Test

Peer reviewed

Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022

Setting cut scores on multistage-adaptive tests (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…

Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis

Bridging the Standard Setting Gap via Assessment Engineering

Peer reviewed

Furter, Robert T. – Journal of Applied Testing Technology, 2019

Standard setting is the process of identifying the point(s) on a scale that serve to differentiate between individuals of distinct proficiency levels. While standard setting is ultimately a policy decision, most of the process is carried out by subject matter experts who are tasked with reconciling item-level or examinee-level information (e.g.…

Descriptors: Standard Setting, Cutting Scores, Decision Making, Test Construction

Using an Innovative Standard-Setting Approach to Align Integrated and Independent Writing Tasks to the CEFR

Peer reviewed

Harsch, Claudia; Kanistra, Voula Paraskevi – Language Assessment Quarterly, 2020

We report on a standard-setting project in which the Item-Descriptor-Matching Method (IDM) and a complementary benchmarking approach were employed to align a suite of English language proficiency exams to the "Common European Framework of Reference" (CEFR), with a particular focus on the integrated and independent writing exams. Judges'…

Descriptors: Standard Setting, Guidelines, Rating Scales, Definitions

Effect of Content Knowledge on Angoff-Style Standard Setting Judgments

Peer reviewed

Margolis, Melissa J.; Mee, Janet; Clauser, Brian E.; Winward, Marcia; Clauser, Jerome C. – Educational Measurement: Issues and Practice, 2016

Evidence to support the credibility of standard setting procedures is a critical part of the validity argument for decisions made based on tests that are used for classification. One area in which there has been limited empirical study is the impact of standard setting judge selection on the resulting cut score. One important issue related to…

Descriptors: Academic Standards, Standard Setting (Scoring), Cutting Scores, Credibility

The Effect of Small Group Discussion on Cutoff Scores during Standard Setting

Peer reviewed

Deunk, Marjolein I.; van Kuijk, Mechteld F.; Bosker, Roel J. – Applied Measurement in Education, 2014

Standard setting methods, like the Bookmark procedure, are used to assist education experts in formulating performance standards. Small group discussion is meant to help these experts in setting more reliable and valid cutoff scores. This study is an analysis of 15 small group discussions during two standards setting trajectories and their effect…

Descriptors: Cutting Scores, Standard Setting, Group Discussion, Reading Tests

Using the Many-Facet Rasch Model to Evaluate Standard-Setting Judgments: Setting Performance Standards for Advanced Placement® Examinations

Kaliski, Pamela; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna; Plake, Barbara; Reshetar, Rosemary – College Board, 2012

The Many-Facet Rasch (MFR) Model is traditionally used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR Model by examining the quality of ratings obtained from a…

Descriptors: Advanced Placement Programs, Achievement Tests, Item Response Theory, Models

Setting Standards for English Foreign Language Assessment: Methodology, Validation, and a Degree of Arbitrariness

Peer reviewed

Tiffin-Richards, Simon P.; Pant, Hans Anand; Koller, Olaf – Educational Measurement: Issues and Practice, 2013

Cut-scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard-setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item…

Descriptors: Foreign Countries, English (Second Language), Language Tests, Standard Setting (Scoring)

Using the Many-Faceted Rasch Model to Evaluate Standard Setting Judgments: An Illustration with the Advanced Placement Environmental Science Exam

Peer reviewed

Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A. – Educational and Psychological Measurement, 2013

The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…

Descriptors: Item Response Theory, Models, Standard Setting (Scoring), Science Tests

Ready for Kindergarten: Maryland's Early Childhood Comprehensive Assessment System. The 2017-2018 Kindergarten Readiness Assessment Technical Report

Maryland State Department of Education, 2018

Based on Maryland's 2017-2018 Kindergarten Readiness Assessment (KRA) results, nearly half of all entering kindergarten children show foundational skills indicating they are fully ready for kindergarten, more than a third are approaching readiness, and 18% have emerging readiness skills. Results for the 2017-2018 school year show a slight increase…

Descriptors: Kindergarten, School Readiness, Academic Standards, Gender Differences

The Bookmark Standard-Setting Method: A Literature Review

Peer reviewed

Karantonis, Ana; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2006

The Bookmark method for setting standards on educational tests is currently one of the most popular standard-setting methods. However, research to support the method is scarce. In this report, we review the published and unpublished literature on this method as well as some seminal work in the area of evaluating standard-setting studies. Our…

Descriptors: Academic Standards, Educational Testing, Literature Reviews, Validity

Validity of Item Performance Estimates from an Angoff Standard Setting Study.

Irwin, Patrick M.; Plake, Barbara S.; Impara, James C. – 2000

Judgmental standard setting methods, such as the W. H. Angoff (1971) method, use item performance estimates as the basis for determining the minimum passing score (MPS). Therefore, the accuracy of these item performance estimates is crucial to the validity of the resulting MPS. Recent researchers (L. A. Shepard, 1994; J. Impara, 1997) have called…

Descriptors: Estimation (Mathematics), Judges, Licensing Examinations (Professions), Performance Factors

Validation of Angoff-based Predictions of Item Performance.

Plake, Barbara S.; Impara, James C.; Irwin, Patrick – 1999

Judgmental standard setting methods, such as the Angoff method (W. Angoff, 1971), use item performance estimates as the basis for determining the minimum passing score (MPS). Therefore, the accuracy of these item performance estimates is crucial to the validity of the resulting MPS. Recent researchers (L. Shepard, 1994; J. Impara, 1997) have called…

Descriptors: Cutting Scores, Estimation (Mathematics), Judges, Performance Factors

Analysis of Item Ratings for Ensuring the Procedural Validity of the 1998 NAEP Achievement-Levels Setting.

Yang, Wen-Ling – 2000

The Achievement-Levels Setting (ALS) process for the National Assessment of Educational Progress (NAEP) resulted in numerical cutscores on the NAEP score scale representing the performance standards for three achievement levels: Basic, Proficient, and Advanced. This paper focuses on an important, but less researched, aspect of the standard setting…

Descriptors: Academic Achievement, Academic Standards, Civics, Evaluation Methods

A Generalizability Study of the Angoff Method Applied to Setting Cutoff Scores of Professional Certification Tests.

Cope, Ronald T. – 1987

This study used generalizability theory and other statistical concepts to assess the application of the Angoff method to setting cutoff scores on two professional certification tests. A panel of ten judges gave pre- and post-feedback Angoff probability ratings of items of two forms of a professional certification test, and another panel of nine…

Descriptors: Certification, Correlation, Cutting Scores, Error of Measurement
