ERIC - Search Results

Showing all 10 results

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Rounding in Angoff Ratings

Peer reviewed

Wyse, Adam E. – Practical Assessment, Research & Evaluation, 2018

One common modification to the Angoff standard-setting method is to have panelists round their ratings to the nearest 0.05 or 0.10 instead of 0.01. Several reasons have been offered as to why it may make sense to have panelists round their ratings to the nearest 0.05 or 0.10. In this article, we examine one reason that has been suggested, which is…

Descriptors: Interrater Reliability, Evaluation Criteria, Scoring Formulas, Achievement Rating

Increasing the Validity of Angoff Standards through Analysis of Judge-Level Internal Consistency

Peer reviewed

Clauser, Jerome C.; Clauser, Brian E.; Hambleton, Ronald K. – Applied Measurement in Education, 2014

The purpose of the present study was to extend past work with the Angoff method for setting standards by examining judgments at the judge level rather than the panel level. The focus was on investigating the relationship between observed Angoff standard setting judgments and empirical conditional probabilities. This relationship has been used as a…

Descriptors: Standard Setting (Scoring), Validity, Reliability, Correlation

Assessing the Viability of External Searchable Resources on the American Board of Family Medicine's Certification Examination

O'Neill, Thomas R.; Peabody, Michael R.; Stelter, Keith L.; Hagen, Michael D. – Online Submission, 2015

(Purpose) The purpose of our study was to assess the need for an external searchable resource to be used in conjunction with the American Board of Family Medicine's (ABFM) Maintenance of Certification for Family Physicians (MC-FP) Examination, discuss the philosophical question of whether an ESR should be allowed on the examination, and outline…

Descriptors: Licensing Examinations (Professions), Family Practice (Medicine), Physicians, Online Searching

The Effect of Data Format on Integration of Performance Data into Angoff Judgments

Peer reviewed

Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013

This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard settings (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…

Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability

A Comparison of Bookmark and Angoff Standard Setting Methods

Peer reviewed

Çetin, Sevda; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2013

In this research, the cut score of a foundation university was re-calculated with the bookmark method and with the Angoff method, both of which are standard setting methods, and the cut scores found were compared with the current proficiency score. Thus, the final cut score was found to be 27.87 with the cooperative work of 17 experts through the Angoff…

Descriptors: Standard Setting (Scoring), Comparative Analysis, Cutting Scores, Correlation

Psychometric Characteristics of Integrated Multi-Specialty Examinations: Ebel Ratings and Unidimensionality

Peer reviewed

Homer, Matt; Darling, Jonathan; Pell, Godfrey – Assessment & Evaluation in Higher Education, 2012

Over recent years, UK medical schools have moved to more integrated summative examinations. This paper analyses data from the written assessment of undergraduate medical students to investigate two key psychometric aspects of this type of high-stakes assessment. Firstly, the strength of the relationship between examiner predictions of item…

Descriptors: Foreign Countries, Medical Schools, Summative Evaluation, High Stakes Tests

Proficiency Standards and Cut-Scores for Language Proficiency Tests.

Peer reviewed

Moy, Raymond H. – System, 1984

Discusses the problems associated with "grading on a curve," the approach often used for standard setting on language proficiency tests. Proposes four main steps presented in the setting of a non-arbitrary cut-score. These steps not only establish a proficiency standard checked by external criteria, but also check to see that the test covers the…

Descriptors: Cloze Procedure, Correlation, Dictation, English (Second Language)

A Generalizability Study of the Angoff Method Applied to Setting Cutoff Scores of Professional Certification Tests.

Cope, Ronald T. – 1987

This study used generalizability theory and other statistical concepts to assess the application of the Angoff method to setting cutoff scores on two professional certification tests. A panel of ten judges gave pre- and post-feedback Angoff probability ratings of items of two forms of a professional certification test, and another panel of nine…

Descriptors: Certification, Correlation, Cutting Scores, Error of Measurement

Cross-State Comparability of Judgments of Student Writing: Results from the New Standards Project.

Peer reviewed

Linn, Robert L.; And Others – Applied Measurement in Education, 1992

Ten states participated in a cross-state scoring workshop in 1991, evaluating writing from elementary school, middle school, and high school students. Correlations of scores assigned by readers from one state with those from readers from another state were generally quite high. Implications for defining common standards are discussed. (SLD)

Descriptors: Comparative Analysis, Correlation, Elementary School Students, Elementary Secondary Education
