ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	3

Descriptor

Interrater Reliability	21
Test Items	21
Standard Setting (Scoring)	18
Cutting Scores	11
Judges	9
Scoring	9
Standards	9
Difficulty Level	8
Licensing Examinations…	6
Criterion Referenced Tests	5
Error of Measurement	4
Evaluators	4
Higher Education	4
Minimum Competency Testing	4
Multiple Choice Tests	4
Test Construction	4
Academic Standards	3
Certification	3
Comparative Analysis	3
Educational Assessment	3
Elementary Secondary Education	3
English	3
Evaluation Methods	3
Item Response Theory	3
Mathematics Tests	3
More ▼

Source

New Mexico Public Education…	2
Applied Measurement in…	1
Online Submission	1

Publication Type

Speeches/Meeting Papers	15
Reports - Research	13
Reports - Evaluative	5
Reports - Descriptive	3
Numerical/Quantitative Data	2
Journal Articles	1

Education Level

Elementary Secondary Education

Audience

Researchers

Location

New Mexico	2
California	1
New Jersey	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Regression Effects in Angoff Ratings: Examples from Credentialing Exams

Peer reviewed

Direct link

Wyse, Adam E. – Applied Measurement in Education, 2018

This article discusses regression effects that are commonly observed in Angoff ratings where panelists tend to think that hard items are easier than they are and easy items are more difficult than they are in comparison to estimated item difficulties. Analyses of data from two credentialing exams illustrate these regression effects and the…

Descriptors: Regression (Statistics), Test Items, Difficulty Level, Licensing Examinations (Professions)

Assessing the Viability of External Searchable Resources on the American Board of Family Medicine's Certification Examination

Download full text

O'Neill, Thomas R.; Peabody, Michael R.; Stelter, Keith L.; Hagen, Michael D. – Online Submission, 2015

(Purpose) The purpose of our study was to assess the need for an external searchable resource to be used in conjunction with the American Board of Family Medicine's (ABFM) Maintenance of Certification for Family Physicians (MC-FP) Examination, discuss the philosophical question of whether an ESR should be allowed on the examination, and outline…

Descriptors: Licensing Examinations (Professions), Family Practice (Medicine), Physicians, Online Searching

Detecting Intrajudge Inconsistency in Standard Setting Using Test Items with a Selected-Response Format. Research Report.

Download full text

van der Linden, Wim J.; Vos, Hans J.; Chang, Lei – 2000

In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of performance. Methods to check standard setting…

Descriptors: Interrater Reliability, Judges, Probability, Standard Setting

Intrajudge Consistency Using the Angoff Standard-Setting Method.

Download full text

Plake, Barbara S.; Impara, James C. – 1996

This study investigated the intrajudge consistency of Angoff-based item performance estimates. The examination used was a certification examination in an emergency medicine specialty. Ten expert panelists rated the same 24 items twice during an operational standard setting study. Results indicate that the panelists were highly consistent, in terms…

Descriptors: Cutting Scores, Interrater Reliability, Licensing Examinations (Professions), Performance Based Assessment

Establishing Upper Limits for Item Ratings for the Angoff Method: Are Resulting Standards More 'Realistic'?

Reid, Jerry B. – 1985

This report investigates an area of uncertainty in using the Angoff method for setting standards, namely whether or not a judge's conceptualizations of borderline group performance are realistic. Ratings are usually made with reference to the performance of this hypothetical group, therefore the Angoff method's success is dependent on this point.…

Descriptors: Certification, Cutting Scores, Difficulty Level, Interrater Reliability

Does a Standard Reflect Minimal Competency of Examinees or Judge Competency?

Download full text

Chang, Lei; And Others – 1994

The present study examines the influence of judges' item-related knowledge on setting standards for competency tests. Seventeen judges from different professions took a 122-item teacher-certification test in economics while setting competency standards for the test using the Angoff procedure. Judges tended to set higher standards for items they…

Descriptors: Economics, Evaluators, Experience, Interrater Reliability

Judgmental Standard Setting Using a Cognitive Components Model.

Download full text

McGinty, Dixie; Neel, John H. – 1996

A new standard setting approach is introduced, called the cognitive components approach. Like the Angoff method, the cognitive components method generates minimum pass levels (MPLs) for each item. In both approaches, the item MPLs are summed for each judge, then averaged across judges to yield the standard. In the cognitive components approach,…

Descriptors: Cognitive Processes, Criterion Referenced Tests, Evaluation Methods, Grade 3

A Generalizability Study of the Angoff Method Applied to Setting Cutoff Scores of Professional Certification Tests.

Cope, Ronald T. – 1987

This study used generalizability theory and other statistical concepts to assess the application of the Angoff method to setting cutoff scores on two professional certification tests. A panel of ten judges gave pre- and post-feedback Angoff probability ratings of items of two forms of a professional certification test, and another panel of nine…

Descriptors: Certification, Correlation, Cutting Scores, Error of Measurement

An Empirical Comparison of Judgmental Approaches to Standard Setting Procedures.

Download full text

Rock, D. A.; And Others – 1980

An experiment was designed that varied cutting score procedures, instructions, and types of judges in order to address the following questions concerning the Real Estate Licensing Examination: (1) Will the cutting score levels produced by groups of judges from differing backgrounds (academicians vs. practitioners vs. lawyers) using the same method…

Descriptors: Competence, Content Analysis, Criterion Referenced Tests, Cutting Scores

A Comparison of the Paper Selection Method and the Contrasting Groups Method for Setting Standards on Constructed-Response Items.

Download full text

Webb, Melvin W., II; Miller, Eva R. – 1995

As constructed-response items become an integral part of educational assessments, setting student performance standards on constructed-response items has become an important issue. Two standard-setting methods, one used for setting standards on the National Assessment of Educational Progress (NAEP) in reading in grade 8 and the other used to set…

Descriptors: Comparative Analysis, Constructed Response, Criteria, Educational Assessment

Assessing Inconsistencies in Standard Setting with the Angoff or Nedelsky Technique.

Download full text

van der Linden, Wim J. – 1982

A latent trait method is presented to investigate the possibility that Angoff or Nedelsky judges specify inconsistent probabilities in standard setting techniques for objectives-based instructional programs. It is suggested that judges frequently specify a low probability of success for an easy item but a large probability for a hard item. The…

Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Interrater Reliability

Interjudge Variability and Intrajudge Consistency Using the Cognitive Components Model for Standard Setting.

Download full text

McGinty, Dixie; Neel, John H.; Hsu, Yu-Sheng – 1996

The cognitive components standard setting method, recently introduced by D. McGinty and J. Neel (1996), asks judges to specify minimum levels of performance not for the test items, but for smaller portions of items, the component skills and concepts required to answer each item correctly. Items are decomposed into these components before judges…

Descriptors: Cognitive Processes, Criterion Referenced Tests, Elementary Education, Evaluation Methods

A Comparison between the Nedelsky and Angoff Standard-Setting Methods.

Download full text

Chang, Lei – 1996

It was hypothesized that, when compared to the Angoff method (W. H. Angoff, 1971), the Nedelsky method (L. Nedelsky, 1954) for standard setting had lower intrajudge inconsistency, lower cutscores, and lower cutscores especially for items presenting challenges to the judges. These hypotheses were tested and supported in a sample of 22 graduate…

Descriptors: Comparative Analysis, Cutting Scores, Difficulty Level, Distractors (Tests)

Construct Validation of Minimum Competence in Standard Setting. Revised.

Download full text

DeMauro, Gerald E. – 1995

Studies of the Angoff method of standard setting suggest that judges agree in their estimates of the relative difficulties of test questions for minimally competent examinees and that each judge's estimates correlate well with the observed item difficulties for examinees whose total test scores are near the judge's personal standard (G. E.…

Descriptors: Ability, Competence, Construct Validity, Difficulty Level

Technical Issues in Performance Assessment: Setting Performance Standards.

Download full text

Hansche, Linda – 1994

Setting standards on performance measures is discussed in the context of the State Collaborative on Assessment and Student Standards (SCASS) initiative supported by the Council of Chief State School Offices. The usual item-based methods for standard setting, the methods developed by Nedelsky (1954), Angoff (1971), and Ebel (1972), were developed…

Descriptors: Decision Making, Educational Assessment, Educational Policy, Elementary Secondary Education

Previous Page | Next Page »

Pages: 1 | 2

Chang, Lei	3
McGinty, Dixie	2
Neel, John H.	2
van der Linden, Wim J.	2
Cope, Ronald T.	1
DeMauro, Gerald E.	1
Garrido, Mariquita	1
Griph, Gerald W.	1
Hagen, Michael D.	1
Hansche, Linda	1
Hsu, Yu-Sheng	1
Impara, James C.	1
Jaeger, Richard M.	1
Miller, Eva R.	1
O'Neill, Thomas R.	1
Payne, David A.	1
Peabody, Michael R.	1
Plake, Barbara S.	1
Reckase, Mark D.	1
Reid, Jerry B.	1
Rock, D. A.	1
Stelter, Keith L.	1
Vos, Hans J.	1
Webb, Melvin W., II	1
More ▼