Publication Date
| Period | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 1 |
Descriptor
| Descriptor | Count |
| --- | --- |
| Mathematical Models | 11 |
| Standard Setting (Scoring) | 11 |
| Cutting Scores | 8 |
| Evaluators | 4 |
| Latent Trait Theory | 4 |
| Standards | 4 |
| Error of Measurement | 3 |
| Foreign Countries | 3 |
| Reliability | 3 |
| Secondary Education | 3 |
| Test Items | 3 |
Publication Type
| Publication Type | Count |
| --- | --- |
| Journal Articles | 6 |
| Reports - Evaluative | 5 |
| Reports - Research | 5 |
| Speeches/Meeting Papers | 4 |
| Opinion Papers | 1 |
| Reports - Descriptive | 1 |
Education Level
| Education Level | Count |
| --- | --- |
| Secondary Education | 1 |
Audience
| Audience | Count |
| --- | --- |
| Researchers | 1 |
Location
| Location | Count |
| --- | --- |
| Netherlands | 1 |
| United Kingdom (England) | 1 |
Benton, Tom; Elliott, Gill – Research Papers in Education, 2016
In recent years the use of expert judgement to set and maintain examination standards has been increasingly criticised in favour of approaches based on statistical modelling. This paper reviews existing research on this controversy and attempts to unify the evidence within a framework where expertise is utilised in the form of comparative…
Descriptors: Reliability, Expertise, Mathematical Models, Standard Setting (Scoring)
Krippendorff, Klaus – 1992
When one wants to set data reliability standards for a class of scientific inquiries or when one needs to compare and select among many different kinds of data with reliabilities that are crucial to a particular research undertaking, then one needs a single reliability coefficient that is adaptable to all or most situations. Work toward this goal…
Descriptors: Definitions, Equations (Mathematics), Mathematical Models, Reliability
Hester, Yvette – 1993
Some of the different approaches to standard setting are discussed. Brief comments and references are offered concerning strategies that rely primarily on the use of expert judgment. Controversy surrounds methods that use expert judges, as well as those using test groups to set scores empirically. A minimax procedure developed by H. Huynh, an…
Descriptors: Academic Standards, Classification, Cutting Scores, Evaluation Methods
[Peer reviewed] Kane, Michael T. – Journal of Educational Measurement, 1987
The use of item response theory models for analyzing the results of judgmental standard setting studies (the Angoff technique) for establishing minimum pass levels is discussed. A comparison of three methods indicates the traditional approach may not be best. A procedure based on generalizability theory is suggested. (GDC)
Descriptors: Comparative Analysis, Cutting Scores, Generalizability Theory, Latent Trait Theory
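The "traditional approach" this abstract refers to is the usual Angoff aggregation: average the judges' estimated minimum pass levels for each item, then sum those averages across items. A minimal sketch of that baseline (function name and data are illustrative, not taken from the paper):

```python
def angoff_cut_score(mpl):
    """Traditional Angoff aggregation.

    `mpl` is a judges x items matrix of estimated probabilities that a
    borderline examinee answers each item correctly. The raw cut score
    is the sum, over items, of the mean probability across judges.
    """
    n_judges = len(mpl)
    n_items = len(mpl[0])
    return sum(
        sum(judge[item] for judge in mpl) / n_judges
        for item in range(n_items)
    )

# Two judges rating a three-item test
print(angoff_cut_score([[0.6, 0.5, 0.8],
                        [0.7, 0.6, 0.7]]))  # ≈ 1.95
```

Kane's point is that this equal-weight sum is only one of several ways to combine the judges' estimates; the IRT- and generalizability-based alternatives he compares re-weight or transform the same minimum pass levels.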
van der Linden, Wim J. – 1981
It has often been argued that all techniques of standard setting are arbitrary and likely to yield different results for different techniques or persons. This paper deals with a related but hitherto ignored aspect of standard setting, namely, the possibility that Angoff or Nedelsky judges misspecify the probabilities of the borderline student's…
Descriptors: Error of Measurement, Evaluators, Foreign Countries, Latent Trait Theory
[Peer reviewed] Beuk, Cees H. – Journal of Educational Measurement, 1984
A systematic method for compromise between absolute and relative examination standards is proposed. The passing score is assumed to be related to expected pass rate through a simple linear function. Results define a function relating the percentage of successful candidates given a specified passing score to the passing score. (Author/DWH)
Descriptors: Achievement Tests, Cutting Scores, Foreign Countries, Mathematical Models
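The compromise the abstract describes is commonly formulated like this: each judge states a preferred passing score and an expected pass rate; a line with slope s_y/s_x (the ratio of the judges' standard deviations) is drawn through the judges' mean point and intersected with the empirical curve p(c) = share of candidates scoring at least c. A minimal sketch under that assumed formulation (names and data are illustrative):

```python
from statistics import mean, stdev

def beuk_compromise(scores, judge_cuts, judge_rates, step=0.1):
    """Sketch of a Beuk-style compromise cut score.

    Assumes the common formulation: the line through the judges' mean
    point (mean cut, mean pass rate) with slope s_y/s_x is intersected
    with the empirical pass-rate curve, which decreases in the cut
    score, so the two cross exactly once.
    """
    x_bar, y_bar = mean(judge_cuts), mean(judge_rates)
    # More agreement on the cut score (small s_x) steepens the line,
    # pulling the compromise toward the absolute standard; agreement
    # on the pass rate flattens it toward the relative standard.
    slope = stdev(judge_rates) / stdev(judge_cuts)

    best_cut, best_gap = None, float("inf")
    c = min(scores)
    while c <= max(scores):
        pass_rate = sum(s >= c for s in scores) / len(scores)
        line = y_bar + slope * (c - x_bar)
        gap = abs(pass_rate - line)
        if gap < best_gap:
            best_cut, best_gap = c, gap
        c += step
    return best_cut

# 101 candidates with scores 0..100; three judges
print(beuk_compromise(list(range(101)),
                      judge_cuts=[55, 60, 65],
                      judge_rates=[0.7, 0.6, 0.5]))
```

Here the judges want a cut near 60 but a pass rate their own cut would not deliver, so the compromise lands a few points below 60.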
[Peer reviewed] Van der Linden, Wim J. – Journal of Educational Measurement, 1982
An ignored aspect of standard setting, namely the possibility that Angoff or Nedelsky judges specify inconsistent probabilities (e.g., low probabilities for easy items but large probabilities for hard items) is explored. A latent trait method is proposed to estimate such misspecifications, and an index of consistency is defined. (Author/PN)
Descriptors: Cutting Scores, Latent Trait Theory, Mastery Tests, Mathematical Models
deGruijter, Dato N. M. – 1980
The setting of standards involves subjective value judgments. The inherent arbitrariness of specific standards has been severely criticized by Glass. His antagonists agree that standard setting is a judgmental task but they have pointed out that arbitrariness in the positive sense of serious judgmental decisions is unavoidable. Further, small…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
[Peer reviewed] Plake, Barbara S.; Kane, Michael T. – Journal of Educational Measurement, 1991
Several methods for determining a passing score on an examination from individual raters' estimates of minimal pass levels were compared through simulation. The methods differed in the weight that each item's estimates received in the aggregation process. Reasons why the simplest procedure is most preferred are discussed. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Cutting Scores, Estimation (Mathematics)
[Peer reviewed] Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
van der Linden, Wim J. – 1982
A latent trait method is presented to investigate the possibility that Angoff or Nedelsky judges specify inconsistent probabilities in standard setting techniques for objectives-based instructional programs. It is suggested that judges frequently specify a low probability of success for an easy item but a large probability for a hard item. The…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Interrater Reliability