Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 7 |
Descriptor
Source
Author
| Linn, Robert L. | 2 |
| Alderson, J. Charles | 1 |
| Babcock, Ben | 1 |
| Baldwin, Su G. | 1 |
| Beuk, Cees H. | 1 |
| Bourque, Mary Lyn | 1 |
| Cahan, Sorel | 1 |
| Caines, Jade | 1 |
| Clauser, Brian E. | 1 |
| Cohen, Nora | 1 |
| Dillon, Gerard F. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 23 |
| Reports - Evaluative | 11 |
| Reports - Research | 8 |
| Opinion Papers | 3 |
| Speeches/Meeting Papers | 3 |
| Reports - Descriptive | 2 |
| Information Analyses | 1 |
| Reports - General | 1 |
Education Level
| Adult Education | 1 |
| Elementary Secondary Education | 1 |
| High Schools | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
| Secondary Education | 1 |
Audience
| Researchers | 1 |
Location
| United Kingdom | 2 |
| Canada | 1 |
| Japan | 1 |
| Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| National Assessment of… | 2 |
| National Teacher Examinations | 1 |
| Test of English as a Foreign… | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020
A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelist's ability to perform well the Bookmark method, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…
Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items
Saito, Hidetoshi; Sawaki, Yasuyo; Kasahara, Kiwamu – Language Assessment Quarterly, 2022
The study evaluated a recently postponed national test policy on the use of external agencies' English tests measuring four skills for Japanese university admission purposes. Using Kunnan's principles of fairness and justice, we generated and evaluated three claims regarding the test policy: vocabulary coverage of the tests, the justifiability of…
Descriptors: Ethics, Equal Education, English (Second Language), Second Language Learning
How Good Is Good Enough? Educational Standard Setting and Its Effect on African American Test Takers
Caines, Jade; Engelhard, George, Jr. – Journal of Negro Education, 2012
Standard setting (the process of establishing minimum passing scores on high-stakes exams) is a highly evaluative and policy-driven process. It is a common belief that standard setting panels should be diverse and representative. There is concern, however, that panelists with varying characteristics may differentially influence the results of the…
Descriptors: Geographic Regions, Cutting Scores, Standard Setting, African American Achievement
Alderson, J. Charles – Language Assessment Quarterly, 2011
The International Civil Aviation Association has developed a set of Language Proficiency Requirements (LPRs) and a Language Proficiency Rating Scale, which seeks to define proficiency in the language needed for aviation purposes at six different levels. Pilots, air traffic controllers and aeronautical station operators are required to achieve at…
Descriptors: Business Communication, Rating Scales, Language Proficiency, Educational Policy
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
Peer reviewedCahan, Sorel; Cohen, Nora – Educational and Psychological Measurement, 1990
A solution is offered to problems associated with the inequality in the manipulability of probabilities of classification errors of masters versus nonmasters, based on competency test results. Eschewing the typical arbitrary establishment of observed-score standards below 100 percent, the solution incorporates a self-correction of wrong answers.…
Descriptors: Classification, Error of Measurement, Mastery Tests, Minimum Competency Testing
Peer reviewedBourque, Mary Lyn; Hambleton, Ronald K. – Measurement and Evaluation in Counseling and Development, 1993
Notes that the methods used to set standards for National Assessment of Education Progress (NAEP) tests suggest recommendations for state-level policymakers. Explains the national assessment, basic assumptions in setting performance standards on NAEP, selection of judges, standard-setting methodology for NAEP, and measurement issues in setting…
Descriptors: Elementary Secondary Education, National Norms, Standard Setting (Scoring), Standards
Peer reviewedBeuk, Cees H. – Journal of Educational Measurement, 1984
A systematic method for compromise between absolute and relative examination standards is proposed. The passing score is assumed to be related to expected pass rate through a simple linear function. Results define a function relating the percentage of successful candidates given a specified passing score to the passing score. (Author/DWH)
Descriptors: Achievement Tests, Cutting Scores, Foreign Countries, Mathematical Models
McLaughlin, Milbrey W. – Phi Delta Kappan, 1991
Characterizing this special "Kappan" section on test-based accountability as a cautionary plea, this article sees five themes: tests seldom measure what matters; standardization gets confused with standards; tests constitute a limited reform lever; test-based accountability plans often misplace trust and protection; and the…
Descriptors: Accountability, Elementary Secondary Education, Multiple Choice Tests, National Competency Tests
Peer reviewedGreen, Bert F. – Educational Measurement: Issues and Practice, 1995
If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)
Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores
Peer reviewedGeisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991
Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)
Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability
Peer reviewedLinn, Robert L. – Journal of Educational Measurement, 1978
Political and educational consequences of standard setting for criterion-referenced tests are discussed. Suggestions for setting standards are given. (JKS)
Descriptors: Academic Standards, Criterion Referenced Tests, Decision Making, Evaluation Criteria
Peer reviewedLinn, Robert L. – Educational Measurement: Issues and Practice, 1982
Confusion in the terminology used in criterion-referenced measurement specifications and development and standard setting and the attendant role of cut-off scores are shown to need practical clarification through psychometric research on test applications and consequences. (CM)
Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Measurement Objectives
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
