Publication Date
| In 2026 | 0 |
| Since 2025 | 11 |
| Since 2022 (last 5 years) | 58 |
| Since 2017 (last 10 years) | 207 |
| Since 2007 (last 20 years) | 861 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Policymakers | 46 |
| Researchers | 45 |
| Practitioners | 39 |
| Teachers | 38 |
| Administrators | 25 |
| Counselors | 6 |
| Parents | 5 |
| Students | 4 |
| Community | 2 |
| Media Staff | 2 |
Location
| Australia | 64 |
| United Kingdom | 53 |
| United States | 52 |
| Canada | 46 |
| United Kingdom (England) | 26 |
| China | 19 |
| California | 17 |
| Germany | 17 |
| Netherlands | 17 |
| Europe | 15 |
| European Union | 14 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Whetton, Chris; Twist, Elizabeth; Sainsbury, Marian – 2000
It is now seen as an economic and political necessity for countries to produce higher levels of performance across the spectrum of ability of all their students. This paper describes one example of the influence of political conditions on the process of developing assessment instruments and on measuring standards. In England, the trend is toward…
Descriptors: Academic Standards, Accountability, Educational Trends, Equated Scores
Loomis, Susan Cooper – 2000
This paper describes the feedback included in operational achievement level setting (ALS) procedures for the National Assessment of Educational Progress (NAEP). It does not describe the feedback form field trials, pilot studies, or other research studies related to the NAEP. The NAEP ALS process includes three rounds of item-by-item ratings, and…
Descriptors: Academic Achievement, Academic Standards, Civics, Classification
Peer reviewedRowley, Glenn L. – Journal of Educational Measurement, 1982
Survey research establishes that beardedness correlates with many desired outcomes and supports that a minimal level of beardedness be set as a prerequisite for high school graduation. Research problems of concept clarification, domain-definition, instrument development, and standard setting methods are discussed. The political considerations of…
Descriptors: Administrative Principles, Criterion Referenced Tests, Cutting Scores, Dress Codes
Peer reviewedBehuniak, Peter, Jr.; And Others – Educational and Psychological Measurement, 1982
This study examined how local content specialists performed when applying the Angoff and Nedelsky standard setting procedures to objective-referenced instruments in reading and mathematics. Results revealed several differences between the standard setting procedures in terms of both level and consistency of the cut scores generated. (Author/BW)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Cutting Scores, Interrater Reliability
Peer reviewedO'Neil, John – Educational Leadership, 1991
Many policymakers believe that student achievement will not increase markedly until high standards are set and quality work by all students is expected and rewarded. Performance standards are being set for the three grade levels measured by the National Assessment of Educational Progress, and a national exam is winning support. Includes seven…
Descriptors: Academic Achievement, Academic Standards, Education Work Relationship, Elementary Secondary Education
Peer reviewedGreen, Bert F. – Educational Measurement: Issues and Practice, 1995
If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)
Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores
Peer reviewedFisher, William P., Jr., Ed.; Wright, Benjamin D., Ed. – International Journal of Educational Research, 1994
This special issue demonstrates the symmetry and rigor of probabilistic conjoint measurement in practical applications in education and other human sciences. Following the opening chapter introducing the theory, other chapters focus on conjoint measurement in testing and student evaluation, standard setting, and the study of behavior and attitude.…
Descriptors: Attitude Measures, Behavior, Educational Assessment, Educational Research
Peer reviewedEducational Measurement: Issues and Practice, 1992
Eight criteria developed by the National Forum on Assessment (Washington, DC) as guidelines for evaluating existing and proposed assessment systems at any level are listed. Criteria call for clear definitions and fair assessments to assist educators and policymakers in improving instruction. Continuous review and improvement are recommended. (SLD)
Descriptors: Decision Making, Definitions, Educational Assessment, Educational Improvement
Peer reviewedNorcini, John; And Others – Applied Psychological Measurement, 1991
Effects of numbers of experts (NOEs) and common items (CIs) on the scaling of cutting scores from expert judgments were studied for 11,917 physicians taking 2 forms of a medical specialty examination. Increasing NOEs and CIs reduced error; beyond 5 experts and 25 CIs, error differences were small. (SLD)
Descriptors: Comparative Testing, Cutting Scores, Equated Scores, Estimation (Mathematics)
Peer reviewedGeisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991
Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)
Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability
Peer reviewedHeid, Camilla A. – Educational Evaluation and Policy Analysis, 1991
The education establishment's approach to set Chapter 1 standards at or near the legislative minimum for program improvement has unanticipated outcomes that do not advance the improvement effort. These outcomes and the dilemma between the negative aspects of being identified and the positive objective of program improvement are considered. (SLD)
Descriptors: Compensatory Education, Elementary Secondary Education, Evaluation Problems, Federal Legislation
The Consistency and Uncertainty in Examiners' Definitions of Pass/Fail Performance on OSCE Stations.
Peer reviewedRothman, Arthur I.; And Others – Evaluation and the Health Professions, 1996
Results of the fall 1993 administration of part two of the Medical Council of Canada's Evaluating Examination for 744 candidates provided evidence of the consistency of the pass/fail and cutting score definitions for the objective- structured clinical examination stations used across examiners. These results support the validity of this…
Descriptors: Cutting Scores, Definitions, Foreign Countries, Medical Education
American Educator, 1996
Defines four essential elements of a system of educational standards: (1) common, rigorous student achievement standards; (2) state-administered assessments; (3) explicit rewards for achievement; and (4) opportunity to reach the standards. Practices to ensure these conditions and enact a system based on these elements are discussed. (SLD)
Descriptors: Academic Achievement, Accountability, Educational Assessment, Educational Change
Peer reviewedChang, Lei; And Others – Applied Measurement in Education, 1996
The influence of judges' knowledge on standard setting for competency tests was studied with 17 judges who took an economics teacher certification test while setting competency standards using the Angoff procedure. Judges tended to set higher standards for items they answered correctly and lower standards for items they answered incorrectly. (SLD)
Descriptors: Competence, Difficulty Level, Economics, Judges
Peer reviewedCohen, Allan S.; Kane, Michael T.; Crooks, Terence J. – Applied Measurement in Education, 1999
Describes examinee-centered method for setting multiple cutscores on a test involving both objective and extended-response items. Judges evaluate a representative sample of examinee performance using a rating scale that is defined in terms of performance standards, and these ratings are linked to examinee's test scores to generate a functional…
Descriptors: Academic Standards, Achievement Tests, Constructed Response, Cutting Scores


