Consistency of Supervisor and Peer Ratings of Assessment Interviews Conducted by Psychology Trainees
Gonsalvez, Craig J.; Deane, Frank P.; Caputi, Peter – British Journal of Guidance & Counselling, 2016
Observation of counsellor skills through a one-way mirror, video or audio recording followed by supervisor and peer feedback is common in counsellor training. The nature and extent of agreement between supervisor-peer dyads are unclear. Using a standard scale, supervisors and peers rated 32 interviews by psychology trainees observed through a…
Descriptors: Interviews, Supervisory Methods, Trainees, Minimum Competency Testing
Holliday, Wendy; Dance, Betty; Davis, Erin; Fagerheim, Britt; Hedrich, Anne; Lundstrom, Kacy; Martin, Pamela – College & Research Libraries, 2015
This paper outlines the process and results of an authentic assessment of student work using a revised version of the AAC&U's Information Literacy VALUE rubric. This rigorous assessment, which included the scoring of nearly 900 student papers from four different stages across the undergraduate curriculum, revealed much about the process of…
Descriptors: Information Literacy, Performance Based Assessment, Undergraduate Students, Student Evaluation
Koder, Deborah-Anne; Klahr, Amanda – Educational Gerontology, 2010
The Mini-Mental State Examination (MMSE) is one of the most commonly used instruments to screen for cognitive deficits within the hospital setting. However, training in how to administer this widely used tool is scarce, with little, if any, formal training for nursing staff. Scores are also often misused, with over-reliance on results and cut-offs to…
Descriptors: Nursing Education, Nurses, Dementia, Knowledge Level
Merrill, Beverly; Peterson, Sarah – 1986
When the Mesa, Arizona Public Schools initiated an ambitious writing instruction program in 1978, two assessments based on student writing samples were developed. The first is based on a ninth grade proficiency test. If the student does not pass the test, high school remediation is provided. After 1987, students must pass this test in order to…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Graduation Requirements, Holistic Evaluation
Humes, Ann – 1983
This paper, as an illustration of the procedures involved in a cooperative effort, describes a project in which the Southwest Regional Laboratory (SWRL) designed and developed a minimum standards test in collaboration with a large urban school district in California. The activity described focuses on the writing sample included in the test. The…
Descriptors: High Schools, Institutional Cooperation, Interrater Reliability, Minimum Competency Testing
Friedman, Charles B.; Ho, Kevin T. – 1990
Eleven judges representing 11 different geographic regions in the United States participated in a standard-setting session designed to determine the possibility of obtaining interjudge consensus and intrajudge consistency simultaneously. Each judge had experience in the field for which standards were being set. The judges rated 65 multiple-choice…
Descriptors: Evaluators, Feedback, Interrater Reliability, Licensing Examinations (Professions)
Peer reviewed: Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991
Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)
Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback
Chang, Lei; And Others – 1994
The present study examines the influence of judges' item-related knowledge on setting standards for competency tests. Seventeen judges from different professions took a 122-item teacher-certification test in economics while setting competency standards for the test using the Angoff procedure. Judges tended to set higher standards for items they…
Descriptors: Economics, Evaluators, Experience, Interrater Reliability
University of South Florida, Tampa. Coll. of Education. – 1980
This report describes the procedures followed in scoring the October 1978 Florida Minimal Writing Production Skills Assessment and reports the results of that assessment. The assessment was conducted on a sample of Florida public school students in grades 3, 5, 8, and 11. Sections include descriptions of the rating scale and scorer's guide as well…
Descriptors: Educational Assessment, Elementary Secondary Education, Interrater Reliability, Minimum Competency Testing
Peer reviewed: Mills, Craig N.; And Others – Educational Measurement: Issues and Practice, 1991
An approach is presented to the definition of minimal competence for judges to use in standard setting. Panelists in standard setting must receive training to ensure that differences in rating result from differences in perceptions of item difficulty, not from differences of opinion about the definition of minimal competence. (SLD)
Descriptors: Cutting Scores, Decision Making, Definitions, Difficulty Level
Peer reviewed: Reid, Jerry B. – Educational Measurement: Issues and Practice, 1991
Training judges to generate item ratings in standard setting once the reference group has been defined is discussed. It is proposed that sensitivity to the factors that determine difficulty can be improved through training. Three criteria for determining when training is sufficient are offered. (SLD)
Descriptors: Computer Assisted Instruction, Difficulty Level, Evaluators, Interrater Reliability
Peer reviewed: Halpin, Gerald; And Others – Educational and Psychological Measurement, 1983
Although standards are arbitrary, when multiple judgmental standard-setting procedures are used by different groups concurrently, stability across raters can be achieved and decisions can be made in a relatively judicious manner. Greater stability across methods (Ebel, Nedelsky, Angoff) may be achieved by slightly modifying the Ebel approach. (Author/PN)
Descriptors: Admission Criteria, College Entrance Examinations, Cutting Scores, Higher Education
DeMauro, Gerald E. – 1995
Studies of the Angoff method of standard setting suggest that judges agree in their estimates of the relative difficulties of test questions for minimally competent examinees and that each judge's estimates correlate well with the observed item difficulties for examinees whose total test scores are near the judge's personal standard (G. E.…
Descriptors: Ability, Competence, Construct Validity, Difficulty Level
Peer reviewed: Plake, Barbara S.; Melican, Gerald J. – Educational and Psychological Measurement, 1989
The impact of overall test length and difficulty on expert judgments of item performance made with the Nedelsky method was studied. Five university-level instructors predicting the performance of minimally competent candidates on a mathematics examination were fairly consistent in their assessments regardless of length or difficulty of the test.…
Descriptors: Difficulty Level, Estimation (Mathematics), Evaluators, Higher Education
Peer reviewed: Busch, John Christian – Applied Measurement in Education, 1988
A panel of 24 public school teachers and 37 college/university faculty members provided recommendations on minimal standards for the essay portion of the National Teacher Examinations Communication Skills Test. Public school judges' recommendations were significantly more variable than were those of college/university judges. (TJH)
Descriptors: College Faculty, Communication Skills, Elementary Secondary Education, Essay Tests
