Showing 1 to 15 of 16 results
Peer reviewed
Roberts, William L.; Boulet, John; Sandella, Jeanne – Advances in Health Sciences Education, 2017
When the safety of the public is at stake, it is particularly relevant for licensing and credentialing exam agencies to use defensible standard setting methods to categorize candidates into competence categories (e.g., pass/fail). The aim of this study was to gather evidence to support change to the Comprehensive Osteopathic Medical Licensing-USA…
Descriptors: Standard Setting, Comparative Analysis, Clinical Experience, Skill Analysis
Peer reviewed
Baird, Jo-Anne; Black, Paul – Research Papers in Education, 2013
Much has already been written on the controversies surrounding the use of different test theories in educational assessment. Other authors have noted the prevalence of classical test theory over item response theory in practice. This Special Issue draws together articles based upon work conducted on the Reliability Programme for England's…
Descriptors: Test Theory, Foreign Countries, Test Reliability, Item Response Theory
Peer reviewed
Caines, Jade; Engelhard, George, Jr. – Journal of Negro Education, 2012
Standard setting (the process of establishing minimum passing scores on high-stakes exams) is a highly evaluative and policy-driven process. It is a common belief that standard setting panels should be diverse and representative. There is concern, however, that panelists with varying characteristics may differentially influence the results of the…
Descriptors: Geographic Regions, Cutting Scores, Standard Setting, African American Achievement
Peer reviewed
Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008
Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…
Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies
Peer reviewed
Ridley, Charles R.; Shaw-Ridley, Mary – Counseling Psychologist, 2009
Clinical judgment is foundational to psychological practice. Accurate judgment forms the basis for establishing reasonable goals and selecting appropriate treatments, which in turn are essential in achieving positive therapeutic outcomes. Therefore, Spengler and colleagues' meta-analytic finding--clinical judgment accuracy improves marginally with…
Descriptors: Medical Evaluation, Clinical Experience, Inferences, Therapy
Hockett, Jessica A. – Journal for the Education of the Gifted, 2009
Legislative measures designed to ensure that all students meet minimal expectations have concerned leaders in gifted education. In this current educational climate of standards and accountability, however, there is arguably greater agreement than ever before between experts and professional organizations in general education and their counterparts…
Descriptors: Curriculum Design, General Education, Gifted, Educational Indicators
Hardison, Chaitra M.; Vilamovska, Anna-Marie – RAND Corporation, 2009
The Collegiate Learning Assessment (CLA) is a measure of how much students' critical thinking improves after attending college or university. This report illustrates how institutions can set their own standards on the CLA using a method that is appropriate for the CLA's unique characteristics. The authors examined evidence of reliability and…
Descriptors: Standard Setting, Evaluation Methods, Research Reports, Critical Thinking
New York State Education Department, 2014
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes used to develop and implement the NYSAA program, and stakeholder involvement in those processes. The purpose of this report is to document the technical aspects of the 2013-14 NYSAA.…
Descriptors: Alternative Assessment, Educational Assessment, State Departments of Education, Student Evaluation
Peer reviewed
Black, Beth; Bramley, Tom – Research Papers in Education, 2008
A new judgemental method of equating raw scores on two tests, based on rank-ordering scripts from both tests, has been developed by Bramley. The rank-ordering method has potential application as a judgemental standard-maintaining mechanism, because given a mark on one test (e.g. the A grade boundary mark), the equivalent mark (i.e. at the same…
Descriptors: Foreign Countries, Equated Scores, Test Theory, Evaluative Thinking
Peer reviewed
Grainger, Peter; Purnell, Ken; Zipf, Reyna – Assessment & Evaluation in Higher Education, 2008
Decisions by markers about quality in student work remain confusing to most students and markers. This may in part be due to the relatively subjective nature of what constitutes a quality response to an assessment task. This paper reports on an experiment that documented the process of decision-making by multiple markers at a university who…
Descriptors: Student Evaluation, Educational Quality, Achievement Rating, Interrater Reliability
van der Linden, Wim J.; Vos, Hans J.; Chang, Lei – 2000
In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of performance. Methods to check standard setting…
Descriptors: Interrater Reliability, Judges, Probability, Standard Setting
Hammersley, Martyn – International Journal of Research & Method in Education, 2007
This article addresses the perennial issue of the criteria by which qualitative research should be evaluated. At the present time, there is a sharp conflict between demands for explicit criteria, for example in order to serve systematic reviewing and evidence-based practice, and arguments on the part of some qualitative researchers that such…
Descriptors: Qualitative Research, Research Methodology, Evaluation Criteria, Research Problems
Hertz, Norman R.; Chinn, Roberta N. – 2002
Nearly all of the research on standard setting focuses on different standard setting methods rather than the interaction of group members and the instructions given to group members. This study explored the effect of deliberation style and the requirement to reach consensus on the passing score, on rater satisfaction, and on postdecision…
Descriptors: Decision Making, Evaluation Methods, Evaluators, Interaction
Peer reviewed
Meyen, Edward; Bui, Yvonne N. – Journal of Technology and Teacher Education, 2003
The Online Academy (HO29K73002) was funded by the Office of Special Education Programs (OSEP) to develop research-based online instructional modules in the content areas of reading, positive behavior support and technology across the curriculum. Targeted to preservice teacher education programs in Institutions of Higher Education (IHE), but also…
Descriptors: Teacher Education Programs, Learning Modules, Program Descriptions, Online Systems
New Mexico Public Education Department, 2007
The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of the technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…
Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring