ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	9

Source

International Journal of…

Publication Type

Journal Articles	9
Reports - Descriptive	4
Reports - Evaluative	3
Reports - Research	2

Education Level

Elementary Secondary Education	2
Grade 5	2
Elementary Education	1
Grade 3	1
Grade 7	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1

Audience

Location

Africa	1
South Africa	1

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
United States Medical…	1

What Works Clearinghouse Rating

International Journal of Testing X

Showing all 9 results Save | Export

Identifying and Evaluating External Validity Evidence for Passing Scores

Peer reviewed

Direct link

Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013

A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…

Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores

Standard Setting to an International Reference Framework: Implications for Theory and Practice

Peer reviewed

Direct link

Lim, Gad S.; Geranpayeh, Ardeshir; Khalifa, Hanan; Buckendahl, Chad W. – International Journal of Testing, 2013

Standard setting theory has largely developed with reference to a typical situation, determining a level or levels of performance for one exam for one context. However, standard setting is now being used with international reference frameworks, where some parameters and assumptions of classical standard setting do not hold. We consider the…

Descriptors: Standard Setting (Scoring), Validity, Models, Language Tests

The Effect of Data Format on Integration of Performance Data into Angoff Judgments

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013

This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard settings (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…

Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability

Standard Setting Lessons Learned in the South African Context: Implications for International Implementation

Peer reviewed

Direct link

Pitoniak, Mary J.; Yeld, Nan – International Journal of Testing, 2013

Criterion-referenced assessments have become more common around the world, with performance standards being set to differentiate different levels of student performance. However, use of standard setting methods developed in the United States may be complicated by factors related to the political and educational contexts within another country. In…

Descriptors: Standard Setting (Scoring), Criterion Referenced Tests, Benchmarking, Student Evaluation

Applying Rasch Model and Generalizability Theory to Study Modified-Angoff Cut Scores

Peer reviewed

Direct link

Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012

The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…

Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling

Evaluating the Bookmark Standard Setting Method: The Impact of Random Item Ordering

Peer reviewed

Direct link

Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack – International Journal of Testing, 2011

Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…

Descriptors: Standard Setting (Scoring), Cutting Scores, Educational Testing, Licensing Examinations (Professions)

Evaluating Panelists' Standard Setting Perceptions in a Developing Nation

Peer reviewed

Direct link

Ferdous, Abdullah A.; Buckendahl, Chad W. – International Journal of Testing, 2013

Considerable research about standard setting has revolved around a U.S.-centric policy context. That is, over the past decade, conclusions about thought processes and the interaction of education policy and panelists' judgments have been based on assumptions of comparable policy settings. However, whether these assumptions generalize to other…

Descriptors: Standard Setting (Scoring), Cognitive Processes, Mathematics Tests, Language Tests

Objective Standard Setting for Judge-Mediated Examinations

Peer reviewed

Direct link

Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008

Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…

Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies

Scoring Guide Alignment: Combining Scorer Judgments with Item Parameter Estimates to Set Cut Scores

Peer reviewed

Direct link

Childs, Ruth A.; Jaciw, Andrew P.; Saunders, Kelsey – International Journal of Testing, 2007

Many approaches to standard-setting use item calibration and student score estimation results to structure panelists' tasks. However, this requires collecting standard-setting judgments after the item analysis results are available. The Scoring Guide Alignment approach collects standard-setting judgments during the scoring sessions from teachers…

Descriptors: Testing Programs, Scoring, Item Analysis, Test Items

Standard Setting (Scoring)	9
Cutting Scores	7
Validity	4
Licensing Examinations…	3
Test Items	3
Criterion Referenced Tests	2
Difficulty Level	2
English (Second Language)	2
Foreign Countries	2
Item Response Theory	2
Language Tests	2
Second Language Learning	2
Student Evaluation	2
Testing Programs	2
Attitudes	1
Benchmarking	1
Case Studies	1
Cognitive Processes	1
College Bound Students	1
Computation	1
Control Groups	1
Correlation	1
Data	1
Developing Nations	1
Educational History	1
More ▼

Buckendahl, Chad W.	4
Davis-Becker, Susan L.	2
Arce, Alvaro J.	1
Beltyukova, Svetlana	1
Childs, Ruth A.	1
Clauser, Brian E.	1
Ferdous, Abdullah A.	1
Fox, Christine M.	1
Geranpayeh, Ardeshir	1
Gerrow, Jack	1
Jaciw, Andrew P.	1
Khalifa, Hanan	1
Lim, Gad S.	1
Margolis, Melissa J.	1
Mee, Janet	1
Pitoniak, Mary J.	1
Saunders, Kelsey	1
Stone, Gregory Ethan	1
Wang, Ze	1
Yeld, Nan	1
More ▼