ERIC - Search Results

Showing 1 to 15 of 145 results

Embedding Embedded Standard Setting: An Application of Cross-Classified Item Response Theory. CRESST Report 876

Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025

This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…

Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level

Embedded Standard Setting for Credentialing

Peer reviewed

Daniel Lewis; Melanie Graw; Michael Baker – Journal of Applied Testing Technology, 2024

Embedded Standard Setting (ESS; Lewis & Cook, 2020) transforms standard setting from a standalone workshop to an active part of the assessment development lifecycle. ESS purports to lower costs by eliminating the standard-setting workshop and enhance the validity argument by maintaining a consistent focus on the evidentiary relationship…

Descriptors: Standard Setting (Scoring), Test Items, Test Construction, Food Service

Evaluating Methodological Enhancements to the Yes/No Angoff Standard-Setting Method in Language Proficiency Assessment

Peer reviewed

Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024

This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…

Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods

Setting and Validating Multiple Standards on a Multistage-Adaptive Test

Peer reviewed

Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022

Setting cut scores on multistage-adaptive tests (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…

Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis

Using Diagnostic Profiles to Describe Borderline Performance in Standard Setting

Peer reviewed

Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020

In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…

Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles

Examining the Impact of a Consensus Approach to Content Alignment Studies

Peer reviewed

Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020

Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…

Descriptors: Test Content, Test Items, Discussion, Test Validity

A Method for Detecting Regression of Hard and Easy Item Angoff Ratings

Peer reviewed

Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2019

One common phenomenon in Angoff standard setting is that panelists regress their ratings in toward the middle of the probability scale. This study describes two indices based on taking ratios of standard deviations that can be utilized with a scatterplot of item ratings versus expected probabilities of success to identify whether ratings are…

Descriptors: Item Analysis, Standard Setting, Probability, Feedback (Response)

It's Not Just Angoff: Misperceptions of Hard and Easy Items in Bookmark-Type Ratings

Peer reviewed

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020

A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…

Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items

Embedded Standard Setting: Aligning Standard-Setting Methodology with Contemporary Assessment Design Principles

Peer reviewed

Lewis, Daniel; Cook, Robert – Educational Measurement: Issues and Practice, 2020

In this paper we assert that the practice of principled assessment design renders traditional standard-setting methodology redundant at best and contradictory at worst. We describe the rationale for, and methodological details of, Embedded Standard Setting (ESS; previously Engineered Cut Scores; Lewis, 2016), an approach to establish performance…

Descriptors: Standard Setting, Evaluation, Cutting Scores, Performance Based Assessment

Examining the Cut-Off Score of the English B1 Progression Exam According to Different Standard Setting Methods

Peer reviewed

Rümeysa Kaya; Bayram Çetin – International Journal of Assessment Tools in Education, 2025

In this study, the cut-off scores obtained from the Angoff, Angoff Y/N, Nedelsky and Ebel standard-setting methods were compared with a T score of 50 and the current cut-off score in various aspects. Data were collected from 448 students who took Module B1+ English Exit Exam IV and from 14 experts. It was seen that while the Nedelsky method gave the lowest…

Descriptors: Standard Setting, Cutting Scores, Exit Examinations, Academic Achievement

Aligning Academic Reading Tests to the Common European Framework of Reference for Languages (CEFR)

Peer reviewed

Sivakorn Tangsakul; Kornwipa Poonpon – rEFLections, 2024

Given the significant global influence of the Common European Framework of Reference for Languages: Teaching, Learning, and Assessment (CEFR) on English language education, this study deals with aligning a university's academic reading tests to the CEFR. It aimed at validating the test construct of the academic reading tests in relation to the…

Descriptors: Alignment (Education), Reading Tests, Second Language Learning, Language Proficiency

Comparison of Passing Scores Determined by the Angoff Method in Different Item Samples

Peer reviewed

Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020

In this study, the efficiency of various random sampling methods to reduce the number of items rated by judges in an Angoff standard-setting study was examined and the methods were compared with each other. First, the full-length test was formed by combining the Placement Test 2012 and 2013 mathematics subsets. Then, simple random sampling…

Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Exploring the Influence of Judge Proficiency on Standard-Setting Judgments

Peer reviewed

Peabody, Michael R.; Wind, Stefanie A. – Journal of Educational Measurement, 2019

Setting performance standards is a judgmental process involving human opinions and values as well as technical and empirical considerations. Although all cut score decisions are by nature somewhat arbitrary, they should not be capricious. Judges selected for standard-setting panels should have the proper qualifications to make the judgments asked…

Descriptors: Standard Setting, Decision Making, Performance Based Assessment, Evaluators

New Meridian Comparability Review Guidelines. Version 6.17.2020

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…

Descriptors: Testing, Standards, Comparative Analysis, Guidelines
