ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	9
Since 2017 (last 10 years)	38
Since 2007 (last 20 years)	79

Descriptor

Test Items	145
Standard Setting (Scoring)	94
Cutting Scores	62
Standard Setting	51
Difficulty Level	46
Test Construction	39
Scoring	31
Standards	28
Item Analysis	26
Item Response Theory	26
Licensing Examinations…	25
Test Validity	24
Interrater Reliability	21
Evaluation Methods	19
Mathematics Tests	19
Foreign Countries	18
Criterion Referenced Tests	17
Evaluators	16
Psychometrics	15
Test Reliability	15
Academic Standards	14
Error of Measurement	14
Judges	14
Minimum Competency Testing	14
Scores	14
More ▼

Publication Type

Reports - Research	84
Journal Articles	81
Speeches/Meeting Papers	36
Reports - Evaluative	30
Reports - Descriptive	17
Numerical/Quantitative Data	8
Opinion Papers	4
Tests/Questionnaires	4
Dissertations/Theses -…	3
Guides - General	3
Guides - Classroom - Teacher	2
Information Analyses	2
Guides - Non-Classroom	1
Non-Print Media	1
Reference Materials - General	1
Reports - General	1
More ▼

Education Level

Secondary Education	16
Higher Education	14
Junior High Schools	11
Middle Schools	11
Postsecondary Education	11
Elementary Education	10
Elementary Secondary Education	10
Grade 8	7
Grade 5	5
High Schools	4
Intermediate Grades	3
Adult Education	2
Grade 11	2
Grade 12	2
Grade 4	2
Grade 6	2
Grade 7	2
Early Childhood Education	1
Grade 10	1
Grade 3	1
Grade 9	1
Kindergarten	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Researchers	4
Practitioners	2
Administrators	1
Counselors	1
Teachers	1

Location

Nebraska	4
Tennessee	4
Europe	3
Thailand	3
Germany	2
Netherlands	2
New Jersey	2
New Mexico	2
Nigeria	2
Turkey	2
United Kingdom	2
California	1
Canada	1
China	1
European Union	1
Illinois	1
Maryland	1
Massachusetts	1
South Africa	1
Taiwan	1
United Kingdom (London)	1
More ▼

Laws, Policies, & Programs

Comprehensive Education…	2
No Child Left Behind Act 2001	2
Education Consolidation…	1
Individuals with Disabilities…	1

Assessments and Surveys

National Assessment of…	8
Advanced Placement…	2
National Teacher Examinations	2
ACT Assessment	1
Massachusetts Comprehensive…	1
Praxis Series	1
Test of English as a Foreign…	1
Test of English for…	1
United States Medical…	1

What Works Clearinghouse Rating

Test Items X

Showing 31 to 45 of 145 results Save | Export

"Quality Testing Standards" -- A Starter Kit for States. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…

Descriptors: Testing, Standards, Comparative Analysis, Test Content

The Effect of Rating Unfamiliar Items on Angoff Passing Scores

Peer reviewed

Direct link

Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017

The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…

Descriptors: Scores, Item Analysis, Classification, Decision Making

Mapping the CU-TEP to the Common European Framework of Reference (CEFTR)

Peer reviewed
PDF on ERIC

Download full text

Wudthayagorn, Jirada – LEARN Journal: Language Education and Acquisition Research Network, 2018

The purpose of this study was to map the Chulalongkorn University Test of English Proficiency, or the CU-TEP, to the Common European Framework of Reference (CEFR) by employing a standard setting methodology. Thirteen experts judged 120 items of the CU-TEP using the Yes/No Angoff technique. The experts decided whether or not a borderline student at…

Descriptors: Guidelines, Rating Scales, English (Second Language), Language Tests

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Direct link

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests

Using Standard Setting to Promote Meaningful Use of Mathematics Assessment Data within Initial Teacher Education Programmes

Peer reviewed
PDF on ERIC

Download full text

Moloi, Qetelo M.; Kanjee, Anil; Roberts, Nicky – Pythagoras, 2019

Within initial teacher education there is increasing pressure to enhance the use of assessment data to support students to improve their knowledge and skills, and to determine what standards they meet upon graduation. For such data to be useful, both programme designers and students require meaningful and comprehensive assessment reports on…

Descriptors: Preservice Teacher Education, Teacher Education Programs, Standard Setting, Mathematics Tests

Modeling for Directly Setting Theory-Based Performance Levels

Peer reviewed
PDF on ERIC

Download full text

Torres Irribarra, David; Diakow, Ronli; Freund, Rebecca; Wilson, Mark – Grantee Submission, 2015

This paper presents the Latent Class Level-PCM as a method for identifying and interpreting latent classes of respondents according to empirically estimated performance levels. The model, which combines elements from latent class models and reparameterized partial credit models for polytomous data, can simultaneously (a) identify empirical…

Descriptors: Item Response Theory, Test Items, Statistical Analysis, Models

Getting Lucky: How Guessing Threatens the Validity of Performance Classifications

Peer reviewed
PDF on ERIC

Download full text

Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016

There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…

Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction

Spring 2020 NSCAS General Summative ELA, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2020

The Spring 2020 Nebraska Student-Centered Assessment System (NSCAS) General Summative testing was cancelled due to COVID-19. This technical report documents the processes and procedures that had been implemented to support the Spring 2020 assessments prior to the cancellation. The following sections are presented in this technical report: (1)…

Descriptors: English, Language Arts, Mathematics Tests, Science Tests

Effect of Content Knowledge on Angoff-Style Standard Setting Judgments

Peer reviewed

Direct link

Margolis, Melissa J.; Mee, Janet; Clauser, Brian E.; Winward, Marcia; Clauser, Jerome C. – Educational Measurement: Issues and Practice, 2016

Evidence to support the credibility of standard setting procedures is a critical part of the validity argument for decisions made based on tests that are used for classification. One area in which there has been limited empirical study is the impact of standard setting judge selection on the resulting cut score. One important issue related to…

Descriptors: Academic Standards, Standard Setting (Scoring), Cutting Scores, Credibility

Evaluating the Consistency of Angoff-Based Cut Scores Using Subsets of Items within a Generalizability Theory Framework

Peer reviewed

Direct link

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015

The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine if a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…

Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items

Spring 2019 NSCAS Summative ELA, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2019

This technical report documents the processes and procedures implemented to support the Spring 2019 Nebraska Student-Centered Assessment System (NSCAS) General Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA® under the supervision of the Nebraska Department of Education (NDE). The technical report shows how the…

Descriptors: English, Language Arts, Summative Evaluation, Mathematics Tests

Evaluating the Operational Feasibility of Using Subsets of Items to Recommend Minimal Competency Cut Scores

Peer reviewed

Direct link

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J. – Applied Measurement in Education, 2015

Establishing cut scores using the Angoff method requires panelists to evaluate every item on a test and make a probability judgment. This can be time-consuming when there are large numbers of items on the test. Previous research using resampling studies suggest that it is possible to recommend stable Angoff-based cut score estimates using a…

Descriptors: Cutting Scores, Test Items, Standard Setting (Scoring), Feasibility Studies

Using Student Ability and Item Difficulty for Making Defensible Pass/Fail Decisions for Borderline Grades

Peer reviewed
PDF on ERIC

Download full text

Shulruf, Boaz; Jones, Phil; Turner, Rolf – Higher Education Studies, 2015

The determination of Pass/Fail decisions over Borderline grades, (i.e., grades which do not clearly distinguish between the competent and incompetent examinees) has been an ongoing challenge for academic institutions. This study utilises the Objective Borderline Method (OBM) to determine examinee ability and item difficulty, and from that…

Descriptors: Undergraduate Students, Pass Fail Grading, Decision Making, Probability

What Is Essential in Standard Setting and Construct Maps? Commentary on Adam E. Wyse's "Construct Maps as a Foundation for Standard Setting"

Peer reviewed

Direct link

Schulz, E. Matthew – Measurement: Interdisciplinary Research and Perspectives, 2013

In this article, E. Matthew Schulz responds to Adam Wyse's article, "Construct Maps as a Foundation for Standard Setting." In doing so, he asserts that one of the most important ideas in Wyse's work is that information used in standard setting needs to be better represented through the use of graphics. However, he's not…

Descriptors: Standard Setting (Scoring), Maps, Item Response Theory, Test Items

The Impact of Social Comparison on the Judgment-Based Angoff Method

Direct link

Sorensen, Henry L. – ProQuest LLC, 2013

Cut-score setting processes are used to establish the passing standards for all kinds of tests in education and for credentialing. While experts use their best efforts to guide cut-score setting processes to generate valid and reliable results, cut-score participants often have a difficult time understanding the standard at which the cut score is…

Descriptors: Cutting Scores, Standard Setting (Scoring), Comparative Analysis, Difficulty Level

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Applied Measurement in…	14
Educational Measurement:…	8
Journal of Educational…	8
Educational and Psychological…	7
Practical Assessment,…	5
Nebraska Department of…	4
International Journal of…	3
Journal of Applied Testing…	3
National Assessment Governing…	3
New Meridian Corporation	3
ProQuest LLC	3
Eurasian Journal of…	2
Evaluation and the Health…	2
International Journal of…	2
Journal of Educational and…	2
Language Assessment Quarterly	2
New Mexico Public Education…	2
Online Submission	2
Advances in Health Sciences…	1
Alberta Journal of…	1
Applied Psychological…	1
Assessment & Evaluation in…	1
College Board	1
College and University	1
Education Next	1
More ▼

Plake, Barbara S.	12
Impara, James C.	6
Hambleton, Ronald K.	5
Wyse, Adam E.	5
Buckendahl, Chad W.	4
Chang, Lei	4
Ferdous, Abdullah A.	4
Clauser, Brian E.	3
Davis-Becker, Susan L.	3
Kannan, Priya	3
Margolis, Melissa J.	3
Reckase, Mark D.	3
Tannenbaum, Richard J.	3
Wind, Stefanie A.	3
Babcock, Ben	2
Baldwin, Peter	2
Beretvas, S. Natasha	2
Bichi, Ado Abdu	2
Bowman, Harry L.	2
Clauser, Jerome C.	2
Engelhard, George, Jr.	2
Galindo, Jennifer	2
Gerrow, Jack	2
Harsch, Claudia	2
More ▼