ERIC - Search Results

Showing 1 to 15 of 54 results

The Riddle Knowledge Inference Test (R-Kit)

Peer reviewed

Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025

Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…

Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability

Examining the Impact of a Consensus Approach to Content Alignment Studies

Peer reviewed

Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020

Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…

Descriptors: Test Content, Test Items, Discussion, Test Validity

Objective Standard Setting in Educational Assessment and Decision Making

Peer reviewed

Sondergeld, Toni A.; Stone, Gregory E.; Kruse, Lance M. – Educational Policy, 2020

Assessment and evaluation at all levels of educational systems have become policy priorities for many countries. Two common reasons for this are student learning expectations and accountability. Although much effort has been put into the creation and refinement of content standards, standardized tests, and methods for using testing results, there…

Descriptors: Standard Setting (Scoring), Criterion Referenced Tests, Multiple Choice Tests, Student Evaluation

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

Situating Standard Setting within Argument-Based Validity

Peer reviewed

Papageorgiou, Spiros; Tannenbaum, Richard J. – Language Assessment Quarterly, 2016

Although there has been substantial work on argument-based approaches to validation as well as standard-setting methodologies, it might not always be clear how standard setting fits into argument-based validity. The purpose of this article is to address this lack in the literature, with a specific focus on topics related to argument-based…

Descriptors: Standard Setting (Scoring), Language Tests, Test Validity, Test Construction

Getting Lucky: How Guessing Threatens the Validity of Performance Classifications

Peer reviewed

Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016

There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…

Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction

Spring 2018 NSCAS Summative ELA, Mathematics, and Science Technical Report

Nebraska Department of Education, 2018

The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…

Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests

Test Technical Manual 2014 GED® Test

GED Testing Service, 2014

This manual was written to provide technical information regarding the General Educational Development (GED®) test as evidence that the GED® test is technically sound. Throughout this manual, documentation is provided regarding the development of the GED® test and data collection activities, as well as evidence of reliability and validity. This…

Descriptors: High School Equivalency Programs, Equivalency Tests, Testing Programs, Test Validity

Best Practices for Setting Placement Cut Scores in Postsecondary Education. An NCPR Working Paper

Morgan, Deanna L. – National Center for Postsecondary Research, 2010

Cut scores are used in a variety of circumstances to aid in decision making through the establishment of a clear cut line between adjacent categories. Community colleges regularly use cut scores on placement tests to decide the appropriate course for each beginning student: the first college-level course or a developmental course, depending on…

Descriptors: Standard Setting (Scoring), Cutting Scores, Psychometrics, Best Practices

Do the AZELLA Cut Scores Meet the Standards? A Validation Review of Arizona English Language Learner Assessment

Florez, Ida Rose – Civil Rights Project / Proyecto Derechos Civiles, 2010

The Arizona English Language Learners Assessment (AZELLA) is used by the Arizona Department of Education to determine which children should receive English support services. AZELLA results are used to determine if children are either proficient in English or have English language skills in one of four pre-proficient categories (pre-emergent,…

Descriptors: Validity, Second Language Learning, Cutting Scores, Kindergarten

Technical Issues in Large-Scale Performance Assessment.

Phillips, Gary W., Ed. – 1996

Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…

Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics

Validating Standards-Based Test Score Interpretations

Peer reviewed

Haertel, Edward H.; Lorie, William A. – Measurement: Interdisciplinary Research and Perspectives, 2004

Standards-based score reports interpret test performance with reference to cut scores defining categories like "below basic," "proficient," or "master." This article first develops a conceptual framework for validity arguments supporting such interpretations, then presents three applications. Two of these serve to introduce new standard-setting…

Descriptors: Scores, Test Interpretation, Test Validity, Standard Setting (Scoring)

Education Validity and the Setting of Reliable Standards.

Peer reviewed

Hamilton, J. S.; McLone, R. R. – Studies in Educational Evaluation, 1989

Influences on the educational validity of examinations are reviewed. Changes occurring in approaches to standard setting are traced. A view of reliability is presented, with emphasis on assessment of project work, which often involves individual investigation and design by students. A consistency index formula for grading standards is presented.…

Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Standard Setting (Scoring)

What Is This Standard Score Stuff, Anyway?

Peer reviewed

Journal of School Improvement, 2000

States that standard scores are the numerical universal language for reporting and comparisons. Discusses what standard scores are, specifically, and why they are used, along with how the conversion assessment of raw scores to standard scores is accomplished. Provides contact information for those who would like to further their knowledge on the…

Descriptors: Educational Practices, Elementary Secondary Education, Higher Education, Standard Setting (Scoring)

The Bookmark Procedure for Setting Cut-Scores and Finalizing Performance Standards: Strengths and Weaknesses

Peer reviewed

Lin, Jie – Alberta Journal of Educational Research, 2006

The Bookmark standard-setting procedure was developed to address the perceived problems with the most popular method for setting cut-scores: the Angoff procedure (Angoff, 1971). The purposes of this article are to review the Bookmark procedure and evaluate it in terms of Berk's (1986) criteria for evaluating cut-score setting methods. The…

Descriptors: Standard Setting (Scoring), Cutting Scores, Evaluation Criteria, Evaluation Research
