Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 11 |
Descriptor
| Standard Setting (Scoring) | 54 |
| Test Validity | 54 |
| Elementary Secondary Education | 21 |
| Test Reliability | 21 |
| Cutting Scores | 20 |
| Minimum Competency Testing | 18 |
| Test Construction | 15 |
| State Standards | 13 |
| Testing Programs | 13 |
| Criterion Referenced Tests | 12 |
| Higher Education | 12 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Elementary Secondary Education | 3 |
| Early Childhood Education | 2 |
| Elementary Education | 2 |
| Middle Schools | 2 |
| Secondary Education | 2 |
| Adult Education | 1 |
| Grade 2 | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 5 | 1 |
| High School Equivalency… | 1 |
| More ▼ | |
Audience
| Policymakers | 2 |
| Practitioners | 1 |
| Researchers | 1 |
Location
| Tennessee | 6 |
| Arizona | 2 |
| Kansas | 2 |
| Massachusetts | 2 |
| Nebraska | 2 |
| Nevada | 2 |
| North Carolina | 2 |
| Arkansas | 1 |
| California | 1 |
| Colorado | 1 |
| Delaware | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Comprehensive Education… | 3 |
| No Child Left Behind Act 2001 | 2 |
| Lau v Nichols | 1 |
Assessments and Surveys
| National Teacher Examinations | 8 |
| National Assessment of… | 4 |
| General Educational… | 1 |
| Massachusetts Comprehensive… | 1 |
| Pre Professional Skills Tests | 1 |
| Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020
Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…
Descriptors: Test Content, Test Items, Discussion, Test Validity
Sondergeld, Toni A.; Stone, Gregory E.; Kruse, Lance M. – Educational Policy, 2020
Assessment and evaluation at all levels of educational systems have become policy priorities for many countries. Two common reasons for this are student learning expectations and accountability. Although much effort has been put into the creation and refinement of content standards, standardized tests, and methods for using testing results, there…
Descriptors: Standard Setting (Scoring), Criterion Referenced Tests, Multiple Choice Tests, Student Evaluation
Nebraska Department of Education, 2024
The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…
Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students
Papageorgiou, Spiros; Tannenbaum, Richard J. – Language Assessment Quarterly, 2016
Although there has been substantial work on argument-based approaches to validation as well as standard-setting methodologies, it might not always be clear how standard setting fits into argument-based validity. The purpose of this article is to address this lack in the literature, with a specific focus on topics related to argument-based…
Descriptors: Standard Setting (Scoring), Language Tests, Test Validity, Test Construction
Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016
There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…
Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
GED Testing Service, 2014
This manual was written to provide technical information regarding the General Educational Development (GED®) test as evidence that the GED® test is technically sound. Throughout this manual, documentation is provided regarding the development of the GED® test and data collection activities, as well as evidence of reliability and validity. This…
Descriptors: High School Equivalency Programs, Equivalency Tests, Testing Programs, Test Validity
Morgan, Deanna L. – National Center for Postsecondary Research, 2010
Cut scores are used in a variety of circumstances to aid in decision making through the establishment of a clear cut line between adjacent categories. Community colleges regularly use cut scores on placement tests to decide the appropriate course for each beginning student: the first college-level course or a developmental course, depending on…
Descriptors: Standard Setting (Scoring), Cutting Scores, Psychometrics, Best Practices
Florez, Ida Rose – Civil Rights Project / Proyecto Derechos Civiles, 2010
The Arizona English Language Learners Assessment (AZELLA) is used by the Arizona Department of Education to determine which children should receive English support services. AZELLA results are used to determine if children are either proficient in English or have English language skills in one of four pre-proficient categories (pre-emergent,…
Descriptors: Validity, Second Language Learning, Cutting Scores, Kindergarten
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Haertel, Edward H.; Lorie, William A. – Measurement: Interdisciplinary Research and Perspectives, 2004
Standards-based score reports interpret test performance with reference to cut scores defining categories like "below basic," "proficient," or "master." This article first develops a conceptual framework for validity arguments supporting such interpretations, then presents three applications. Two of these serve to introduce new standard-setting…
Descriptors: Scores, Test Interpretation, Test Validity, Standard Setting (Scoring)
Peer reviewedHamilton, J. S.; McLone, R. R. – Studies in Educational Evaluation, 1989
Influences on the educational validity of examinations are reviewed. Changes occurring in approaches to standard setting are traced. A view of reliability is presented, with emphasis on assessment of project work, which often involves individual investigation and design by students. A consistency index formula for grading standards is presented.…
Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Standard Setting (Scoring)
Peer reviewedJournal of School Improvement, 2000
States that standard scores are the numerical universal language for reporting and comparisons. Discusses what standard scores are, specifically, and why they are used, along with how the conversion assessment of raw scores to standard scores is accomplished. Provides contact information for those who would like to further their knowledge on the…
Descriptors: Educational Practices, Elementary Secondary Education, Higher Education, Standard Setting (Scoring)
Lin, Jie – Alberta Journal of Educational Research, 2006
The Bookmark standard-setting procedure was developed to address the perceived problems with the most popular method for setting cut-scores: the Angoff procedure (Angoff, 1971). The purposes of this article are to review the Bookmark procedure and evaluate it in terms of Berk's (1986) criteria for evaluating cut-score setting methods. The…
Descriptors: Standard Setting (Scoring), Cutting Scores, Evaluation Criteria, Evaluation Research

Direct link
