Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 3 |
Descriptor
| High Stakes Tests | 7 |
| Test Reliability | 7 |
| Test Use | 7 |
| Test Validity | 5 |
| Test Construction | 4 |
| Academic Achievement | 2 |
| Accountability | 2 |
| Decision Making | 2 |
| Educational Assessment | 2 |
| Legal Problems | 2 |
| Achievement Tests | 1 |
Source
| Applied Measurement in… | 1 |
| Educational Measurement:… | 1 |
| Language Testing in Asia | 1 |
| Teachers College Record | 1 |
Author
| Mehrens, William A. | 2 |
| Attali, Yigal | 1 |
| Casey, Leo M. | 1 |
| Koretz, Daniel M. | 1 |
| Piggin, Gabrielle | 1 |
| Popham, W. James | 1 |
| Reckase, Mark D. | 1 |
Publication Type
| Reports - Evaluative | 5 |
| Journal Articles | 4 |
| Reports - Research | 2 |
| Speeches/Meeting Papers | 2 |
Education Level
| Elementary Secondary Education | 1 |
Location
| New York | 1 |
Attali, Yigal – Educational Measurement: Issues and Practice, 2019
Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…
Descriptors: Evaluators, Certification, High Stakes Tests, Scoring
Casey, Leo M. – Teachers College Record, 2013
Background/Context: There is a deep and yawning chasm between the world of tests and testing practices as they ought to be and the actual tests and testing practices now imposed on American students, educators, and schools. That chasm of theory and practice is a function of the dominant paradigm of educational reform, with its theory of action…
Descriptors: Educational Change, Commercialization, Models, Test Use
Piggin, Gabrielle – Language Testing in Asia, 2011
This paper aims to analyse the fit between the "use" and "usefulness" of the EIKEN Grade 1 test as a valid and reliable instrument to measure second language proficiency. The recent re-positioning of the EIKEN Grade 1 test as an internationally recognised higher stakes test of English proficiency, which allows successful test…
Descriptors: Language Tests, Second Language Learning, Language Proficiency, High Stakes Tests
Reckase, Mark D. – 1997
This paper argues that special procedures for constructing assessment tools containing performance assessment tasks are unnecessary and that current test methodology can easily be generalized to complex performance assessment tasks without destroying the desirable characteristics of those tasks. Reasonable statistical requirements for sound…
Descriptors: Educational Assessment, Generalizability Theory, High Stakes Tests, Interrater Reliability
Mehrens, William A. – 1993
As the first paper in a series of policy papers on high-stakes student assessment programs, this paper examined high school graduation tests. High stakes refers to the use of test results to make important decisions about the test taker. Whether to use a high school graduation test is an essential policy question that will be addressed in a…
Descriptors: Administration, Educational Assessment, Educational Finance, Educational Planning
Mehrens, William A.; Popham, W. James – Applied Measurement in Education, 1992 (peer reviewed)
This paper discusses how to determine whether a test was developed in a legally defensible manner, reviewing general issues, specific cases bearing on different types of test use, some evaluative dimensions, and evidence of test quality. Tests constructed and used according to existing standards will generally stand legal scrutiny. (SLD)
Descriptors: College Entrance Examinations, Compliance (Legal), Constitutional Law, Court Litigation
Koretz, Daniel M.; And Others – 1991
Detailed evidence is presented about the extent of generalization from high-stakes tests to other tests and about the instructional effects of high-stakes testing. Data are from grade 3 of a large, high-poverty urban district with large numbers of Black and Hispanic American students. The district's results in 1990 for two tests, designated Test B…
Descriptors: Academic Achievement, Accountability, Achievement Tests, Black Students

