Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 13 |
Descriptor
| Educational Assessment | 63 |
| Test Validity | 63 |
| Testing Problems | 63 |
| Elementary Secondary Education | 28 |
| Test Construction | 23 |
| Evaluation Methods | 21 |
| Test Reliability | 20 |
| Educational Testing | 17 |
| Psychometrics | 16 |
| Testing Programs | 16 |
| Criterion Referenced Tests | 15 |
| More ▼ | |
Source
Author
| Thurlow, Martha | 4 |
| Bielinski, John | 2 |
| Farr, Roger | 2 |
| Hurley, Christine | 2 |
| Minnema, Jane | 2 |
| Spicuzza, Richard | 2 |
| Alonzo, Alicia C. | 1 |
| Backlund, Philip M. | 1 |
| Baird, Jo-Anne | 1 |
| Bollwark, John | 1 |
| Buckman, Dana T. | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 12 |
| Elementary Education | 2 |
Audience
| Practitioners | 3 |
| Administrators | 1 |
| Researchers | 1 |
| Teachers | 1 |
Location
| United Kingdom (England) | 3 |
| Florida | 1 |
| Minnesota | 1 |
| United Kingdom | 1 |
| United Kingdom (Great Britain) | 1 |
| United Kingdom (Wales) | 1 |
| United States | 1 |
Laws, Policies, & Programs
| Improving Americas Schools… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Newton, Paul E. – Educational Measurement: Issues and Practice, 2020
Educational assessment involves eliciting, transmitting, and receiving information concerning the level of proficiency of a learner in a specified domain. With that in mind, it is perhaps surprising that the literature seems to make very little use of the signal processing metaphor. The present article begins by making a general case for greater…
Descriptors: Educational Assessment, Student Evaluation, Evaluative Thinking, Test Validity
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Kozloff, Allison Burstein – ProQuest LLC, 2009
Comprehensive academic achievement tests are routinely used by school psychologists in psycho-educational assessment batteries to identify learning disabled students. A variety of assessment measures are used across age groups to determine if a discrepancy exists between academic achievement and intellectual functioning; however, among the most…
Descriptors: Intelligence, Educational Assessment, Academic Achievement, Achievement Tests
Stones, Edgar – Forum for the Discussion of New Trends in Education, 1979
The author points out the technical shortcomings inherent in traditional examinations designed to sort students and outlines more useful testing alternatives. He feels that, unfortunately, the Assessment of Performance Unit will opt for the traditional style of testing, in the name of "maintaining standards." (SJL)
Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, Measurement Objectives
PDF pending restorationWheeler, Patricia H. – 1995
When individuals are given tests that are too hard or too easy, the resulting scores are likely to be poor estimates of their performance. To get valid and accurate test scores that provide meaningful results, one should use functional-level testing (FLT). FLT is the practice of administering to an individual a version of a test with a difficulty…
Descriptors: Adaptive Testing, Difficulty Level, Educational Assessment, Performance
Peer reviewedFarr, Roger; Greene, Beth – Educational Horizons, 1993
A review of public demand for accountability uncovers three types of educational assessment problems: demand for valid reading measures, need for a broader range of assessments, and value of assessments for various audiences. Integration of the various types of assessments is recommended. (SK)
Descriptors: Accountability, Educational Assessment, Political Influences, Reading Tests
Hambleton, Ronald K.; Bollwark, John – 1991
The validity of results from international assessments depends on the correctness of the test translations. If the tests presented in one language are more or less difficult because of the manner in which they are translated, the validity of any interpretation of the results can be questioned. Many test translation methods exist in the literature,…
Descriptors: Cultural Differences, Educational Assessment, English, Foreign Countries
Mathis, William J. – 1975
This paper was presented with other papers in a forum dealing with statewide testing programs. The primary purpose of the paper is to address practical considerations and methods of resolution for large districts or states who are planning on conducting large scale testing or assessment programs with criterion or performance referenced measures.…
Descriptors: Criterion Referenced Tests, Educational Assessment, School Districts, State Programs
Lutz, William – 1983
After an extensive review of the available research on large-scale writing assessment, certain issues in writing assessment seem to be unresolved, and still other issues are not supported by adequate research. This paper reviews the basic issues in writing assessment, points out which topics are supported by strong research, and which topics are…
Descriptors: Educational Assessment, Essay Tests, Higher Education, Multiple Choice Tests
Graham, Darol L. – 1974
The adequacy of a test developed for statewide assessment of basic mathematics skills was investigated. The test, comprised of multiple-choice items reflecting a series of behavioral objectives, was compared with a more extensive criterion measure generated from the same objectives by the application of a strict item sampling model. In many…
Descriptors: Comparative Testing, Criterion Referenced Tests, Educational Assessment, Item Sampling
DiBello, Lou; Stout, William – Measurement: Interdisciplinary Research and Perspectives, 2007
In this article, the authors provide their critique on a set of papers that investigated Mathematics Knowledge for Teachers (MKT) assessment and the underlying theory and characteristics of the validity enterprise. Three types of assumptions and inferences--elemental, structural, and ecological--are discussed in these papers. These assumptions…
Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research
Ferrara, Steve – Measurement: Interdisciplinary Research and Perspectives, 2007
In this issue of Measurement: Interdisciplinary Research and Perspectives, Schilling et al. are explicit about the centrality of assessment design and development and psychometric analysis in validation. Schilling and colleagues, Kane (2004, 2006), other contemporary validity theorists and practitioners, and their predecessors typically discuss…
Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research
Farr, Roger; Roser, Nancy – 1974
This article presents views of proponents and opponents to standardized tests, isolates the major weakness of testing--questionable validity--and offers several recommendations for the betterment of test development and use. Some major misuses of tests include the following: (a) tests are at times administered with no clear purpose; (b) test…
Descriptors: Accountability, Criterion Referenced Tests, Educational Assessment, Educational Testing
Peer reviewedWaddell, Deborah D. – Journal of School Psychology, 1980
A review of the technical data available on the 1972 norms edition of the Stanford-Binet demonstrates how inadequate these data are. The Stanford-Binet should not continue to be used in important decision making processes unless this weakness is corrected. (Author)
Descriptors: Educational Assessment, Elementary Secondary Education, Intelligence Quotient, Intelligence Tests

Direct link
