ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	13

Descriptor

Educational Assessment	63
Test Validity	63
Testing Problems	63
Elementary Secondary Education	28
Test Construction	23
Evaluation Methods	21
Test Reliability	20
Educational Testing	17
Psychometrics	16
Testing Programs	16
Criterion Referenced Tests	15
Standardized Tests	15
Student Evaluation	15
Achievement Tests	14
Measurement Techniques	14
Test Interpretation	14
Test Bias	12
Evaluation Problems	11
Test Use	11
Mathematics Education	10
Teacher Evaluation	10
Knowledge Base for Teaching	9
Mathematics Instruction	9
Measurement	9
Norm Referenced Tests	9
More ▼

Source

Measurement:…	11
Educational Measurement:…	3
Journal of Educational…	2
Applied Measurement in…	1
Educational Horizons	1
English Education	1
Focus on NAEP	1
Forum for the Discussion of…	1
Journal of School Psychology	1
ProQuest LLC	1

Publication Type

Opinion Papers	22
Journal Articles	21
Reports - Evaluative	15
Speeches/Meeting Papers	12
Information Analyses	8
Reports - Research	8
Reports - Descriptive	5
Books	4
Collected Works - Proceedings	3
Guides - Non-Classroom	3
Reference Materials -…	2
Book/Product Reviews	1
Collected Works - General	1
Collected Works - Serials	1
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
More ▼

Education Level

Elementary Secondary Education	12
Elementary Education	2

Audience

Practitioners	3
Administrators	1
Researchers	1
Teachers	1

Location

United Kingdom (England)	3
Florida	1
Minnesota	1
United Kingdom	1
United Kingdom (Great Britain)	1
United Kingdom (Wales)	1
United States	1

Laws, Policies, & Programs

Improving Americas Schools…

Assessments and Surveys

National Assessment of…	4
SAT (College Admission Test)	3
Advanced Placement…	1
Florida State Student…	1
Kaufman Test of Educational…	1
Metropolitan Achievement Tests	1
Stanford Binet Intelligence…	1
Wechsler Individual…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 63 results Save | Export

Deficiency, Contamination, and the Signal Processing Metaphor

Peer reviewed

Direct link

Newton, Paul E. – Educational Measurement: Issues and Practice, 2020

Educational assessment involves eliciting, transmitting, and receiving information concerning the level of proficiency of a learner in a specified domain. With that in mind, it is perhaps surprising that the literature seems to make very little use of the signal processing metaphor. The present article begins by making a general case for greater…

Descriptors: Educational Assessment, Student Evaluation, Evaluative Thinking, Test Validity

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

A Review of Academic Achievement Tests: Recommendations for Age Appropriate Administration

Direct link

Kozloff, Allison Burstein – ProQuest LLC, 2009

Comprehensive academic achievement tests are routinely used by school psychologists in psycho-educational assessment batteries to identify learning disabled students. A variety of assessment measures are used across age groups to determine if a discrepancy exists between academic achievement and intellectual functioning; however, among the most…

Descriptors: Intelligence, Educational Assessment, Academic Achievement, Achievement Tests

The World of APU.

Stones, Edgar – Forum for the Discussion of New Trends in Education, 1979

The author points out the technical shortcomings inherent in traditional examinations designed to sort students and outlines more useful testing alternatives. He feels that, unfortunately, the Assessment of Performance Unit will opt for the traditional style of testing, in the name of "maintaining standards." (SJL)

Descriptors: Achievement Tests, Educational Assessment, Elementary Secondary Education, Measurement Objectives

Functional-Level Testing: A "Must" for Valid and Accurate Assessment Results. EREAPA Publication Series No. 95-2.

PDF pending restoration

Wheeler, Patricia H. – 1995

When individuals are given tests that are too hard or too easy, the resulting scores are likely to be poor estimates of their performance. To get valid and accurate test scores that provide meaningful results, one should use functional-level testing (FLT). FLT is the practice of administering to an individual a version of a test with a difficulty…

Descriptors: Adaptive Testing, Difficulty Level, Educational Assessment, Performance

Improving Reading Assessments: Understanding the Social and Political Agenda for Testing.

Peer reviewed

Farr, Roger; Greene, Beth – Educational Horizons, 1993

A review of public demand for accountability uncovers three types of educational assessment problems: demand for valid reading measures, need for a broader range of assessments, and value of assessments for various audiences. Integration of the various types of assessments is recommended. (SK)

Descriptors: Accountability, Educational Assessment, Political Influences, Reading Tests

Adapting Tests for Use in Different Cultures: Technical Issues and Methods.

Download full text

Hambleton, Ronald K.; Bollwark, John – 1991

The validity of results from international assessments depends on the correctness of the test translations. If the tests presented in one language are more or less difficult because of the manner in which they are translated, the validity of any interpretation of the results can be questioned. Many test translation methods exist in the literature,…

Descriptors: Cultural Differences, Educational Assessment, English, Foreign Countries

Large-Scale Objective Referenced Testing: Some Practical Problems and Concerns.

Download full text

Mathis, William J. – 1975

This paper was presented with other papers in a forum dealing with statewide testing programs. The primary purpose of the paper is to address practical considerations and methods of resolution for large districts or states who are planning on conducting large scale testing or assessment programs with criterion or performance referenced measures.…

Descriptors: Criterion Referenced Tests, Educational Assessment, School Districts, State Programs

What We Know and Don't Know: Needed Research in Writing Assessment.

Lutz, William – 1983

After an extensive review of the available research on large-scale writing assessment, certain issues in writing assessment seem to be unresolved, and still other issues are not supported by adequate research. This paper reviews the basic issues in writing assessment, points out which topics are supported by strong research, and which topics are…

Descriptors: Educational Assessment, Essay Tests, Higher Education, Multiple Choice Tests

An Examination of the Feasibility of Using Criterion-Referenced Measurement in Large-Scale, Survey Testing Situations.

Download full text

Graham, Darol L. – 1974

The adequacy of a test developed for statewide assessment of basic mathematics skills was investigated. The test, comprised of multiple-choice items reflecting a series of behavioral objectives, was compared with a more extensive criterion measure generated from the same objectives by the application of a strict item sampling model. In many…

Descriptors: Comparative Testing, Criterion Referenced Tests, Educational Assessment, Item Sampling

A Practical and Prescriptive Approach to Validity--Commentary

Peer reviewed

Direct link

DiBello, Lou; Stout, William – Measurement: Interdisciplinary Research and Perspectives, 2007

In this article, the authors provide their critique on a set of papers that investigated Mathematics Knowledge for Teachers (MKT) assessment and the underlying theory and characteristics of the validity enterprise. Three types of assumptions and inferences--elemental, structural, and ecological--are discussed in these papers. These assumptions…

Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research

Our Field Needs a Framework to Guide Development of Validity Research Agendas and Identification of Validity Research Questions and Threats to Validity

Peer reviewed

Direct link

Ferrara, Steve – Measurement: Interdisciplinary Research and Perspectives, 2007

In this issue of Measurement: Interdisciplinary Research and Perspectives, Schilling et al. are explicit about the centrality of assessment design and development and psychometric analysis in validation. Schilling and colleagues, Kane (2004, 2006), other contemporary validity theorists and practitioners, and their predecessors typically discuss…

Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research

Issues and Problems Concerning Reading Assessment. Teacher Education Forum Series. Vol. 2, No. 11.

Farr, Roger; Roser, Nancy – 1974

This article presents views of proponents and opponents to standardized tests, isolates the major weakness of testing--questionable validity--and offers several recommendations for the betterment of test development and use. Some major misuses of tests include the following: (a) tests are at times administered with no clear purpose; (b) test…

Descriptors: Accountability, Criterion Referenced Tests, Educational Assessment, Educational Testing

The Stanford-Binet: An Evaluation of the Technical Data Available Since the 1972 Restandardization.

Peer reviewed

Waddell, Deborah D. – Journal of School Psychology, 1980

A review of the technical data available on the 1972 norms edition of the Stanford-Binet demonstrates how inadequate these data are. The Stanford-Binet should not continue to be used in important decision making processes unless this weakness is corrected. (Author)

Descriptors: Educational Assessment, Elementary Secondary Education, Intelligence Quotient, Intelligence Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Thurlow, Martha	4
Bielinski, John	2
Farr, Roger	2
Hurley, Christine	2
Minnema, Jane	2
Spicuzza, Richard	2
Alonzo, Alicia C.	1
Backlund, Philip M.	1
Baird, Jo-Anne	1
Bollwark, John	1
Buckman, Dana T.	1
Cervantes, Robert A.	1
Coleman, Arthur L.	1
Collins, Allan	1
DiBello, Lou	1
Ebel, Robert L.	1
El Sawaf, Hamdy	1
Engelhard, George, Jr.	1
Ercikan, Kadriye	1
Erickson, Ronald	1
Farmelo, David A.	1
Ferrara, Steve	1
Fitzpatrick, Anne R.	1
Forsyth, Robert A.	1
More ▼