Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 2 |
Descriptor
| Scoring | 39 |
| Test Validity | 39 |
| Testing Problems | 39 |
| Test Reliability | 22 |
| Test Construction | 11 |
| Testing | 9 |
| Test Bias | 8 |
| Test Interpretation | 8 |
| Higher Education | 7 |
| Scores | 7 |
| Multiple Choice Tests | 6 |
| More ▼ | |
Source
Author
| Weiss, David J. | 2 |
| Allen, Nancy L. | 1 |
| Alliger, R. J. | 1 |
| Bessa, Nicia M. | 1 |
| Brown, Frederick G. | 1 |
| Burns, Marilyn | 1 |
| CONRY, JULIANNE JOYCE | 1 |
| Cohen, Arie | 1 |
| Cowie, Colin | 1 |
| Creamer, Don G. | 1 |
| DeGeorge, George P. | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
| Parents | 1 |
| Practitioners | 1 |
| Researchers | 1 |
Location
| Brazil | 1 |
| Haiti | 1 |
| Israel | 1 |
| Netherlands | 1 |
| United States | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Assessments and Surveys
| SAT (College Admission Test) | 2 |
| Armed Services Vocational… | 1 |
| California Psychological… | 1 |
| International English… | 1 |
| Test of English as a Foreign… | 1 |
| Torrance Tests of Creative… | 1 |
What Works Clearinghouse Rating
Hoang, Ngoc Thi Huyen – Language Education & Assessment, 2019
As validity pertains to test use rather than the test itself, using a test for unintended purposes requires a new validation program using additional evidence from relevant sources. This small-scale study contributes to the validation of the use of originally academic language tests--the International English Language Testing System and the Test…
Descriptors: Language Tests, Immigrants, Immigration, Testing Problems
Henning, Grant – English Teaching Forum, 2012
To some extent, good testing procedure, like good language use, can be achieved through avoidance of errors. Almost any language-instruction program requires the preparation and administration of tests, and it is only to the extent that certain common testing mistakes have been avoided that such tests can be said to be worthwhile selection,…
Descriptors: Testing, English (Second Language), Testing Problems, Student Evaluation
Osborn, William C. – 1977
Four essential dimensions of a performance test are detailed: directness of test method, type of criterion, standardization of conditions, and objectivity of scoring. For simplicity these factors are described as if each were dichotomous, when in actuality each is a continuum; a test method may be more or less direct, conditions more or less…
Descriptors: Performance Tests, Scoring, Test Reliability, Test Validity
Peer reviewedTuckman, Bruce W. – NASSP Bulletin, 1993
Essay tests are easily constructed, relatively valid assessments of higher cognitive processes but are harder to score reliably. Teachers using essay tests are advised to follow clearly designed objectives, construct all-inclusive, pilot-tested questions, develop a checklist of specific scoring points and a model answer for each question, and use…
Descriptors: Essay Tests, Multiple Choice Tests, Scoring, Secondary Education
Alliger, R. J.; Harvey, A. L. – 1984
This article discusses practical and theoretical problems related to the measurement of formal operations. The first section of the article discusses problems in measuring formal operations using the clinical interview method. These problems include the lack of both a standardized interview and a uniform scoring procedure. Section two discusses…
Descriptors: Developmental Stages, Group Testing, Interviews, Objective Tests
PDF pending restorationWheeler, Patricia H. – 1995
When individuals are given tests that are too hard or too easy, the resulting scores are likely to be poor estimates of their performance. To get valid and accurate test scores that provide meaningful results, one should use functional-level testing (FLT). FLT is the practice of administering to an individual a version of a test with a difficulty…
Descriptors: Adaptive Testing, Difficulty Level, Educational Assessment, Performance
Cowie, Colin – 1977
Certain testing procedures will overcome some of the problems associated with the use of essay tests. Essay tests may not validly indicate achievement because the questions included in the test may not fairly represent instructional content. Reliability may be a problem because of variations in examinee response in different situations, in test…
Descriptors: Achievement Tests, Essay Tests, Guides, Scoring
Peer reviewedQuellmalz, Edys S. – Educational Measurement: Issues and Practice, 1984
A summary of the writing assessment programs reviewed in this journal is presented. The problems inherent in the programs are outlined. A coordinated research program on major problems in writing assessment is proposed as being beneficial and cost-effective. (DWH)
Descriptors: Essay Tests, Program Evaluation, Scoring, State Programs
George Washington Univ., Washington, DC. Inst. for Educational Leadership. – 1980
The transcript of a six-part National Public Radio broadcast on standardized testing is presented. The first part focuses on the reasons tests are administered; these reasons are discussed by proponents and opponents of testing. Part Two contains a discussion of the possible bias of tests, and their validity. The third part discusses the…
Descriptors: College Entrance Examinations, Scoring, Standardized Tests, Student Attitudes
Peer reviewedCohen, Arie; Farley, Frank H. – Educational and Psychological Measurement, 1977
Cross-cultural validity studies for psychological instruments may result in overestimation of structure invariance due to some items being scored on more than one scale. This problem, called the common-item effect, is investigated with some data from the literature. (JKS)
Descriptors: Cross Cultural Studies, Factor Analysis, Item Sampling, Multidimensional Scaling
Haenn, Joseph F. – 1981
Procedures for conducting functional level testing have been available for use by practitioners for some time. However, the Title I Evaluation and Reporting System (TIERS), developed in response to the educational amendments of 1974 to the Elementary and Secondary Education Act (ESEA), has provided the impetus for widespread adoption of this…
Descriptors: Achievement Tests, Difficulty Level, Scores, Scoring
Peer reviewedvan der Linden, Wim J. – Review of Educational Research, 1981
Using criterion-referenced test item data collected in an empirical study, differences in item selection between Cox and Vargas' pretest-posttest validity index and a latent trait approach (evaluation of the item information function for the mastery score) are analyzed. (Author/GK)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Foreign Countries, Latent Trait Theory
Peer reviewedDoromal, Quintin S., Jr.; Creamer, Don G. – Journal of College Student Development, 1988
Investigated certain measurement properties of the Ethical Judgment Scale. Results revealed findings of questionable validity and unacceptably low reliability for the instrument even though three different scoring methods were used in the analysis. (Author)
Descriptors: Counseling, Data Analysis, Decision Making, Ethics
Ebel, Robert L. – 1973
True-false achievement test items written by typical classroom teachers show about two-thirds of the discrimination of their multiple-choice test items. This is about what should be expected in view of the higher probability of chance success on the true-false items. However, at least half again as many true-false items as multiple-choice items…
Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Scoring
Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling


