NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)2
Since 2007 (last 20 years)7
Audience
Researchers2
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 31 to 45 of 53 results Save | Export
Wood, Robert – Evaluation in Education: International Progress, 1977
The author surveys literature and practice, primarily in Great Britain and the United States, about multiple-choice testing, comments on criticisms, and defends the state of the art. Varous item types, item writing, test instructions and scoring formulas, item analysis, and test construction are discussed. An extensive bibliography is appended.…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Scoring Formulas
Kagan, Barbara; Yamashita, June – 1976
Comprehensive Achievement Monitoring (CAM), developed at Stanford University in 1967, was introduced into a pre-algebra course in Moanalua High School in Hawaii to help teachers by providing information on the progress of individual students and of the total class. The computer package is used to correct each student's answer sheet and to prepare…
Descriptors: Achievement Rating, Computer Managed Instruction, Computer Oriented Programs, Course Objectives
Vale, C. David; Weiss, David J. – 1975
A conventional vocabulary test and two forms of a stradaptive vocabulary test were administered by a time-shared computer system to undergraduate college students. The two stradaptive tests differed in that one counted question mark responses (i.e., omitted items) as incorrect and the other ignored items responded to with question marks.…
Descriptors: Ability, Ability Grouping, Adaptive Testing, Branching
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Legg, Sue M. – 1982
A case study of the Florida Teacher Certification Examination (FTCE) program was described to assist others launching the development of large scale item banks. FTCE has four subtests: Mathematics, Reading, Writing, and Professional Education. Rasch calibrated item banks have been developed for all subtests except Writing. The methods used to…
Descriptors: Cutting Scores, Difficulty Level, Field Tests, Item Analysis
Sullivan, Arthur P. – 1978
Sullivan's Ethical Reasoning Scale contains three dilemmas with response pairs representing Kohlberg's stages of moral development. In Kohlberg's first three stages, goodness is equated with lack of punishment, usefulness, and approval, respectively. Good is seen as conformity to rule and ruler in stage four, and stage five comprises…
Descriptors: Adolescents, Adults, Attitude Measures, Conflict Resolution
Veldman, Donald J. – 1967
This report forms the background for some current efforts to develop computer-based scoring systems for One-Word Sentence Completion data. A general discussion of response grouping problems and a three-way frequency table for data reduction are included. In addition, six "structural" response variables are defined. The report concludes with an…
Descriptors: Attitude Measures, Cluster Grouping, College Freshmen, Computer Oriented Programs
Echternacht, Gary – 1973
Estimates for the variance of empirically determined scoring weights are given. It is shown that test item writers should write distractors that discriminate on the criterion variable when this type of scoring is used. (Author)
Descriptors: Item Analysis, Measurement Techniques, Multiple Choice Tests, Performance Criteria
Bejar, Isaac I. – 1985
The Test of English as a Foreign Language (TOEFL) was used in this study, which attempted to develop a new methodology for assessing the speededness of right-scored tests. Traditional procedures of assessing speededness have assumed that the test is scored under formula-scoring instructions; this approach is not always appropriate. In this study,…
Descriptors: College Entrance Examinations, English (Second Language), Estimation (Mathematics), Evaluation Methods
Peer reviewed Peer reviewed
Downey, Ronald G. – Applied Psychological Measurement, 1979
This research attempted to interrelate several methods of producing option weights (i.e., Guttman internal and external weights and judges' weights) and examined their effects on reliability and on concurrent, predictive, and face validity. It was concluded that option weighting offered limited, if any, improvement over unit weighting. (Author/CTM)
Descriptors: Achievement Tests, Answer Keys, Comparative Testing, High Schools
Brennan, Robert L. – 1974
The first four chapters of this report primarily provide an extensive, critical review of the literature with regard to selected aspects of the criterion-referenced and mastery testing fields. Major topics treated include: (a) definitions, distinctions, and background, (b) the relevance of classical test theory, (c) validity and procedures for…
Descriptors: Computer Programs, Confidence Testing, Criterion Referenced Tests, Error of Measurement
Pascale, Pietro J. – 1971
This brief review explains some alternate scoring procedures to the classical method of summing correct responses. The novel procedures attempt in some way to retrieve and use even the information in the wrong responses. (Author)
Descriptors: Cognitive Processes, Computer Oriented Programs, Confidence Testing, Educational Diagnosis
Echternacht, Gary – 1973
This study compares various item option scoring methods with respect to coefficient alpha and a concurrent validity coefficient. The scoring methods under consideration were: (1) formula scoring, (2) a priori scoring, (3) empirical scoring with an internal criterion, and (4) two modifications of formula scoring. The study indicates a clear…
Descriptors: Item Analysis, Measurement Techniques, Multiple Choice Tests, Performance Criteria
PDF pending restoration PDF pending restoration
Vale, C. David; Weiss, David J. – 1977
Twenty multiple-choice vocabulary items and 20 free-response vocabulary items were administered to 660 college students. The free-response items consisted of the stem words of the multiple-choice items. Testees were asked to respond to the free-response items with synonyms. A computer algorithm was developed to transform the numerous…
Descriptors: Ability, Adaptive Testing, Algorithms, Aptitude Tests
Stallings, William M.; Anderson, Frances E. – 1968
The reliability and the predictive and concurrent validity of the MATAP were investigated with the implicit goal of improving the prediction of course grades in the College of Fine and Applied Arts. It was found that reliability and validity coefficients were low, and it was suggested that the scoring system was a source of error variance. (MS)
Descriptors: Art Appreciation, Biographical Inventories, College Students, Correlation
Pages: 1  |  2  |  3  |  4