Showing 106 to 120 of 146 results
Sabers, Darrell L.; White, Gordon W. – 1971
A procedure for scoring multiple-choice tests by assigning different weights to every option of a test item is investigated. The weighting method used was based on that proposed by Davis, which involves taking the upper and lower 27% of a sample, according to some criterion measure, and using the percentages of these groups marking an item option…
Descriptors: Computer Oriented Programs, Item Analysis, Measurement Techniques, Multiple Choice Tests
Peer reviewed
Rippey, Robert M.; Smith, Susan – Evaluation and the Health Professions, 1979
Medical and dental students were administered two short confidence-scored tests on cellular and molecular biology. Increases in test reliability and predictive validity were found when test scores were adjusted for realism, but were not statistically significant. (Author/MH)
Descriptors: Confidence Testing, Dental Schools, Higher Education, Medical Students
American Coll. Testing Program, Iowa City, IA. – 1981
UNIACT, a major component of the American College Testing (ACT) Assessment Program, is one of the first interest inventories to employ a new technique for ensuring sex fairness in the reporting of scores. UNIACT was constructed with the goal that distributions of career options suggested to males and females would be similar. It is intended to…
Descriptors: Adults, Career Planning, Interest Inventories, Minority Groups
Donlon, Thomas F. – 1975
This study empirically determined the optimizing weight to be applied to the Wrongs Total Score in scoring rubrics of the general form S = R - kW, where S is the Score, R the Rights Total, k the weight and W the Wrongs Total, if reliability is to be maximized. As is well known, the traditional formula score rests on a theoretical framework which is…
Descriptors: Achievement Tests, Comparative Analysis, Guessing (Tests), Multiple Choice Tests
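Illustration (not part of the record): the scoring rule S = R - kW quoted in the abstract above can be sketched in a few lines of Python. The function names are hypothetical, and the choice k = 1/(n - 1) for an n-option item is the conventional correction-for-guessing weight, included here as an assumption; the study's empirically optimized weight is not given in the truncated abstract and is not reproduced.

```python
def formula_score(rights: int, wrongs: int, k: float) -> float:
    """Return S = R - k*W for a rights total R, wrongs total W, and weight k."""
    if rights < 0 or wrongs < 0:
        raise ValueError("rights and wrongs must be non-negative counts")
    return rights - k * wrongs


def conventional_weight(num_options: int) -> float:
    """Conventional correction-for-guessing weight, k = 1 / (n - 1).

    Assumption for illustration only: this is the textbook weight for
    n-option multiple-choice items, not the weight derived in the study.
    """
    if num_options < 2:
        raise ValueError("an item needs at least two options")
    return 1.0 / (num_options - 1)


if __name__ == "__main__":
    # Hypothetical examinee: 40 right, 10 wrong, on five-option items.
    k = conventional_weight(5)
    print(formula_score(40, 10, k))
```

Setting k = 0 reduces the rule to number-right scoring, so a single function covers both conventions.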
Cohen, Stuart J.; Bengston, John K. – 1975
One hundred twenty-eight observers randomly assigned to 16 treatment conditions in a modified Latin square design, viewed three videotapes of simulated classrooms in which teacher behavior was controlled (paralleling psychophysical procedures) to fit unambiguously into specific categories on ratings of frequency and variety of social…
Descriptors: Evaluation Methods, Observation, Pictorial Stimuli, Psychophysiology
Peer reviewed
Wilcox, Rand R. – Journal of Experimental Education, 1982
A closed sequential procedure for estimating true score is proposed for use with answer-until-correct tests. The accuracy of determining true score is the same as in conventional sequential solutions, but the possibility of using an unnecessarily large number of items is eliminated. (Author/CM)
Descriptors: Answer Sheets, Guessing (Tests), Item Banks, Measurement Techniques
Barter, Alice K.; And Others – 1980
A follow-up study of two instruments for evaluating college writing was conducted. The experimental scale (E Scale) was developed in 1976 and revised for this study. The control scale (C Scale) was described in the literature in 1977. Ten English majors graded ten essays from diagnostic entrance exams. Both the E Scale and the C Scale were used,…
Descriptors: College Entrance Examinations, Comparative Testing, Essay Tests, Evaluation Criteria
Larkin, Kevin C.; Weiss, David J. – 1975
A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs
Echternacht, Gary – 1971
Confidence testing has been used in varying forms over the past 40 years as a method for increasing the amount of information available from objective test items. This paper traces the development of the procedure from Hevner's beginning method up to the various methods in use today and describes both the testing procedures and scoring methods…
Descriptors: Confidence Testing, Guessing (Tests), Individual Characteristics, Measurement Techniques
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Maurelli, Vincent A.; Weiss, David J. – 1981
A Monte Carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory
Kreitzberg, Charles B.; Jones, Douglas H. – 1980
The Broad-Range Tailored Test (BRTT) is a computerized adaptive test. Each testee responds to 25 items; at the conclusion of the test the computer calculates a verbal ability score for the individual. The test was designed to yield a verbal ability score from the fifth grade level to the graduate school level. Two forms of the BRTT were…
Descriptors: Adaptive Testing, Computer Assisted Testing, High School Students, Higher Education
Manpower Administration (DOL), Washington, DC. – 1970
This revised manual for the General Aptitude Test Battery (GATB) discusses: (1) historical development; (2) item analysis; (3) factor analysis; (4) physical format; (5) general working population norms (ages 18-54); (6) intercorrelations of raw GATB test scores and of GATB aptitude scores; (7) development of norms for specific occupations (tables…
Descriptors: Adults, Aptitude Tests, Citations (References), High Schools
Kobrin, Jennifer L.; Kimmel, Ernest W. – College Board, 2006
Based on statistics from the first few administrations of the SAT writing section, the test is performing as expected. The reliability of the writing section is very similar to that of other writing assessments. Based on preliminary validity research, the writing section is expected to add modestly to the prediction of college performance when…
Descriptors: Test Construction, Writing Tests, Cognitive Tests, College Entrance Examinations
Nyberg, Verner R.; Nyberg, Adell M. – 1982
The supplementary information on "Alberta Essay Scales: Models" presented here includes similar models to employ in grading essays, the background and development of the scales, and the rationale for developing two scales of English mechanics and style/content. A standard is presented for evaluating current writing achievement by…
Descriptors: Academic Standards, Expository Writing, Foreign Countries, Grade 12