Hodson, D. – Research in Science and Technological Education, 1984
Investigated the effect on student performance of changes in question structure and sequence on a GCE O-level multiple-choice chemistry test. One finding noted is that there was virtually no change in test reliability on reducing the number of options (from five to per test item). (JN)
Descriptors: Academic Achievement, Chemistry, Multiple Choice Tests, Science Education
Sax, Gilbert; Reiter, Pauline B. – 1980
Despite the popularity of both multiple-choice (MC) and true-false (TF) items, most investigations comparing the two formats have done so to determine the optimum number of choices to be given to students within a given time period. The purpose of this investigation was to compare the reliabilities and the validities of both formats when the items…
Descriptors: Analysis of Variance, Correlation, Higher Education, Item Analysis

Allison, Donald E. – Alberta Journal of Educational Research, 1984
Reports that no significant difference in reliability appeared between a heterogeneous and a homogeneous form of the same general science matching-item test administered to 316 sixth-grade students but that scores on the heterogeneous form of the test were higher, independent of the examinee's sex or intelligence. (SB)
Descriptors: Comparative Analysis, Comparative Testing, Elementary Education, Grade 6
Steele, Cam Monroe; Reinsch, N. L., Jr. – 1983
An instrument for measuring telephone apprehension was developed to facilitate research into hypothesized relationships between communication apprehension and telephone apprehension. A set of 92 Likert-type items was adapted from previous communication apprehension scales and administered to 81 undergraduate students in a speech communication…
Descriptors: Adults, Attitude Measures, Communication Apprehension, Communication Research
Stansfield, Charles W.; And Others – 1992
This report describes the development, construction, and validation of the Preliminary Chinese Proficiency Test (Pre-CPT), a standardized, nationally-normed test of listening and reading comprehension for beginning-level native English-speaking learners of Chinese as a second language. The Pre-CPT was designed as a lower-level version of the…
Descriptors: Chinese, Higher Education, Language Proficiency, Language Tests
Chissom, Brad; Chukabarah, Prince C. O. – 1985
The comparative effects of various sequences of test items were examined for over 900 graduate students enrolled in an educational research course at The University of Alabama, Tuscaloosa. The experiment, which was conducted a total of four times using four separate tests, presented three different arrangements of 50 multiple-choice items: (1)…
Descriptors: Analysis of Variance, Comparative Testing, Difficulty Level, Graduate Students
Phillips, Gary W.; Huynh, Huynh – 1985
A procedure which may be used to project the frequency distribution of one test onto that of another test is described and illustrated. The procedure is useful when a test developer wishes to construct an alternate form with preferred distributional characteristics. For example, the test developer may wish to construct a new test form with a…
Descriptors: Achievement Tests, Elementary Secondary Education, Item Analysis, Item Banks
Goldstein, Harvey; Wolf, Alison – 1986
Locally developed occupational tests were administered to 16- and 17-year-olds in a government-sponsored vocational education program in the United Kingdom over a six-month period in 1984. Job skills were tested in two occupational areas: use of a micrometer and invoice completion. Some performance tests were designed by researchers and some by…
Descriptors: Comparative Testing, Criterion Referenced Tests, Evaluation Criteria, Foreign Countries
Mattheis, Floyd E.; Nakayama, Genzo – 1988
The purpose of this project was to construct a valid and reliable noncurriculum-specific measure of integrated science process skills intended for use with middle school students. The major efforts in test development were focused on the refinements and modifications of the set of objectives and test items assessed by the existing Middle Grades…
Descriptors: Graphs, Item Analysis, Middle Schools, Problem Solving
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores

Jones, Allan – Journal of Geography in Higher Education, 1997
Examines the increase in popularity of objective testing in the United Kingdom and addresses some of the accompanying academic issues. Reports on a case study of test production and implementation to illustrate issues of time costs and benefits. Discusses question styles, marking schemes, and the problem of guesswork. (MJP)
Descriptors: Comparative Testing, Educational Practices, Educational Trends, Foreign Countries
Macpherson, Colin R.; Rowley, Glenn L. – 1986
Teacher-made mastery tests were administered in a classroom-sized sample to study their decision consistency. Decision consistency of criterion-referenced tests is usually defined in terms of the proportion of examinees who are classified in the same way after two test administrations. Single-administration estimates of decision consistency were…
Descriptors: Classroom Research, Comparative Testing, Criterion Referenced Tests, Cutting Scores