Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Rentz, R. Robert; Rentz, Charlotte C. – 1978
Issues of concern to test developers interested in applying the Rasch model are discussed. The current state of the art, recommendations for use of the model, further needs, and controversies are described for the three stages of test construction: (1) definition of the content of the test and item writing; (2) item analysis; and (3) test…
Descriptors: Ability, Achievement Tests, Difficulty Level, Goodness of Fit
Catts, Ralph – 1978
The reliability of multiple choice tests--containing different numbers of response options--was investigated for 260 students enrolled in technical college economics courses. Four test forms, constructed from previously used four-option items, were administered, consisting of (1) 60 two-option items--two distractors randomly discarded; (2) 40…
Descriptors: Answer Sheets, Difficulty Level, Foreign Countries, Higher Education
Roid, Gale; Finn, Patrick – 1978
The feasibility of generating multiple-choice test questions by transforming sentences from prose instructional materials was examined. A computer-based algorithm was used to analyze prose subject matter and to identify high-information words. Sentences containing selected words were then transformed into multiple-choice items by four writers who…
Descriptors: Algorithms, Criterion Referenced Tests, Difficulty Level, Form Classes (Languages)
Benson, Jeri; And Others – 1978
The precision and efficiency of a cognitive test constructed by three different methods of item analysis were compared, using the verbal aptitude subtest of the Florida Twelfth Grade Test. Classical item analysis, factor analysis, and the Rasch logistic model were used in the construction of 15- and 30-item subtests and replicated for samples of 250,…
Descriptors: Cognitive Tests, Comparative Analysis, Efficiency, Factor Analysis
Millman, Jason – 1978
Test items, all referencing the same instructional objective, are not equally difficult. This investigation attempts to identify some of the determinants of item difficulty within the context of a first course in educational statistics. Computer-generated variations of items were used to provide the data. The results were used to investigate the…
Descriptors: Computer Assisted Testing, Content Analysis, Criterion Referenced Tests, Difficulty Level
Harms, Robert A. – 1978
Based on John Rawls' theory of justice as fairness, a nine-item rating scale was developed to serve as a criterion in studies of test item bias. Two principles underlie the scale: (1) Within a defined usage, test items should not affect students so that they are unable to do as well as their abilities would indicate; and (2) within the domain of a…
Descriptors: Achievement Tests, Content Analysis, Culture Fair Tests, Evaluation Criteria
Peer reviewed: Alexander, John J., Ed. – Journal of Chemical Education, 1987
Contains two articles relating to chemistry examination questions. One provides examples of how to sequence multiple-choice questions so that partial credit may be given for some responses. The second includes a question and solution dealing with stereoisomerism as a result of free radical chlorination of a nonstereoisomeric substance. (TW)
Descriptors: Chemistry, College Science, Higher Education, Problem Sets
Peer reviewed: Sevenair, John P.; Burkett, Allan R. – Journal of Chemical Education, 1988
Describes statistical analyses of tests used for organic chemistry classes and attempts to pose a model to explain the results. Concluded that students who possess a slight grasp of a concept actually have less of a chance of answering an item correctly than those who merely guess. (CW)
Descriptors: Chemistry, College Science, Higher Education, Item Analysis
Peer reviewed: Trafton, Paul – Arithmetic Teacher, 1987
Argues that tests should be used to improve instructional programs. Specific suggestions include the study of reports on test performance, item analysis on classroom tests, item analysis as a part of standardized test reports, use of diagnostic or inventory tests, and careful selection of test items. (PK)
Descriptors: Educational Assessment, Elementary Education, Elementary School Mathematics, Instructional Improvement
Peer reviewed: Grosse, Martin E.; Wright, Benjamin D. – Evaluation and the Health Professions, 1986
Based on the standard-setting procedures of the American Board of Preventive Medicine for their Core Test, this article describes how Rasch measurement can facilitate using test content judgments in setting a standard. Rasch measurement can then be used to evaluate and improve the precision of the standard and to hold it constant across time.…
Descriptors: Certification, Criterion Referenced Tests, Difficulty Level, Health Personnel
Brodeur, Doris R. – Educational Technology, 1986
Reviews seven commercially produced test generator programs appropriate for use by classroom teachers or individual instructors and identifies item construction and test formatting features that facilitate test design and delivery. Test generator programs and their manufacturers are listed. (MBR)
Descriptors: Computer Assisted Testing, Computer Software, Costs, Evaluation Criteria
Peer reviewed: Bennett, Randy Elliot; And Others – Journal of Educational Measurement, 1987
To identify broad classes of items on the Scholastic Aptitude Test that behave differentially for handicapped examinees taking special, extended-time administrations, the performance of nine handicapped groups and one nonhandicapped group on each of two forms of the SAT was investigated through a two-stage procedure. (Author/LMO)
Descriptors: College Entrance Examinations, Disabilities, Hearing Impairments, High Schools
Peer reviewed: Benson, Jeri; Hocevar, Dennis – Journal of Educational Measurement, 1985
Three rating scales--with all positive or all negative wording, or a mixture of both--were administered to 522 children in grades four through six. The results indicated that it was difficult for students to indicate agreement by disagreeing with a negative statement. This affected test validity. (Author/GDC)
Descriptors: Attitude Measures, Elementary School Students, Intermediate Grades, Item Analysis
Peer reviewed: Morgenstern, Carol Faltin; Renner, John W. – Journal of Research in Science Teaching, 1984
Determined which of 10 rational thinking powers are measured by 12 commercially available standardized science tests. One result reported is that 90 percent of the test items required only recall. The conclusion was drawn that the producers of standardized tests are not concerned with measuring student achievement of the rational powers.…
Descriptors: Biology, Chemistry, Cognitive Processes, Earth Science
Peer reviewed: Bethell-Fox, Charles E.; And Others – Intelligence, 1984
This study of individual differences in performance of a geometric analogies task included four-alternative test items and studied eye movements and confidence judgments as well as latency and error. Results were interpreted using two hypothesized performance strategies: constructive matching and response elimination. (Author/BW)
Descriptors: Cognitive Processes, Confidence Testing, Difficulty Level, Eye Movements


