Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 40 |
| Since 2017 (last 10 years) | 118 |
| Since 2007 (last 20 years) | 211 |
Descriptor
| Multiple Choice Tests | 532 |
| Test Reliability | 532 |
| Test Validity | 302 |
| Test Construction | 238 |
| Test Items | 172 |
| Foreign Countries | 114 |
| Item Analysis | 101 |
| Higher Education | 90 |
| Difficulty Level | 85 |
| Guessing (Tests) | 74 |
| Scoring | 69 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 10 |
| Frary, Robert B. | 9 |
| Alonzo, Julie | 7 |
| Frisbie, David A. | 6 |
| Irvin, P. Shawn | 6 |
| Lai, Cheng-Fei | 6 |
| Park, Bitnara Jasmine | 6 |
| Tindal, Gerald | 6 |
| Wilcox, Rand R. | 5 |
| Albanese, Mark A. | 4 |
| Biancarosa, Gina | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 11 |
| Practitioners | 8 |
| Teachers | 5 |
Location
| Indonesia | 17 |
| Turkey | 17 |
| Germany | 8 |
| Iran | 8 |
| Canada | 6 |
| Malaysia | 4 |
| Nigeria | 4 |
| Australia | 3 |
| Florida | 3 |
| Japan | 3 |
| Pakistan | 3 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedPreece, P. F. W. – School Science Review, 1974
Describes the test item analysis used in test construction. (JR)
Descriptors: Discriminant Analysis, Evaluation Methods, Item Analysis, Multiple Choice Tests
Koch, Bill R.; Reckase, Mark D. – 1978
A live tailored testing study was conducted to compare the results of using either the one-parameter logistic model or the three-parameter logistic model to measure the performance of college students on multiple choice vocabulary items. The results of the study showed the three-parameter tailored testing procedure to be superior to the…
Descriptors: Adaptive Testing, Comparative Analysis, Goodness of Fit, Higher Education
Cross, Lawrence H.; And Others – 1980
A new scoring procedure for multiple choice tests attempts to assess partial knowledge and to restrict guessing. It is a variant of Coombs' elimination scoring method, adapted for use with the carbon-shield answer sheets commonly used with answer-until-correct scoring. Examinees are directed to erase the carbon shields of choices they are certain…
Descriptors: Answer Sheets, Guessing (Tests), Higher Education, Multiple Choice Tests
Frary, Robert B.; Lowry, Stephen R. – 1976
This paper presents theory concerning the relationships between reliability, misinformation and item discrimination coefficients. It is shown that, to the extent that misinformation rather than ignorance causes examinees to miss multiple-choice items, higher item discrimination coefficients and lower difficulty indices may be expected. Data were…
Descriptors: Bias, Correlation, Educational Research, Multiple Choice Tests
Hanna, Gerald S. – 1974
It was theorized that an answer-until-correct procedure, whereby an examinee marks responses to each multiple-choice question until feedback indicates that the correct answer has been marked, would yield scores of greater reliability and validity than conventional number-right procedure. Two papers and an application exercise for an undergraduate…
Descriptors: Feedback, Multiple Choice Tests, Performance Factors, Response Style (Tests)
Peer reviewedDuncan, George T.; Milton, E. O. – Psychometrika, 1978
A multiple-answer multiple-choice test is one which offers several alternate choices for each stem and any number of those choices may be considered to be correct. In this article, a class of scoring procedures called the binary class is discussed. (Author/JKS)
Descriptors: Answer Keys, Measurement Techniques, Multiple Choice Tests, Scoring Formulas
Peer reviewedSuinn, Richard M.; And Others – Educational and Psychological Measurement, 1987
The Suinn-Lew Asian Self Identity Acculturation Scale (SL-ASIA) is modeled after a successful scale for Hispanics. Initial reliability and validity data are reported for two samples of Asian subjects from two states. (Author/BS)
Descriptors: Acculturation, Asian Americans, Higher Education, Identification (Psychology)
Peer reviewedZimmerman, Donald W.; And Others – Journal of Experimental Education, 1984
Three types of test were compared: a completion test, a matching test, and a multiple-choice test. The completion test was more reliable than the matching test, and the matching test was more reliable than the multiple-choice test. (Author/BW)
Descriptors: Comparative Analysis, Error of Measurement, Higher Education, Mathematical Models
Costin, Frank – Educ Psychol Meas, 1970
Within the limitations of the present study, evidence is presented that the use of three-choice items improves the power and discrimination of tests. (PR)
Descriptors: Achievement Tests, Measurement Instruments, Measurement Techniques, Multiple Choice Tests
Peer reviewedCollet, Leverne S. – Journal of Educational Measurement, 1971
The purpose of this paper was to provide an empirical test of the hypothesis that elimination scores are more reliable and valid than classical corrected-for-guessing scores or weighted-choice scores. The evidence presented supports the hypothesized superiority of elimination scoring. (Author)
Descriptors: Evaluation, Guessing (Tests), Multiple Choice Tests, Scoring Formulas
Peer reviewedMelancon, Janet G.; Thompson, Bruce – Psychology in the Schools, 1989
Investigated measurement characteristics of both forms of Finding Embedded Figures Test (FEFT). College students (N=302) completed both forms of FEFT or one form of FEFT and Group Embedded Figures Test. Results suggest that FEFT forms provide reasonable reliable and valid data. (Author/NB)
Descriptors: College Students, Field Dependence Independence, Higher Education, Multiple Choice Tests
Peer reviewedBoodoo, Gwyneth M. – Educational Horizons, 1993
Examination of the role of psychometrics in the development of multiple-choice tests and performance-based assessments and consideration of validity and reliability issues leads to this conclusion: Choice of assessments for instruction or large-scale accountability depends on which is more appropriate for the particular purposes. Taxonomic…
Descriptors: Alternative Assessment, Classification, Multiple Choice Tests, Performance Based Assessment
Toben, Michael – English Teachers' Journal (Israel), 1974
Some of the pitfalls in the construction of multiple choice tests are discussed and examples of invalid questions given. (RM)
Descriptors: Language Instruction, Language Tests, Multiple Choice Tests, Objective Tests
Singh, Balwant; Lambert, Leroy – 1980
This fifty-item test is intended for the 'Responsibility' unit of the 'Law in a Free Society' materials published by the California Bar Association. The test was administered as a pre- and post-test to more than 600 grade 11 students. The pre-test data for all students were item-analyzed and the results are made available along with a table of…
Descriptors: Citizenship Education, Grade 11, High Schools, Item Analysis
Ebel, Robert L. – 1981
An alternate-choice test item is a simple declarative sentence, one portion of which is given with two different wordings. For example, "Foundations like Ford and Carnegie tend to be (1) eager (2) hesitant to support innovative solutions to educational problems." The examinee's task is to choose the alternative that makes the sentence…
Descriptors: Comparative Testing, Difficulty Level, Guessing (Tests), Multiple Choice Tests


