Descriptor
| Guessing (Tests) | 13 |
| Item Analysis | 13 |
| Scoring Formulas | 13 |
| Multiple Choice Tests | 8 |
| Test Items | 6 |
| Confidence Testing | 5 |
| Difficulty Level | 4 |
| Mathematical Models | 4 |
| Test Reliability | 4 |
| Measurement Techniques | 3 |
| Response Style (Tests) | 3 |
| More ▼ | |
Author
Publication Type
| Reports - Research | 9 |
| Journal Articles | 3 |
| Speeches/Meeting Papers | 3 |
Education Level
Audience
| Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
| Graduate Record Examinations | 1 |
What Works Clearinghouse Rating
Peer reviewedWilcox, Rand R. – Educational and Psychological Measurement, 1980
Technical problems in achievement testing associated with using latent structure models to estimate the probability of guessing correct responses by examinees is studied; also the lack of problems associated with using Wilcox's formula score. Maximum likelihood estimates are derived which may be applied when items are hierarchically related.…
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Maximum Likelihood Statistics
Peer reviewedAustin, Joe Dan – Psychometrika, 1981
On distractor-identification tests students mark as many distractors as possible on each test item. A grading scale is developed for this type testing. The score is optimal in that it yields an unbiased estimate of the student's score as if no guessing had occurred. (Author/JKS)
Descriptors: Guessing (Tests), Item Analysis, Measurement Techniques, Scoring Formulas
Peer reviewedKane, Michael; Moloney, James – Applied Psychological Measurement, 1978
The answer-until-correct (AUC) procedure requires that examinees respond to a multi-choice item until they answer it correctly. Using a modified version of Horst's model for examinee behavior, this paper compares the effect of guessing on item reliability for the AUC procedure and the zero-one scoring procedure. (Author/CTM)
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests
Peer reviewedLord, Frederic M. – Educational and Psychological Measurement, 1971
Descriptors: Ability, Adaptive Testing, Computer Oriented Programs, Difficulty Level
Lowry, Stephen R. – 1979
A specially designed answer format was used for three tests in a college level agriculture class of 19 students to record responses to three things about each item: (1) the student's choice of the best answer; (2) the degree of certainty with which the answer was chosen; and (3) all the answer choices which the student was certain were incorrect.…
Descriptors: Achievement Tests, Confidence Testing, Guessing (Tests), Higher Education
PDF pending restorationKane, Michael T.; Moloney, James M. – 1976
The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests
Donlon, Thomas F.; Fitzpatrick, Anne R. – 1978
On the basis of past research efforts to improve multiple-choice test information through differential weighting of responses to wrong answers (distractors), two statistical indices are developed. Each describes the properties of response distributions across the options of an item. Jaspen's polyserial generalization of the biserial correlation…
Descriptors: Confidence Testing, Difficulty Level, Guessing (Tests), High Schools
Peer reviewedAtkins, Warren J.; And Others – Educational Studies in Mathematics, 1991
Results from the Australian Mathematics Competition for 1988 and 1989 (n=309,443 and n=331,660) were analyzed on three statistical measures to determine risk-taking tendencies by groups of students classified by gender, school year, and achievement level. Results showed statistically significant differences for gender but varied depending on the…
Descriptors: Distractors (Tests), Guessing (Tests), Item Analysis, Mathematics Achievement
PDF pending restorationVale, C. David; Weiss, David J. – 1977
Twenty multiple-choice vocabulary items and 20 free-response vocabulary items were administered to 660 college students. The free-response items consisted of the stem words of the multiple-choice items. Testees were asked to respond to the free-response items with synonyms. A computer algorithm was developed to transform the numerous…
Descriptors: Ability, Adaptive Testing, Algorithms, Aptitude Tests
Echternacht, Gary J.; And Others – 1971
The feasibility and the cost-effectiveness of using confidence testing as a diagnostic aid in technical training programs were studied. Two types of confidence testing, Pick-One and Distribute 100 Points, were developed for comparison to conventional multiple-choice testing. The criteria for feasibility included end of block examination grades,…
Descriptors: Confidence Testing, Cost Effectiveness, Educational Diagnosis, Evaluation
Rippey, Robert M. – 1971
Technical improvements, which may be made in the reliability and validity of tests through confidence scores, are discussed. However, studies indicate that subjects do not handle their confidence uniformly. (MS)
Descriptors: Computer Programs, Confidence Testing, Correlation, Difficulty Level
Bruno, James E.; Opp, Ronald D. – 1985
The admissable probability measurement (APM) format was used to score a criterion referenced language arts test administered in an inner city junior high school. Its 30 items covered capitalization, punctuation, parts of speech, and sentence analysis. With APM, students indicate their confidence in their answer choice, and guessing is heavily…
Descriptors: Confidence Testing, Criterion Referenced Tests, Educational Testing, Equivalency Tests
Kingston, Neal M. – 1985
Birnbaum's three-parameter logistic item response model was used to study guessing behavior of low ability examinees on the Graduate Record Examinations (GRE) General Test, Verbal Measure. GRE scoring procedures had recently changed, from a scoring formula which corrected for guessing, to number-right scoring. The three-parameter theory was used…
Descriptors: Academic Aptitude, Analysis of Variance, College Entrance Examinations, Difficulty Level


