Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 4 |
Descriptor
| Probability | 8 |
| Scoring Formulas | 8 |
| Guessing (Tests) | 3 |
| Multiple Choice Tests | 3 |
| Test Reliability | 3 |
| Comparative Analysis | 2 |
| Evaluation Methods | 2 |
| Scores | 2 |
| Scoring Rubrics | 2 |
| Test Validity | 2 |
| Athletes | 1 |
| More ▼ | |
Source
| Applied Psychological… | 2 |
| Educational and Psychological… | 2 |
| Journal of Experimental… | 1 |
| TESOL Quarterly: A Journal… | 1 |
| Teaching Mathematics and Its… | 1 |
| Teaching Statistics: An… | 1 |
Author
| Aiken, Lewis R. | 1 |
| Bonett, Douglas G. | 1 |
| Fletcher, Michael | 1 |
| Hamdan, M. A. | 1 |
| Hsu, Louis M. | 1 |
| Kreiner, Svend | 1 |
| Stewart, Jeffrey | 1 |
| Van Hecke, Tanja | 1 |
| Wagaman, John | 1 |
| White, David A. | 1 |
Publication Type
| Journal Articles | 8 |
| Reports - Research | 5 |
| Reports - Descriptive | 2 |
| Reports - Evaluative | 1 |
Education Level
| Adult Education | 1 |
| Higher Education | 1 |
Audience
Location
| Denmark | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Wagaman, John; Fletcher, Michael – Teaching Statistics: An International Journal for Teachers, 2018
This article considers how a handicapping system should be devised for squash. It looks at the American scoring system, and whether it is possible to have a fair system of handicapping. We consider "fair" from a perspective of expected number of rallies won and probability of winning.
Descriptors: Probability, Athletes, Athletics, Inhibition
Van Hecke, Tanja – Teaching Mathematics and Its Applications, 2015
Optimal assessment tools should measure in a limited time the knowledge of students in a correct and unbiased way. A method for automating the scoring is multiple choice scoring. This article compares scoring methods from a probabilistic point of view by modelling the probability to pass: the number right scoring, the initial correction (IC) and…
Descriptors: Multiple Choice Tests, Error Correction, Grading, Evaluation Methods
Kreiner, Svend – Applied Psychological Measurement, 2011
To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
Descriptors: Item Analysis, Correlation, Item Response Theory, Models
Stewart, Jeffrey; White, David A. – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2011
Multiple-choice tests such as the Vocabulary Levels Test (VLT) are often viewed as a preferable estimator of vocabulary knowledge when compared to yes/no checklists, because self-reporting tests introduce the possibility of students overreporting or underreporting scores. However, multiple-choice tests have their own unique disadvantages. It has…
Descriptors: Guessing (Tests), Scoring Formulas, Multiple Choice Tests, Test Reliability
Peer reviewedHsu, Louis M. – Educational and Psychological Measurement, 1979
Though the Paired-Item-Score (Eakin and Long) (EJ 174 780) method of scoring true-false tests has certain advantages over the traditional scoring methods (percentage right and right minus wrong), these advantages are attained at the cost of a larger risk of misranking the examinees. (Author/BW)
Descriptors: Comparative Analysis, Guessing (Tests), Objective Tests, Probability
Peer reviewedHamdan, M. A. – Journal of Experimental Education, 1979
The distribution theory underlying corrections for guessing is analyzed, and the probability distributions of the random variables are derived. The correction in grade, based on random guessing of unknown answers, is compared with corrections based on educated guessing. (Author/MH)
Descriptors: Guessing (Tests), Maximum Likelihood Statistics, Multiple Choice Tests, Probability
Peer reviewedAiken, Lewis R. – Educational and Psychological Measurement, 1980
Procedures for computing content validity and consistency reliability coefficients and determining the statistical significance of these coefficients are described. Procedures employing the multinomial probability distribution for small samples and normal curve probability estimates for large samples, can be used where judgments are made on…
Descriptors: Computer Programs, Measurement Techniques, Probability, Questionnaires
Bonett, Douglas G. – Applied Psychological Measurement, 2006
Comparing variability of test scores across alternate forms, test conditions, or subpopulations is a fundamental problem in psychometrics. A confidence interval for a ratio of standard deviations is proposed that performs as well as the classic method with normal distributions and performs dramatically better with nonnormal distributions. A simple…
Descriptors: Intervals, Mathematical Concepts, Comparative Analysis, Psychometrics

Direct link
