Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 3 |
Descriptor
| Scoring | 14 |
| Scoring Formulas | 14 |
| Test Construction | 14 |
| Test Validity | 10 |
| Test Reliability | 9 |
| Test Interpretation | 5 |
| Item Analysis | 4 |
| Measurement Techniques | 4 |
| Multiple Choice Tests | 4 |
| Testing | 4 |
| Guessing (Tests) | 3 |
| More ▼ | |
Source
| Assessment in Education:… | 1 |
| Journal of Educational… | 1 |
| Journal of School Health | 1 |
| Management Science | 1 |
| Neusprachliche Mitteilungen | 1 |
| Online Submission | 1 |
Author
Publication Type
| Reports - Research | 7 |
| Speeches/Meeting Papers | 4 |
| Journal Articles | 2 |
| Reports - Descriptive | 1 |
Education Level
| Elementary Secondary Education | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
| Secondary Education | 1 |
Audience
Location
| United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Graduate Record Examinations | 1 |
What Works Clearinghouse Rating
Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018
Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…
Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity
Buri, John R.; Cromett, Cristina E.; Post, Maria C.; Landis, Anna Marie; Alliegro, Marissa C. – Online Submission, 2015
Rationale is presented for the derivation of a new measure of stressful life events for use with students [Negative Life Events Scale for Students (NLESS)]. Ten stressful life events questionnaires were reviewed, and the more than 600 items mentioned in these scales were culled based on the following criteria: (a) only long-term and unpleasant…
Descriptors: Experience, Social Indicators, Stress Variables, Affective Measures
Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011
At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are…
Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance
Peer reviewedFeuerman, Martin; Weiss, Harvey – Management Science, 1973
A model is presented for test construction and scoring that utilizes the knapsack model of mathematical programing. The method applies to examinations of the type in which a choice exists in the number of questions the examinee is required to answer. The method has been utilized with respect to a mathematics examination, and computer-generated…
Descriptors: Computer Oriented Programs, Mathematics, Models, Scoring
Frary, Robert B. – 1980
Ordinal response modes for multiple choice tests are those under which the examinee marks one or more choices in an effort to identify the correct choice, or include it in a proper subset of the choices. Two ordinal response modes: answer-until-correct, and Coomb's elimination of choices which examinees identify as wrong, were analyzed for scoring…
Descriptors: Guessing (Tests), Multiple Choice Tests, Responses, Scoring
Peer reviewedDiamond, James J. – Journal of Educational Measurement, 1975
Investigates the reliability and validity of scores yielded from a new scoring formula. (Author/DEP)
Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Scoring
Hambleton, Ronald K.; Novick, Melvin R. – 1972
In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…
Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling
Kahl, Peter W. – Neusprachliche Mitteilungen, 1971
Descriptors: Achievement Tests, English (Second Language), Language Tests, Scoring
Kingston, Neal M. – 1984
In October 1981, the Graduate Record Examinations (GRE) Program introduced a new version of the General Test (GT) that differed from the previous version in three major ways. The GT was altered to: reduce the verbal measure's speededness and allow the addition of several quantitative items; delete two item types from the analytical measure; and…
Descriptors: College Entrance Examinations, Equated Scores, Higher Education, Mathematics Tests
Larkin, Kevin C.; Weiss, David J. – 1975
A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs
Echternacht, Gary – 1973
Estimates for the variance of empirically determined scoring weights are given. It is shown that test item writers should write distractors that discriminate on the criterion variable when this type of scoring is used. (Author)
Descriptors: Item Analysis, Measurement Techniques, Multiple Choice Tests, Performance Criteria
Brennan, Robert L. – 1974
The first four chapters of this report primarily provide an extensive, critical review of the literature with regard to selected aspects of the criterion-referenced and mastery testing fields. Major topics treated include: (a) definitions, distinctions, and background, (b) the relevance of classical test theory, (c) validity and procedures for…
Descriptors: Computer Programs, Confidence Testing, Criterion Referenced Tests, Error of Measurement
Echternacht, Gary – 1973
This study compares various item option scoring methods with respect to coefficient alpha and a concurrent validity coefficient. The scoring methods under consideration were: (1) formula scoring, (2) a priori scoring, (3) empirical scoring with an internal criterion, and (4) two modifications of formula scoring. The study indicates a clear…
Descriptors: Item Analysis, Measurement Techniques, Multiple Choice Tests, Performance Criteria
Sympson, James B. – 1979
Development of The Assessment of Basic Competencies (ABC), a test battery based on the three-parameter logistic model, is described. Eleven dimensions of intellectual growth are measured, from the pre-kindergarten level through ninth grade. An educationally relevant skill domain is represented by each test. Unique properties of the test, based on…
Descriptors: Academic Ability, Cognitive Processes, Cognitive Tests, Elementary Education

Direct link
