ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	3

Descriptor

Scoring	14
Scoring Formulas	14
Test Construction	14
Test Validity	10
Test Reliability	9
Test Interpretation	5
Item Analysis	4
Measurement Techniques	4
Multiple Choice Tests	4
Testing	4
Guessing (Tests)	3
Research Reports	3
Test Items	3
Test Results	3
Testing Problems	3
Computer Programs	2
Criterion Referenced Tests	2
Equated Scores	2
Higher Education	2
Performance Criteria	2
Psychometrics	2
Questionnaires	2
Rating Scales	2
Standardized Tests	2
Tests	2
More ▼

Source

Assessment in Education:…	1
Journal of Educational…	1
Journal of School Health	1
Management Science	1
Neusprachliche Mitteilungen	1
Online Submission	1

Publication Type

Reports - Research	7
Speeches/Meeting Papers	4
Journal Articles	2
Reports - Descriptive	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

United Kingdom

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Development and Validity Testing of the School Health Score Card

Peer reviewed

Direct link

Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018

Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…

Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity

Negative Life Events Scale for Students (NLESS)

Download full text

Buri, John R.; Cromett, Cristina E.; Post, Maria C.; Landis, Anna Marie; Alliegro, Marissa C. – Online Submission, 2015

Rationale is presented for the derivation of a new measure of stressful life events for use with students [Negative Life Events Scale for Students (NLESS)]. Ten stressful life events questionnaires were reviewed, and the more than 600 items mentioned in these scales were culled based on the following criteria: (a) only long-term and unpleasant…

Descriptors: Experience, Social Indicators, Stress Variables, Affective Measures

Improving Marking Quality through a Taxonomy of Mark Schemes

Peer reviewed

Direct link

Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011

At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are…

Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance

A Mathematical Programming Model for Test Construction and Scoring

Peer reviewed

Feuerman, Martin; Weiss, Harvey – Management Science, 1973

A model is presented for test construction and scoring that utilizes the knapsack model of mathematical programing. The method applies to examinations of the type in which a choice exists in the number of questions the examinee is required to answer. The method has been utilized with respect to a mathematics examination, and computer-generated…

Descriptors: Computer Oriented Programs, Mathematics, Models, Scoring

Expected Multiple-Choice Test Item Scores Under Ordinal Response Modes.

Frary, Robert B. – 1980

Ordinal response modes for multiple choice tests are those under which the examinee marks one or more choices in an effort to identify the correct choice, or include it in a proper subset of the choices. Two ordinal response modes: answer-until-correct, and Coomb's elimination of choices which examinees identify as wrong, were analyzed for scoring…

Descriptors: Guessing (Tests), Multiple Choice Tests, Responses, Scoring

A Preliminary Study of the Reliability and Validity of a Scoring Procedure Based Upon Confidence and Partial Information

Peer reviewed

Diamond, James J. – Journal of Educational Measurement, 1975

Investigates the reliability and validity of scores yielded from a new scoring formula. (Author/DEP)

Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Scoring

Toward an Integration of Theory and Method for Criterion-Referenced Tests.

Download full text

Hambleton, Ronald K.; Novick, Melvin R. – 1972

In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…

Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling

Moglichkeiten der Leistungsmessung durch informelle Tests im Englischunterricht (Possibilities for Measuring Achievement in English Instruction through Informal Tests)

Kahl, Peter W. – Neusprachliche Mitteilungen, 1971

Descriptors: Achievement Tests, English (Second Language), Language Tests, Scoring

Analysis of Shifts in Scale and Construct through the Use of Repeater Data.

Download full text

Kingston, Neal M. – 1984

In October 1981, the Graduate Record Examinations (GRE) Program introduced a new version of the General Test (GT) that differed from the previous version in three major ways. The GT was altered to: reduce the verbal measure's speededness and allow the addition of several quantitative items; delete two item types from the analytical measure; and…

Descriptors: College Entrance Examinations, Equated Scores, Higher Education, Mathematics Tests

An Empirical Comparison of Two-Stage and Pyramidal Adaptive Ability Testing.

Download full text

Larkin, Kevin C.; Weiss, David J. – 1975

A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…

Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs

A Note on the Variances of Empirically Derived Option Scoring Weights.

Download full text

Echternacht, Gary – 1973

Estimates for the variance of empirically determined scoring weights are given. It is shown that test item writers should write distractors that discriminate on the criterion variable when this type of scoring is used. (Author)

Descriptors: Item Analysis, Measurement Techniques, Multiple Choice Tests, Performance Criteria

The Evaluation of Mastery Test Items. Final Report.

Download full text

Brennan, Robert L. – 1974

The first four chapters of this report primarily provide an extensive, critical review of the literature with regard to selected aspects of the criterion-referenced and mastery testing fields. Major topics treated include: (a) definitions, distinctions, and background, (b) the relevance of classical test theory, (c) validity and procedures for…

Descriptors: Computer Programs, Confidence Testing, Criterion Referenced Tests, Error of Measurement

A Comparison of Various Item Option Weighting Schemes.

Download full text

Echternacht, Gary – 1973

This study compares various item option scoring methods with respect to coefficient alpha and a concurrent validity coefficient. The scoring methods under consideration were: (1) formula scoring, (2) a priori scoring, (3) empirical scoring with an internal criterion, and (4) two modifications of formula scoring. The study indicates a clear…

Descriptors: Item Analysis, Measurement Techniques, Multiple Choice Tests, Performance Criteria

The Assessment of Basic Competencies: A New Test Battery.

Sympson, James B. – 1979

Development of The Assessment of Basic Competencies (ABC), a test battery based on the three-parameter logistic model, is described. Eleven dimensions of intellectual growth are measured, from the pre-kindergarten level through ninth grade. An educationally relevant skill domain is represented by each test. Unique properties of the test, based on…

Descriptors: Academic Ability, Cognitive Processes, Cognitive Tests, Elementary Education

Echternacht, Gary	2
Ahmed, Ayesha	1
Alliegro, Marissa C.	1
Brennan, Robert L.	1
Buri, John R.	1
Choi, Soo Hyuk	1
Cromett, Cristina E.	1
Diamond, James J.	1
Feuerman, Martin	1
Frary, Robert B.	1
Hambleton, Ronald K.	1
Kahl, Peter W.	1
Kang, Joon-ho	1
Kim, Yaeji	1
Kingston, Neal M.	1
Landis, Anna Marie	1
Larkin, Kevin C.	1
Lim, Cheolil	1
Novick, Melvin R.	1
Pollitt, Alastair	1
Post, Maria C.	1
Sim, Jin A.	1
Sympson, James B.	1
Weiss, David J.	1
Weiss, Harvey	1
More ▼