ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	11
Since 2007 (last 20 years)	24

Descriptor

Scoring Formulas	146
Test Reliability	146
Test Validity	66
Multiple Choice Tests	47
Guessing (Tests)	38
Test Construction	33
Test Interpretation	26
Test Items	25
Higher Education	23
Scoring	23
Item Analysis	22
Response Style (Tests)	22
Measurement Techniques	21
Weighted Scores	19
Testing Problems	18
Statistical Analysis	16
Testing	15
Correlation	14
Comparative Analysis	12
Confidence Testing	12
Evaluation Methods	12
Scores	12
Achievement Tests	11
True Scores	11
Factor Analysis	10
More ▼

Publication Type

Reports - Research	72
Journal Articles	48
Speeches/Meeting Papers	14
Reports - Evaluative	10
Tests/Questionnaires	7
Reports - Descriptive	6
Guides - Non-Classroom	4
Guides - Classroom - Teacher	2
Information Analyses	2
Opinion Papers	2
Collected Works - General	1
Guides - General	1
Numerical/Quantitative Data	1
More ▼

Education Level

Higher Education	7
Postsecondary Education	6
Elementary Education	2
Elementary Secondary Education	2
Secondary Education	2
Adult Education	1
High Schools	1
Junior High Schools	1
Middle Schools	1

Audience

Researchers	2
Practitioners	1

Location

New York (New York)	2
Australia	1
Canada	1
Germany	1
India	1
Malaysia	1
Minnesota	1
Mississippi	1
New York	1
North Carolina	1
Ohio	1
Turkey	1
United Kingdom	1
United States	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Graduate Record Examinations	3
Bender Gestalt Test	2
California Achievement Tests	2
Group Embedded Figures Test	2
Rod and Frame Test	2
SAT (College Admission Test)	2
Comprehensive Tests of Basic…	1
General Aptitude Test Battery	1
Graduate Management Admission…	1
Learning Style Inventory	1
Matching Familiar Figures Test	1
Preliminary Scholastic…	1
Rosenberg Self Esteem Scale	1
Strong Vocational Interest…	1
Test of English as a Foreign…	1
Wechsler Intelligence Scale…	1
Woodcock Reading Mastery Test	1
More ▼

What Works Clearinghouse Rating

Test Reliability X

Showing 106 to 120 of 146 results Save | Export

The Effect of Differential Weighting of Individual Item Responses on the Predictive Validity and Reliability of an Aptitude Test.

Download full text

Sabers, Darrell L.; White, Gordon W. – 1971

A procedure for scoring multiple-choice tests by assigning different weights to every option of a test item is investigated. The weighting method used was based on that proposed by Davis, which involves taking the upper and lower 27% of a sample, according to some criterion measure, and using the percentages of these groups marking an item option…

Descriptors: Computer Oriented Programs, Item Analysis, Measurement Techniques, Multiple Choice Tests

Improving the Reliability and Validity of Confidence-Scored Tests by Adjusting for Realism.

Peer reviewed

Rippey, Robert M.; Smith, Susan – Evaluation and the Health Professions, 1979

Medical and dental students were administered two short confidence-scored tests on cellular and molecular biology. Increases in test reliability and predictive validity were found when test scores were adjusted for realism, but were not statistically significant. (Author/MH)

Descriptors: Confidence Testing, Dental Schools, Higher Education, Medical Students

Technical Report for the Unisex Edition of the ACT Interest Inventory (UNIACT).

Download full text

American Coll. Testing Program, Iowa City, IA. – 1981

UNIACT, a major component of the American College Testing (ACT) Assessment Program, is one of the first interest inventories to employ a new technique for ensuring sex fairness in the reporting of scores. UNIACT was constructed with the goal that distributions of career options suggested to males and females would be similar. It is intended to…

Descriptors: Adults, Career Planning, Interest Inventories, Minority Groups

An Optimizing Weight For Wrong Scores.

Download full text

Donlon, Thomas F. – 1975

This study empirically determined the optimizing weight to be applied to the Wrongs Total Score in scoring rubrics of the general form = R - kW, where S is the Score, R the Rights Total, k the weight and W the Wrongs Total, if reliability is to be maximized. As is well known, the traditional formula score rests on a theoretical framework which is…

Descriptors: Achievement Tests, Comparative Analysis, Guessing (Tests), Multiple Choice Tests

A Psychophysical Investigation of Factors Affecting Teacher-Observers' Judgment.

Download full text

Cohen, Stuart J.; Bengston, John K. – 1975

One hundred twenty-eight observers randomly assigned to 16 treatment conditions in a modified Latin square design, viewed three videotapes of simulated classrooms in which teacher behavior was controlled (paralleling psychophysical procedures) to fit unambiguously into specific categories on ratings of frequency and variety of social…

Descriptors: Evaluation Methods, Observation, Pictorial Stimuli, Psychophysiology

A Closed Sequential Procedure for Answer-Until-Correct Tests.

Peer reviewed

Wilcox, Rand R. – Journal of Experimental Education, 1982

A closed sequential procedure for estimating true score is proposed for use with answer-until-correct tests. The accuracy of determining true score is the same as in conventional sequential solutions, but the possibility of using an unnecessarily large number of items is eliminated. (Author/CM)

Descriptors: Answer Sheets, Guessing (Tests), Item Banks, Measurement Techniques

A Comparison of Two Instruments for Evaluating Composition.

Barter, Alice K.; And Others – 1980

A follow-up study of two instruments for evaluating college writing was conducted. The experimental scale (E Scale) was developed in 1976 and revised for this study. The control scale (C Scale) was described in the literature in 1977. Ten English majors graded ten essays from diagnostic entrance exams. Both the E Scale and the C Scale were used,…

Descriptors: College Entrance Examinations, Comparative Testing, Essay Tests, Evaluation Criteria

An Empirical Comparison of Two-Stage and Pyramidal Adaptive Ability Testing.

Download full text

Larkin, Kevin C.; Weiss, David J. – 1975

A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…

Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs

The Use of Confidence Testing in Objective Tests.

Download full text

Echternacht, Gary – 1971

Confidence testing has been used in varying forms over the past 40 years as a method for increasing the amount of information available from objective test items. This paper traces the development of the procedure from Hevner's beginning method up to the various methods in use today and describes both the testing procedures and scoring methods…

Descriptors: Confidence Testing, Guessing (Tests), Individual Characteristics, Measurement Techniques

The Effect of Keying All Options Correct on Equating Functions and Scores.

Download full text

Lenel, Julia C.; Gilmer, Jerry S. – 1986

In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…

Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)

Factors Influencing the Psychometric Characteristics of an Adaptive Testing Strategy for Test Batteries.

Download full text

Maurelli, Vincent A.; Weiss, David J. – 1981

A monte carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory

An Empirical Study of the Broad Range Tailored Test of Verbal Ability.

Download full text

Kreitzberg, Charles B.; Jones, Douglas H. – 1980

The Broad-Range Tailored Test (BRTT) is a computerized adaptive test. Each testee responds to 25 items; at the conclusion of the test the computer calculates a verbal ability score for the individual. The test was designed to yield a verbal ability score from the fifth grade level to the graduate school level. Two forms of the BRTT were…

Descriptors: Adaptive Testing, Computer Assisted Testing, High School Students, Higher Education

Manual for the USES General Aptitude Test Battery. Section III: Development.

Download full text

Manpower Administration (DOL), Washington, DC. – 1970

This revised manual for the General Aptitude Test Battery (GATB) discusses: (1) historical development; (2) item analysis; (3) factor analysis; (4) physical format; (5) general working population norms (ages 18-54); (6) intercorrelations of raw GATB test scores and of GATB aptitude scores; (7) development of norms for specific occupations (tables…

Descriptors: Adults, Aptitude Tests, Citations (References), High Schools

Test Development and Technical Information on the Writing Section of the SAT Reasoning Test™. Research Notes RN-25

Download full text

Kobrin, Jennifer L.; Kimmel, Ernest W. – College Board, 2006

Based on statistics from the first few administrations of the SAT writing section, the test is performing as expected. The reliability of the writing section is very similar to that of other writing assessments. Based on preliminary validity research, the writing section is expected to add modestly to the prediction of college performance when…

Descriptors: Test Construction, Writing Tests, Cognitive Tests, College Entrance Examinations

Technical Report on Alberta Essay Scales: Models.

Nyberg, Verner R.; Nyberg, Adell M. – 1982

The supplementary information on "Alberta Essay Scales: Models" presented here includes similar models to employ in grading essays, the background and development of the scales, and the rationale for developing two scales of English mechanics and style/content. A standard is presented for evaluating current writing achievement by…

Descriptors: Academic Standards, Expository Writing, Foreign Countries, Grade 12

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational and Psychological…	18
Journal of Educational…	10
Applied Psychological…	6
Journal of Experimental…	3
Psychology in the Schools	3
Psychometrika	3
ETS Research Report Series	2
Educational Leadership	2
Evaluation and the Health…	2
Journal of Computer-Based…	2
Journal of Educational…	2
Accounting Education	1
Advances in Health Sciences…	1
American Educational Research…	1
Anatomical Sciences Education	1
Assessment & Evaluation in…	1
Assessment in Education:…	1
Child Abuse and Neglect: The…	1
College Board	1
Creativity Research Journal	1
Diagnostique	1
Educational Assessment	1
Educational Sciences: Theory…	1
English Language Teaching	1
Higher Education: The…	1
More ▼

Weiss, David J.	5
Echternacht, Gary	4
Frary, Robert B.	3
Hambleton, Ronald K.	3
Rippey, Robert M.	3
Albanese, Mark A.	2
Bejar, Issac I.	2
Cross, Lawrence H.	2
Frederiksen, Norman	2
Hakstian, A. Ralph	2
Huynh, Huynh	2
Kane, Michael T.	2
Kansup, Wanlop	2
Larkin, Kevin C.	2
Moloney, James M.	2
Reilly, Richard R.	2
Traub, Ross E.	2
Ward, William C.	2
Wilcox, Rand R.	2
Abedi, Jamal	1
Abramson, Paul R.	1
Abu-Sayf, F. K.	1
Acar, Selcuk	1
Aghbar, Ali-Asghar	1
More ▼