ERIC - Search Results

Showing all 8 results

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

New Directions in Matching Familiar Figures Test Research Resulting From Scoring and Item Analyses.

Brinzer, Raymond J. – 1979

The problem engendered by the Matching Familiar Figures (MFF) Test is one of instrument integrity (II). II is delimited by validity, reliability, and utility of MFF as a measure of the reflective-impulsive construct. Validity, reliability and utility of construct assessment may be improved by utilizing: (1) a prototypic scoring model that will…

Descriptors: Conceptual Tempo, Difficulty Level, Item Analysis, Research Methodology

Obtaining Some Degree of Correspondence Between Unequatable Scores: A Comparison of Item Response Theory and Equipercentile Equating Methods.

Yen, Wendy M. – 1982

Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…

Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods

The Merits of Multiple-Answer Items as Evaluated by Using Six Scoring Formulas.

Peer reviewed

Hsu, Tse-Chi; And Others – Journal of Experimental Education, 1984

The indices of item difficulty and discrimination, the coefficients of effective length, and the average item information for both single- and multiple-answer items using six different scoring formulas were computed and compared. These formulas vary in terms of the assignment of partial credit and the correction for guessing. (Author/BW)

Descriptors: College Entrance Examinations, Comparative Analysis, Difficulty Level, Guessing (Tests)

Consideration for Sample Size in Reliability Studies for Mastery Tests. Publication Series in Mastery Testing.

Saunders, Joseph C.; Huynh, Huynh – 1980

In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…

Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests

Applications of Adaptive Testing in Measuring Achievement and Performance.

Bejar, Issac I. – 1976

The concept of testing for partial knowledge is considered with the concept of tailored testing. Following the special usage of latent trait theory, the word validity is used to mean the correlation of a test with the construct the test measures. The concept of a method factor in the test is also considered as a part of the validity. The possible…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Confidence Testing

Interactive Computer Administration of a Spatial Reasoning Test. Research Report 80-2.

Church, Austin T.; Weiss, David J. – 1980

A pilot study on the development and administration of a test using a spatial reasoning problem, the 15-puzzle, is described. The test utilizes on-line capabilities of a real-time computer to record an examinee's progress on each problem through a sequence of problem-solving "moves", and to collect additional on-line data that might be…

Descriptors: Adaptive Testing, Cognitive Measurement, Computer Assisted Testing, Difficulty Level

Scoring and Analyzing Confidence Tests. Final Report.

Rippey, Robert M. – 1971

Technical improvements, which may be made in the reliability and validity of tests through confidence scores, are discussed. However, studies indicate that subjects do not handle their confidence uniformly. (MS)

Descriptors: Computer Programs, Confidence Testing, Correlation, Difficulty Level