ERIC - Search Results

Showing all 11 results

Reliability of Multiple Classifications.

Peer reviewed

Huynh, Huynh – Psychometrika, 1978

The use of Cohen's kappa index as a measure of the reliability of multiple classifications is developed. Special cases of the index as well as the effects of test length on the index are also explored. (JKS)

Descriptors: Career Development, Classification, Mastery Tests, Test Length

On False-Positive and False-Negative Decisions with a Mastery Test.

Wilcox, Rand R. – 1980

Wilcox (1977) examines two methods of estimating the probability of a false-positive or false-negative decision with a mastery test. Both procedures make assumptions about the form of the true score distribution which might not give good results in all situations. In this paper, upper and lower bounds on the two possible error types are described…

Descriptors: Cutting Scores, Mastery Tests, Mathematical Models, Student Placement

Contributions to Criterion-Referenced Testing Technology.

Peer reviewed

Hambleton, Ronald K., Ed. – Applied Psychological Measurement, 1980

This special issue covers recent technical developments in the field of criterion-referenced testing. An introduction, six papers, and two commentaries dealing with test development, test score uses, and evaluation of scores review relevant literature, offer new models and/or results, and suggest directions for additional research. (SLD)

Descriptors: Criterion Referenced Tests, Mastery Tests, Measurement Techniques, Standard Setting (Scoring)

Consideration for Sample Size in Reliability Studies for Mastery Tests. Publication Series in Mastery Testing.

Saunders, Joseph C.; Huynh, Huynh – 1980

In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…

Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests

Practical Procedures for Constructing Mastery Tests to Minimize Errors of Classification and to Maximize or Optimize Decision Reliability.

Byars, Alvin Gregg – 1980

The objectives of this investigation are to develop, describe, assess, and demonstrate procedures for constructing mastery tests to minimize errors of classification and to maximize decision reliability. The guidelines are based on conditions where item exchangeability is a reasonable assumption and the test constructor can control the number of…

Descriptors: Cutting Scores, Difficulty Level, Grade 4, Intermediate Grades

Effects of Test Length and Advancement Score on Several Criterion-Referenced Test Reliability and Validity Indices. Laboratory of Psychometric and Evaluation Research Report No. 86.

Eignor, Daniel R.; Hambleton, Ronald K. – 1979

The purpose of the investigation was to obtain some relationships among (1) test lengths, (2) shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of variables under study were set to be…

Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores

Criterion-Referenced Testing and Measurement: A Review of Technical Issues and Developments

Peer reviewed

Hambleton, Ronald K.; And Others – Review of Educational Research, 1978

Reviewing psychometric and statistical developments in criterion-referenced testing, this paper presents six sections: uses of criterion-referenced test scores, reliability of criterion-referenced test scores, determination of test length, determination of cut-off scores, test development and validation, and summary and suggestions for further…

Descriptors: Criterion Referenced Tests, Cutting Scores, Mastery Tests, Mathematical Models

Evaluation of Criterion-Referenced Reliability Coefficients. Final Report.

Subkoviak, Michael J. – 1977

Four different procedures were used for estimating the proportion of persons who would be classified consistently as either passing both of two parallel tests or failing both. These four methods were applied at each of four different mastery level scores for each of three different length tests. Data were based on 50 replications of each procedure…

Descriptors: Criterion Referenced Tests, Cutting Scores, Data Analysis, Data Collection

Test-Retest Consistency of Computer Adaptive Tests.

Lunz, Mary E.; And Others – 1990

This study explores the test-retest consistency of computer adaptive tests of varying lengths. The testing model used was designed as a mastery model to determine whether an examinee's estimated ability level is above or below a pre-established criterion expressed in the metric (logits) of the calibrated item pool scale. The Rasch model was used…

Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing

An Approach to Measuring the Achievement or Proficiency of an Examinee.

Wilcox, Rand R. – 1979

Mastery tests are analyzed in terms of the number of skills to be mastered and the number of items per skill, in order that correct decisions of mastery or nonmastery will be made to a desired degree of probability. It is assumed that a random sample of skills will be selected for measurement, that each skill will be measured by the same number of…

Descriptors: Achievement Tests, Cutting Scores, Decision Making, Equivalency Tests

Computer Application Issues in Certification and Licensure Testing.

Harnisch, Delwyn L. – 1985

Computer adaptive testing systems are feasible for certification and licensure testing. This is in part due to the availability of extensive yet inexpensive computers. Modern item response theory, combined with computerized adaptive testing, yields a powerful new method of testing which provides greater accuracy and efficiency and less boredom for…

Descriptors: Adaptive Testing, Certification, Computer Assisted Testing, Cost Effectiveness