NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 781 to 795 of 1,187 results Save | Export
van der Linden, Wim J.; Boekkooi-Timminga, Ellen – 1986
In order to estimate the classical coefficient of test reliability, parallel measurements are needed. H. Gulliksen's matched random subtests method, which is a graphical method for splitting a test into parallel test halves, has practical relevance because it maximizes the alpha coefficient as a lower bound of the classical test reliability…
Descriptors: Algorithms, Computer Assisted Testing, Computer Software, Difficulty Level
Zimmerman, Irla L.; Woo-Sam, James M. – 1982
Two kinds of WISC-R short forms, item reduction and subtest reduction, are reviewed in terms of their ability to meet these criteria of adequacy: a significant correlation between the full scale IQ and the short form IQ, a non-significant difference between the full and short form mean IQ, a low percentage of IQ classification changes resulting…
Descriptors: Intelligence Tests, Test Interpretation, Test Items, Test Reliability
Brinzer, Raymond J. – 1979
The problem engendered by the Matching Familiar Figures (MFF) Test is one of instrument integrity (II). II is delimited by validity, reliability, and utility of MFF as a measure of the reflective-impulsive construct. Validity, reliability and utility of construct assessment may be improved by utilizing: (1) a prototypic scoring model that will…
Descriptors: Conceptual Tempo, Difficulty Level, Item Analysis, Research Methodology
Berk, Ronald A. – 1979
As alternatives to the objectives-based approach to specifying content domains for test construction purposes, six strategies are proposed: (1) amplified objectives; (2) Instructional Objectives Exchange (IOX) test specifications; (3) item transformations; (4) item forms; (5) algorithms; and (6) mapping sentences. Their effectiveness is assessed…
Descriptors: Behavioral Objectives, Comparative Analysis, Criterion Referenced Tests, Evaluation Criteria
Peer reviewed Peer reviewed
Duncan, George T.; Milton, E. O. – Psychometrika, 1978
A multiple-answer multiple-choice test is one which offers several alternate choices for each stem and any number of those choices may be considered to be correct. In this article, a class of scoring procedures called the binary class is discussed. (Author/JKS)
Descriptors: Answer Keys, Measurement Techniques, Multiple Choice Tests, Scoring Formulas
Peer reviewed Peer reviewed
Weber, Margaret B. – Educational and Psychological Measurement, 1977
Bilevel dimensionality of probability was examined via factor analysis, Rasch latent trait analysis, and classical item analysis. Results suggest that when nonstandardized measures are the criteria for achievement, relying solely on estimates of content validity may lead to erroneous interpretation of test score data. (JKS)
Descriptors: Achievement, Achievement Tests, Factor Analysis, Item Analysis
Peer reviewed Peer reviewed
Chambers, David W. – Journal of Dental Education, 1988
A discussion of good test criteria reviews the basic concepts of test theory, examines four types of validity, outlines the concept of reliability and its coefficients and limitations, makes suggestions for gauging test quality, and demonstrates use of the standard error of measurement for estimating the likelihood of misgrading. (MSE)
Descriptors: Dental Schools, Higher Education, Professional Education, Statistical Analysis
Peer reviewed Peer reviewed
Garg, Rashmi; And Others – Journal of Educational Measurement, 1986
For the purpose of obtaining data to use in test development, multiple matrix sampling plans were compared to examinee sampling plans. Data were simulated for examinees, sampled from a population with a normal distribution of ability, responding to items selected from an item universe. (Author/LMO)
Descriptors: Difficulty Level, Monte Carlo Methods, Sampling, Statistical Studies
Peer reviewed Peer reviewed
Greer, Darryl – Review of Higher Education, 1984
The legislative history of legislation concerning disclosure of standardized college admissions test items is reviewed, the effect of existing laws in California and New York is outlined, and public policy and legal questions leading to and resulting from the legislation are discussed. (MSE)
Descriptors: College Entrance Examinations, Disclosure, Higher Education, Legal Problems
Fishman, Judith – Writing Program Administration, 1984
Examines the CUNY-WAT program and questions many aspects of it, especially the choice and phrasing of topics. (FL)
Descriptors: Essay Tests, Higher Education, Test Format, Test Items
Peer reviewed Peer reviewed
MisLevy, Robert J.; Bock, R. Darrell – Educational and Psychological Measurement, 1982
An alternative biweight estimator based on Tukey's is examined in which (1) test disturbances are not assumed to be the same for all subjects, (2) each response is utilized proportional to its value, and (3) the biweight and maximum likelihood estimate agree when no disturbances are present. Smaller mean-squared errors are shown. (Author/CM)
Descriptors: Error of Measurement, Estimation (Mathematics), Guessing (Tests), Latent Trait Theory
Peer reviewed Peer reviewed
Sharpley, C. F.; Cross, D. G. – Journal of Marriage and the Family, 1982
Examined one instrument devised to classify respondents for research purposes into high and low marital or dyadic adjustment groups. Data indicated that, while the overall scale performs the task reliably, the majority of its 32 items are unnecessary. Factor analysis revealed that there was one underlying "adjustment" dimension. (Author)
Descriptors: Adjustment (to Environment), Factor Analysis, Foreign Countries, Marriage
Peer reviewed Peer reviewed
Gross, Linda C.; Bevil, Catherine W. – Nursing Outlook, 1981
Describes the steps taken by City College School of Nursing (New York) in the development of nursing student placement tests. These steps include determining test items, use of multiple-choice questions, test revision, clinical performance tests, estimating test reliability, establishing standards, and using the tests. (CT)
Descriptors: Curriculum Development, Equivalency Tests, Higher Education, Nursing Education
Peer reviewed Peer reviewed
Karnes, Frances A.; Brown, K. Eliot – Psychology in the Schools, 1981
A study to develop a short form of the Wechsler Intelligence Scale for Children-Revised (WISC-R) for the intellectually gifted showed the Vocabulary and Block Design comprise the best two-subtest short form. The Similarities, Vocabulary, Block Design, and Object Assembly tetrad could be most useful in time and reliability. (Author)
Descriptors: Academically Gifted, Elementary Secondary Education, Intelligence Tests, Screening Tests
Peer reviewed Peer reviewed
Rusch, Reuben; Steiner, Judith – Journal of Experimental Education, 1979
The Selected Marker Tests were examined for scoring problems and internal consistency and were administered orally to sixth and seventh graders. Scoring problems were discovered and changes were suggested. The problem was found to be item reliability rather than interrater reliability. (Author/MH)
Descriptors: Cognitive Tests, Elementary Education, Item Analysis, Problem Solving
Pages: 1  |  ...  |  49  |  50  |  51  |  52  |  53  |  54  |  55  |  56  |  57  |  ...  |  80