NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 31 to 45 of 47 results Save | Export
Kalisch, Stanley J. – 1974
A tailored testing model employing the beta distribution, whose mean equals the difficulty of an item and whose variance is approximately equal to the sampling variance of the item difficulty, and employing conditional item difficulties, is proposed. The model provides a procedure by which a minimum number of items of a test, consisting of a set…
Descriptors: Adaptive Testing, Branching, Computer Oriented Programs, Decision Making
Theunissen, Phiel J. J. M. – 1983
Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…
Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis
PDF pending restoration PDF pending restoration
Kriewall, Thomas E. – 1972
The measurement information generated by CRT's is designed for use in instructional management systems where classifications of pupils for treatment are to be decided on the basis of minimal data consistent with predetermined limits for the errors of misclassification. The measures obtained are content specific estimates of proficiency useful for…
Descriptors: Ability Grouping, Academic Achievement, Criterion Referenced Tests, Decision Making
Brown, James Dean – 1983
This study attempted to determine the effectiveness of cloze procedures as norm-referenced instruments by comparing the differential responses of four groups of college students of English as a second language on two identical cloze passages. The responses were scored using both exact-answer and acceptable-word methods. The results indicate that…
Descriptors: Cloze Procedure, College Students, Comparative Analysis, English (Second Language)
Ward, William C.; Frederiksen, Norman – 1977
This study provides preliminary evidence as to the validity of measures derived from the "Tests of Scientific Thinking" (TST). The TST and the Graduate Record Examinations (GRE) were compared with regard to their relationships to interests, self-appraisals, and accomplishments of students during their first year of graduate work in…
Descriptors: Academic Achievement, College Entrance Examinations, Creative Thinking, Creativity Tests
Boyd, Thomas A.; Tramontana, Michael G. – 1984
To examine the validity of short forms of the Wechsler Intelligence Scale for Children-Revised (WISC-R), the WISC-R was first administered to 106 hospitalized psychiatric patients, aged 8-16. No subjects had a primary diagnosis of mental retardation or learning disability, and one-third were receiving psychotropic medication. WISC-R IQ scores…
Descriptors: Adolescents, Children, Correlation, Elementary Secondary Education
Gillmore, Gerald M. – 1979
It is argued in this paper that generalizability theory provides a uniquely useful framework for defining and quantifying the dependability of data for decision making. It does so by requiring careful specification of the conditions of measurement and the anticipated sources of variation in the results of the measurement procedure. A distinction…
Descriptors: Analysis of Variance, Criterion Referenced Tests, Decision Making, Educational Assessment
Swezey, Robert W.; Pearlstein, Richard B. – 1975
This manual outlines the rationale for using the Criterion Referenced Test (CRT) approach and suggests specific guidelines for test developers to use in constructing test items. Methods for assessing the adequacy of a CRT are also provided. (Author/RC)
Descriptors: Behavioral Objectives, Check Lists, Comparative Analysis, Criterion Referenced Tests
Mason, Victor W. – 1986
Reading skills are crucial to students learning and using English as a second language for academic purposes. Teachers can construct valid reading tests if they approach the task with care and focus on the test's ability to measure construct rather than face validity. In reading tests, the crucial elements of test design affecting validity are (1)…
Descriptors: Communicative Competence (Languages), English for Academic Purposes, English (Second Language), Higher Education
Haladyna, Tom – 1976
The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…
Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis
Kohr, Richard L., Comp.; And Others – 1983
This guide begins with a series of questions and answers that introduce Pennsylvania's Educational Quality Assessment (EQA) Inventory as a 188- to 190-item multiple-choice test for grades 5, 8, and 11. Items are selected from a 400-item bank using matrix sampling procedures. Test results are analyzed at the school level; no individual student…
Descriptors: Achievement Tests, Affective Measures, Affective Objectives, Basic Skills
Hively, Wells, Ed. – 1974
The central assumption in domain-referenced testing (DRT), as presented in this book, is that a domain may be determined which adequately represents a particular universe of knowledge. After a domain has been established, the technological and practical problem of using domain-referenced testing must be solved. This book contains a collection of…
Descriptors: Accountability, Behavior Change, Behavioral Objectives, Criterion Referenced Tests
Kohr, Richard L., Comp.; And Others – 1979
This guide begins with a series of questions and answers which introduce Pennsylvania's Educational Quality Assessment (EQA) Inventory as a 188-to 190-item multiple choice test for fifth, eighth, and eleventh grades. Items are selected from a 400-item bank using matrix sampling procedures. Test results are analyzed at the school level; no…
Descriptors: Achievement Tests, Affective Measures, Affective Objectives, Basic Skills
de Jong, John H. A. L. – 1982
The development and validation of a test of listening comprehension for English as a second language at the Dutch National Institute for Educational Measurement (Cito) is described. The test uses two distinct item formats: true-false items and modified cloze items with two options. Both item formats were found to measure foreign language listening…
Descriptors: Cloze Procedure, English (Second Language), Evaluation Criteria, Foreign Countries
Pages: 1  |  2  |  3  |  4