Showing 826 to 840 of 1,187 results
Peer reviewed
Sciarone, A. G.; Schoorl, J. J. – Language Learning, 1989
Presents findings from an experiment that sought to determine the minimal number of blanks required to ensure parallelism in cloze tests, differing only in the point at which deletion starts. Results showed the required minimum depended on the scoring methods used, with exact-word tests requiring about 100 blanks and acceptable-word tests…
Descriptors: Cloze Procedure, Dutch, Indonesian, Reading Tests
Peer reviewed
Hancock, Gregory R.; And Others – Educational and Psychological Measurement, 1993
Two-option multiple-choice vocabulary test items are compared with comparably written true-false test items. Results from a study with 111 high school students suggest that multiple-choice items provide a significantly more reliable measure than the true-false format. (SLD)
Descriptors: Ability, High School Students, High Schools, Objective Tests
Peer reviewed
Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994
The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)
Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics
Peer reviewed
Armstrong, Ronald D.; And Others – Journal of Educational Statistics, 1994
A network-flow model is formulated for constructing parallel tests based on classical test theory while using test reliability as the criterion. Practitioners can specify a test-difficulty distribution for values of item difficulties as well as test-composition requirements. An empirical study illustrates the reliability of generated tests. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Difficulty Level, Item Banks
Peer reviewed
Segall, Daniel O. – Journal of Educational and Behavioral Statistics, 2004
A new sharing item response theory (SIRT) model is presented that explicitly models the effects of sharing item content between informants and test takers. This model is used to construct adaptive item selection and scoring rules that provide increased precision and reduced score gains in instances where sharing occurs. The adaptive item selection…
Descriptors: Scoring, Item Analysis, Item Response Theory, Adaptive Testing
Peer reviewed
Bliss, Stacy – Journal of Psychoeducational Assessment, 2006
The Test of Early Mathematics Ability--Third Edition (TEMA-3) is a norm-referenced parallel forms test intended to identify the level of mathematical ability for children aged 3 years 0 months through 8 years 11 months. According to the authors, the instrument can also be used as a criterion referenced or diagnostic tool for older students who are…
Descriptors: Test Reviews, Mathematics Tests, Norm Referenced Tests, Young Children
Peer reviewed
Güven, Bülent; Özbek, Özge – Turkish Online Journal of Educational Technology - TOJET, 2007
In the process of education, instead of classifying students according to their insufficiency, teachers should try to get to know them and determine their cognitive, sensorial, and kinetic characteristics. This study on improving learning style inventory, which aims to help classroom teachers determine students' attributes in individualized…
Descriptors: Cognitive Style, Instructional Design, Individualized Instruction, Interest Inventories
Wainer, Howard; And Others – 1991
It is sometimes sensible to think of the fundamental unit of test construction as being larger than an individual item. This unit, dubbed the testlet, must pass muster in the same way that items do. One criterion of a good item is the absence of differential item functioning (DIF). The item must function in the same way as all important…
Descriptors: Definitions, Identification, Item Bias, Item Response Theory
Crehan, Kevin D.; And Others – 1993
Among the measurement techniques receiving greater attention is the context-dependent item set or testlet. The context-dependent item set consists of a scenario and related test questions. This item format is generally believed to be able to tap higher level thinking. Unfortunately, this item form leads to inter-item dependence within item sets…
Descriptors: Comparative Analysis, Item Response Theory, Measurement Techniques, Reading Tests
Thompson, Lynn; And Others – 1988
A project is reported that developed a test for students in foreign language in the elementary school (FLES) programs. Relevant tests in Spanish and English as a Second Language (ESL) were reviewed in order to develop a listening and reading test that could determine achievement in a typical FLES curriculum. Pilot testing was conducted with 121…
Descriptors: FLES, Intermediate Grades, Language Tests, Second Language Instruction
Crehan, Kevin D.; And Others – 1989
Two issues in the writing of multiple-choice test items were investigated: a comparison of three versus four options; and the use of the inclusive "none of these" option versus a content option. Subjects were 220 introductory psychology students, who were enrolled at a large southwestern university, responding to a final examination in psychology…
Descriptors: College Students, Higher Education, Item Analysis, Multiple Choice Tests
Samejima, Fumiko – 1990
Test validity is a concept that has often been ignored in the context of latent trait models and in modern test theory, particularly as it relates to computerized adaptive testing. Some considerations about the validity of a test and of a single item are proposed. This paper focuses on measures that are population-free and that will provide local…
Descriptors: Adaptive Testing, Computer Assisted Testing, Equations (Mathematics), Item Response Theory
Shayne, Vivian T. – 1987
A target partition analysis (TPA) was used to help seven subject matter experts, with little expertise in testing, evaluate relationships among test items. The subjects were experts in Income Maintenance in the field of public assistance work; they were trainers affiliated with the Professional Development Program's Income Maintenance Training…
Descriptors: Content Analysis, Item Analysis, Models, Pretests Posttests
Sympson, J. Bradford; Haladyna, Thomas M. – 1988
A new approach to polychotomous scoring of test items, similar to "max-alpha" scaling (MAS) and known as polyweighting, has been developed. Unlike MAS, this new method of polychotomous scoring provides scoring weights for a given item that are independent of the difficulty of other items in the analysis. Moreover, the scoring weights are…
Descriptors: Computer Software, Difficulty Level, Item Analysis, Latent Trait Theory
Haladyna, Thomas M.; Downing, Steven M. – 1988
The proposition that the optimal number of options in a multiple choice test item is three was examined. The concept of functional distractor, a plausible wrong answer that is negatively discriminating when total test performance is the criterion, is discussed. Three distinct groups of achievers (high, middle, and low) on a national standardized…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Physicians