Showing 1 to 15 of 19 results
Peer reviewed
DeCarlo, Lawrence T. – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
Peer reviewed
Crowther, Gregory J.; Knight, Thomas A. – Advances in Physiology Education, 2023
The past approximately 15 years have seen increasing interest in defining disciplinary core concepts. Within the field of physiology, Michael, McFarland, Modell, and colleagues have published studies that defined physiology core concepts and have elaborated many of these as detailed conceptual frameworks. With such helpful definitions now in…
Descriptors: Test Format, Physiology, Higher Education, Concept Teaching
Peer reviewed
Liu, Chunyan; Kolen, Michael J. – Journal of Educational Measurement, 2018
Smoothing techniques are designed to improve the accuracy of equating functions. The main purpose of this study is to compare seven model selection strategies for choosing the smoothing parameter (C) for polynomial loglinear presmoothing and one procedure for model selection in cubic spline postsmoothing for mixed-format pseudo tests under the…
Descriptors: Comparative Analysis, Accuracy, Models, Sample Size
Peer reviewed
Abass, Olalere A.; Olajide, Samuel A.; Samuel, Babafemi O. – Turkish Online Journal of Distance Education, 2017
The traditional method of assessment (examination) is often characterized by examination questions leakages, human errors during marking of scripts and recording of scores. The technological advancement in the field of computer science has necessitated the need for computer usage in majorly all areas of human life and endeavors, education sector…
Descriptors: Computer Assisted Testing, Computer System Design, Test Format, Design Requirements
Peer reviewed
Becker, Kirk A.; Bergstrom, Betty A. – Practical Assessment, Research & Evaluation, 2013
The need for increased exam security, improved test formats, more flexible scheduling, better measurement, and more efficient administrative processes has caused testing agencies to consider converting the administration of their exams from paper-and-pencil to computer-based testing (CBT). Many decisions must be made in order to provide an optimal…
Descriptors: Testing, Models, Testing Programs, Program Administration
Peer reviewed
Webb, Mi-young Lee; Cohen, Allan S.; Schwanenflugel, Paula J. – Educational and Psychological Measurement, 2008
This study investigated the use of latent class analysis for the detection of differences in item functioning on the Peabody Picture Vocabulary Test-Third Edition (PPVT-III). A two-class solution for a latent class model appeared to be defined in part by ability because Class 1 was lower in ability than Class 2 on both the PPVT-III and the…
Descriptors: Item Response Theory, Test Items, Test Format, Cognitive Ability
Stocking, Martha L. – 1993
In the context of paper and pencil testing, the frequency of the exposure of items is usually controlled through policies that regulate both the reuse of test forms and the frequency with which a candidate may retake the test. In the context of computerized adaptive testing, where item pools are large and expensive to produce and testing can be on…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Models
McKinley, Robert L.; Way, Walter D. – 1992
An analysis of the skills necessary for performance on the Test of English as a Foreign Language (TOEFL) tends to support the view that there are important, although subtle, secondary dimensions present in the test. This research explored the feasibility of an item response theory (IRT) based method of modeling examinee performance on these…
Descriptors: Ability, Goodness of Fit, Identification, Item Response Theory
Peer reviewed
Sykes, Robert C.; Fitzpatrick, Anne R. – Journal of Educational Measurement, 1992
Explanations for an observed change in Rasch item parameters ("b" values) from consecutive administrations of a professional licensing examination were investigated. Analysis of covariance indicated that the change was not related to item position or type. It is hypothesized that the change is attributable to shifts in curriculum…
Descriptors: Analysis of Covariance, Change, Curriculum, Higher Education
Tatsuoka, Kikumi K. – 1991
Constructed-response formats are desired for measuring complex and dynamic response processes that require the examinee to understand the structures of problems and micro-level cognitive tasks. These micro-level tasks and their organized structures are usually unobservable. This study shows that elementary graph theory is useful for organizing…
Descriptors: Adult Literacy, Cognitive Measurement, Cognitive Processes, Constructed Response
Wang, Tianyou; Kolen, Michael J. – 1994
In this paper a quadratic curve equating method for different test forms under a random-group data-collection design is proposed. Procedures for implementing this method and related issues are described and discussed. The quadratic-curve method was evaluated with real test data (from two 30-item subtests for a professional licensure examination…
Descriptors: Comparative Analysis, Data Collection, Equated Scores, Goodness of Fit
Stone, Gregory Ethan – 1994
The quality of fit between the data and the measurement model is fundamental to any discussion of results. Fit has been the subject of inquiry since as early as the 1920s. Most early explorations concentrated on assessing global fit or subset fits on fixed length, traditional paper and pencil tests given as a single unit. The detection of aberrant…
Descriptors: Adaptive Testing, Computer Assisted Testing, Educational Assessment, Educational History
Bunch, Michael B. – 1982
Research evidence relating to the utility of the RMC evaluation models of compensatory education employing non-normed tests is examined. The history and evolution of five early models into the current norm-referenced model utilizing a non-normed test (Model A2), non-normed versions of a comparison model (Model B2), and the regression model (Model…
Descriptors: Compensatory Education, Correlation, Criterion Referenced Tests, Elementary Secondary Education
Peer reviewed
Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1994
The reliabilities of 2-, 3-, 4-, and 5-choice tests were compared through an incremental-option model on a test taken by 154 high school seniors. Creating the test forms incrementally more closely approximates actual test construction. The nonsignificant differences among the option choices support the three-option item. (SLD)
Descriptors: Distractors (Tests), Estimation (Mathematics), High School Students, High Schools
Way, Walter D.; And Others – 1992
This study provided an exploratory investigation of item features that might contribute to a lack of invariance of item parameters for the Test of English as a Foreign Language (TOEFL). Data came from seven forms of the TOEFL administered in 1989. Subjective and quantitative measures developed for the study provided consistent information related…
Descriptors: Ability, English (Second Language), Goodness of Fit, Item Response Theory