NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 7,231 to 7,245 of 9,530 results Save | Export
Peer reviewed Peer reviewed
Thissen, David; And Others – Journal of Educational Measurement, 1994
Restricted factor analysis shows that the multiple-choice and free-response sections of the Computer Science and Chemistry Advanced Placement examinations (College Board) measure the same proficiencies for the most part. There is a small degree of multidimensionality because of local dependence among free-response items. (SLD)
Descriptors: Advanced Placement, Chemistry, Computer Science, Factor Analysis
Peer reviewed Peer reviewed
Huynh, Huynh; Ferrara, Steven – Journal of Educational Measurement, 1994
Equal percentile (EP) and partial credit (PC) equatings for raw scores from performance-based assessments with free-response items are compared through the use of data from the Maryland School Performance Assessment Program. Results suggest that EP and PC methods do not give equivalent results when distributions are markedly skewed. (SLD)
Descriptors: Comparative Analysis, Equated Scores, Mathematics Tests, Performance Based Assessment
Peer reviewed Peer reviewed
Carey, Lou M.; And Others – Educational and Psychological Measurement, 1994
Effects of randomly distributing attitude-measurement items throughout a questionnaire (personality format) versus grouping together items from the same dimension (achievement format) on students' end-of-course evaluations were studied for 376 undergraduates. Advantages demonstrated for the achievement format in terms of statistical results,…
Descriptors: Attitude Measures, Course Evaluation, Evaluation Methods, Higher Education
Peer reviewed Peer reviewed
Statman, Stella – System, 1992
Describes weaknesses of English-as-a-foreign-language (EFL) testing methods and argues that many EFL departments set up their own examinations to assess how effectively they are preparing students for those examinations. It is suggested that this leads to production of test items that are biased against divergent students and that an interview…
Descriptors: Departments, English (Second Language), English for Special Purposes, Higher Education
Peer reviewed Peer reviewed
Albert, James H. – Journal of Educational Statistics, 1992
Estimating item parameters from a two-parameter normal ogive model is considered using Gibbs sampling to simulate draws from the joint posterior distribution of ability and item parameters. The method gives marginal posterior density estimates for any parameter of interest, as illustrated using data from a 33-item mathematics placement…
Descriptors: Algorithms, Bayesian Statistics, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed Peer reviewed
Wainer, Howard; And Others – Journal of Educational Measurement, 1992
Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation
Peer reviewed Peer reviewed
Crehan, Kevin D.; And Others – Educational and Psychological Measurement, 1993
Studies with 220 college students found that multiple-choice test items with 3 items are more difficult than those with 4 items, and items with the none-of-these option are more difficult than those without this option. Neither format manipulation affected item discrimination. Implications for test construction are discussed. (SLD)
Descriptors: College Students, Comparative Testing, Difficulty Level, Distractors (Tests)
Peer reviewed Peer reviewed
Wilson, Mark; Masters, Geoffery N. – Psychometrika, 1993
A strategy is described for dealing with measurement situations in which certain categories of responses are null, that is, persons do not respond in certain categories to certain items. The method is described for the partial credit model while maintaining the integrity of the original response framework. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Response Theory, Mathematical Models
Peer reviewed Peer reviewed
Dodd, Barbara G. – Applied Psychological Measurement, 1990
Using one simulated and two real data sets, the effects of the systematic variation of the item-selection procedure and the stepsize method on the operating characteristics of computerized adaptive testing (CAT) for instruments with polychotomously scored rating scale items were studied. The six rating scale CAT procedures used performed well.…
Descriptors: Adaptive Testing, Attitude Measures, Comparative Analysis, Computer Assisted Testing
Peer reviewed Peer reviewed
Koch, William R.; And Others – Measurement and Evaluation in Counseling and Development, 1990
Implemented computerized adaptive testing (CAT) to measure students' attitudes toward alcohol. Administered a paper-and-pencil version and a CAT version of an attitudes toward alcohol scale to 113 undergraduates enrolled in health education classes. Findings showed a high correlation between scores from the CAT and the paper-and-pencil versions.…
Descriptors: Adaptive Testing, College Students, Computer Assisted Testing, Drinking
Peer reviewed Peer reviewed
Case, Susan M.; Swanson, David B. – Teaching and Learning in Medicine, 1993
Extended matching, a test item format used currently in medical licensing examinations, is described. Procedures for writing and reviewing such test items are outlined, test development and psychometric advantages are discussed, and issues in test administration and scoring are examined. The extended matching form is also seen as having uses for…
Descriptors: Clinical Diagnosis, Decision Making, Higher Education, Licensing Examinations (Professions)
Peer reviewed Peer reviewed
Budescu, David; Bar-Hillel, Maya – Journal of Educational Measurement, 1993
Test taking and scoring are examined from the normative and descriptive perspectives of judgment and decision theory. The number-right scoring rule is endorsed because it discourages omissions and is robust against variability in respondent motivations, item vagaries, and limitations in judgments of uncertainty. (SLD)
Descriptors: Elementary Secondary Education, Guessing (Tests), Knowledge Level, Multiple Choice Tests
Peer reviewed Peer reviewed
Bridgeman, Brent; Rock, Donald A. – Journal of Educational Measurement, 1993
Exploratory and confirmatory factor analyses were used to explore relationships among existing item types and three new computer-administered item types for the analytical scale of the Graduate Record Examination General Test. Results with 349 students indicate constructs the item types are measuring. (SLD)
Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing
Peer reviewed Peer reviewed
DeMars, Christine E. – Applied Measurement in Education, 1998
Scores from mathematics (tested at 102 schools) and science (tested at 99 schools) sections of pilot forms of the Michigan High School Proficiency Test were examined for interaction between gender and response format (multiple choice or constructed response). Overall, neither males nor females seemed to be disadvantaged by item format. (SLD)
Descriptors: Constructed Response, High School Students, High Schools, Mathematics Tests
Peer reviewed Peer reviewed
Ferrara, Steven; Huynh, Huynh; Michaels, Hillary – Journal of Educational Measurement, 1999
Provides hypothesized explanations for local item dependence (LID) in a large-scale hands-on science performance assessment involving approximately 55,000 students each at grades 3, 5, and 8. Items that appear to elicit locally dependent responses require examinees to answer and explain their answers or to use given or generalized information to…
Descriptors: Context Effect, Elementary Education, Hands on Science, Junior High Schools
Pages: 1  |  ...  |  479  |  480  |  481  |  482  |  483  |  484  |  485  |  486  |  487  |  ...  |  636