Showing 7,216 to 7,230 of 9,530 results
Peer reviewed
Chang, Lei – Applied Psychological Measurement, 1994
Reliability and validity of 4-point and 6-point scales were assessed using a new model-based approach to fit empirical data from 165 graduate students completing an attitude measure. Results suggest that the issue of four- versus six-point scales may depend on the empirical setting. (SLD)
Descriptors: Attitude Measures, Goodness of Fit, Graduate Students, Graduate Study
Peer reviewed
Kim, Seock-Ho; And Others – Applied Psychological Measurement, 1994
Type I error rates of F. M. Lord's chi square test for differential item functioning were investigated using Monte Carlo simulations with marginal maximum likelihood estimation and marginal Bayesian estimation algorithms. Lord's chi square did not provide useful Type I error control for the three-parameter logistic model at these sample sizes.…
Descriptors: Algorithms, Bayesian Statistics, Chi Square, Error of Measurement
Wilson, Audrey – Journal of Science and Mathematics Education in Southeast Asia, 1992
Reports a study to compare the performance of Australian university students on chemistry questions delivered in two different formats: multiple choice and grid. Concluded that students encounter more difficulties with questions presented in the grid format and that grid questions demand a deeper understanding of the topic. (MDH)
Descriptors: Academic Achievement, Chemistry, Cognitive Processes, Foreign Countries
Peer reviewed
Rocklin, Thomas – Applied Measurement in Education, 1992
College students rated dissimilarity of pairs of common test item formats. A multidimensional scaling model with individual differences fit to data from 111 students suggested that they used 2 dimensions to distinguish among the formats, 1 separating supply from selection items and 1 based on the number of options. (SLD)
Descriptors: Academic Ability, Academic Achievement, College Students, Higher Education
Peer reviewed
Cashin, William E.; Downey, Ronald G. – Journal of Educational Psychology, 1992
The usefulness of global items in predicting weighted-composite evaluations of teaching was evaluated with a sample of 17,183 classes from 105 institutions. Results suggest that, because global items account for a substantial amount of variance, a short evaluation form could capture much of the information needed for summative evaluation. (SLD)
Descriptors: College Students, Evaluation Methods, Higher Education, Predictive Measurement
Peer reviewed
Kim, Seock-Ho; Cohen, Allan S. – Journal of Educational Measurement, 1992
Effects of the following methods for linking metrics on detection of differential item functioning (DIF) were compared: (1) test characteristic curve method (TCC); (2) weighted mean and sigma method; and (3) minimum chi-square method. With large samples, results were essentially the same. With small samples, TCC was most accurate. (SLD)
Descriptors: Chi Square, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed
Cameron, Beverly J. – College Teaching, 1991
When college teachers are explicit about specific methods and strategies involved in effective thinking, students are more likely to learn and use these skills. Labeling test questions with the thinking skills required can help students refocus their study methods, resulting in more effective thinking, problem-solving, or decision-making skills.…
Descriptors: Classroom Techniques, College Instruction, Decision Making, Higher Education
Peer reviewed
Thissen, David; And Others – Journal of Educational Measurement, 1994
Restricted factor analysis shows that the multiple-choice and free-response sections of the Computer Science and Chemistry Advanced Placement examinations (College Board) measure the same proficiencies for the most part. There is a small degree of multidimensionality because of local dependence among free-response items. (SLD)
Descriptors: Advanced Placement, Chemistry, Computer Science, Factor Analysis
Peer reviewed
Huynh, Huynh; Ferrara, Steven – Journal of Educational Measurement, 1994
Equal percentile (EP) and partial credit (PC) equatings for raw scores from performance-based assessments with free-response items are compared through the use of data from the Maryland School Performance Assessment Program. Results suggest that EP and PC methods do not give equivalent results when distributions are markedly skewed. (SLD)
Descriptors: Comparative Analysis, Equated Scores, Mathematics Tests, Performance Based Assessment
Peer reviewed
Carey, Lou M.; And Others – Educational and Psychological Measurement, 1994
Effects of randomly distributing attitude-measurement items throughout a questionnaire (personality format) versus grouping together items from the same dimension (achievement format) on students' end-of-course evaluations were studied for 376 undergraduates. Advantages demonstrated for the achievement format in terms of statistical results,…
Descriptors: Attitude Measures, Course Evaluation, Evaluation Methods, Higher Education
Peer reviewed
Statman, Stella – System, 1992
Describes weaknesses of English-as-a-foreign-language (EFL) testing methods and argues that many EFL departments set up their own examinations to assess how effectively they are preparing students for those examinations. It is suggested that this leads to production of test items that are biased against divergent students and that an interview…
Descriptors: Departments, English (Second Language), English for Special Purposes, Higher Education
Peer reviewed
Albert, James H. – Journal of Educational Statistics, 1992
Estimating item parameters from a two-parameter normal ogive model is considered using Gibbs sampling to simulate draws from the joint posterior distribution of ability and item parameters. The method gives marginal posterior density estimates for any parameter of interest, as illustrated using data from a 33-item mathematics placement…
Descriptors: Algorithms, Bayesian Statistics, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed
Wainer, Howard; And Others – Journal of Educational Measurement, 1992
Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation
Peer reviewed
Crehan, Kevin D.; And Others – Educational and Psychological Measurement, 1993
Studies with 220 college students found that multiple-choice test items with 3 options are more difficult than those with 4 options, and items with the none-of-these option are more difficult than those without this option. Neither format manipulation affected item discrimination. Implications for test construction are discussed. (SLD)
Descriptors: College Students, Comparative Testing, Difficulty Level, Distractors (Tests)
Peer reviewed
Wilson, Mark; Masters, Geoffery N. – Psychometrika, 1993
A strategy is described for dealing with measurement situations in which certain response categories are null, that is, no persons respond in certain categories to certain items. The method is described for the partial credit model and maintains the integrity of the original response framework. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Response Theory, Mathematical Models