NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 2,401 to 2,415 of 3,093 results Save | Export
Peer reviewed Peer reviewed
Berk, Ronald A. – Journal of Personnel Evaluation in Education, 1988
Major factors beyond a teacher's control that can influence teacher effectiveness are identified as sources of invalidity related to the inference of teacher effectiveness from students' achievement gains. These factors include: (1) student characteristics; (2) school characteristics; (3) test validity; and (4) pretest-posttest design…
Descriptors: Academic Achievement, Achievement Gains, Elementary Secondary Education, Institutional Characteristics
Peer reviewed Peer reviewed
Aiken, Lewis R. – Educational and Psychological Measurement, 1989
Two alternatives to traditional item analysis and reliability estimation procedures are considered for determining the difficulty, discrimination, and reliability of optional items on essay and other tests. A computer program to compute these measures is described, and illustrations are given. (SLD)
Descriptors: College Entrance Examinations, Computer Software, Difficulty Level, Essay Tests
Peer reviewed Peer reviewed
Bloom, Benjamin S. – Teaching Education, 1988
A review of the development and innovation of testing at the higher education level focuses on Ralph Tyler's testing emphasis on the educational process and the objectives of instruction, a precursor to the current trend of integrating formative evaluation into the curriculum development process. (CB)
Descriptors: Achievement Tests, Evaluation Methods, Formative Evaluation, Higher Education
Peer reviewed Peer reviewed
Peterson, Raymond F.; And Others – Journal of Research in Science Teaching, 1989
Outlines a procedure used to develop a written diagnostic instrument to identify misconceptions and misunderstandings of the chemistry topic. Reports the characteristic data for the instrument, and student understanding of covalent bonding and structure. (Author/YP)
Descriptors: Chemical Bonding, Chemistry, Diagnostic Tests, Grade 11
Peer reviewed Peer reviewed
Baird, William E.; Silvern, Steven B. – Journal of Research on Computing in Education, 1992
Describes a study of college students that investigated the interaction between instructional mode and testing mode. Computer learning and testing versus paper-and-pencil methods are compared, and treatments for the experimental and control groups are described. Areas for further research are suggested. (20 references) (LRW)
Descriptors: Analysis of Variance, Comparative Analysis, Computer Assisted Instruction, Computer Assisted Testing
Peer reviewed Peer reviewed
Bergstrom, Betty A.; Lunz, Mary E. – Evaluation and the Health Professions, 1992
The level of confidence in pass/fail decisions obtained with computerized adaptive tests and paper-and-pencil tests was greater for 645 medical technology students when the computer adaptive test implemented a 90 percent confidence stopping rule than for paper-and-pencil tests of comparable length. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Confidence Testing
Peer reviewed Peer reviewed
Frisbie, David A. – Educational Measurement: Issues and Practice, 1992
Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of a MTF item may have several true items and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are a bit harder than multiple choice items for examinees. (SLD)
Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests
Peer reviewed Peer reviewed
Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1994
The reliabilities of 2-, 3-, 4-, and 5-choice tests were compared through an incremental-option model on a test taken by 154 high school seniors. Creating the test forms incrementally more closely approximates actual test construction. The nonsignificant differences among the option choices support the three-option item. (SLD)
Descriptors: Distractors (Tests), Estimation (Mathematics), High School Students, High Schools
Peer reviewed Peer reviewed
Hancock, Gregory R. – Journal of Experimental Education, 1994
To investigate the ability of multiple-choice tests to assess higher order thinking skills, examinations were constructed as half multiple choice and half constructed response. Results with 90 undergraduate and graduate students indicate that the 2 formats measure similar constructs at different levels of complexity. (SLD)
Descriptors: Cognitive Processes, Comparative Analysis, Constructed Response, Educational Assessment
Peer reviewed Peer reviewed
Feller, Morris – Studies in Educational Evaluation, 1994
The educational advantages of open-book testing are explored, considering the changes in teaching method that are necessary for open-book testing to be successful. How open-book testing should work is illustrated through the example of tests for Spanish-language classes. (SLD)
Descriptors: Educational Change, Elementary Secondary Education, Futures (of Society), Open Book Tests
Peer reviewed Peer reviewed
MacAvoy, Jim; Sidles, Craig – Journal of American Indian Education, 1991
Eighty Navajo-speaking students, aged 8-10, were administered free recall word lists in Navajo or English using a multitrial assessment format. Recall rates were higher in Trials 1 and 2 for those receiving Navajo words than for those receiving English words, but there were no differences on later trials. Contains 27 references. (SV)
Descriptors: American Indian Education, American Indians, Diagnostic Tests, Educational Diagnosis
Peer reviewed Peer reviewed
Tzeng, Oliver C. S.; And Others – Educational and Psychological Measurement, 1991
Measurement properties of two response formats (bipolar and unipolar ratings) in personality assessment were compared using data from 135 college students taking the Myers-Briggs Type Indicator (MBTI). Factorial validity and construct validity of the MBTI were supported. Reasons why the bipolar method is preferable are discussed. (SLD)
Descriptors: College Students, Comparative Testing, Construct Validity, Factor Analysis
Peer reviewed Peer reviewed
Shea, Judy A.; And Others – Evaluation and the Health Professions, 1992
Video and print formats of cardiovascular motion studies were compared for use as assessment measures of interpretive skills for 392 doctors taking a cardiovascular disease certification test. Although video studies were easier to interpret, the equivalence of both motion studies supports use of the print format in national examinations. (SLD)
Descriptors: Cardiovascular System, Comparative Testing, Graduate Medical Education, Interpretive Skills
Peer reviewed Peer reviewed
Benedict, Ralph H. B.; And Others – Psychological Assessment, 1992
The concurrent validities of 3 short forms of the Wechsler Adult Intelligence Scale (WAIS) were compared for their prediction of full-scale IQ for 145 male and 159 female psychiatric inpatients. Results support previous research showing better predictive accuracy for L. C. Ward's (1990) seven-subtest short form than the others. (SLD)
Descriptors: Adults, Comparative Testing, Concurrent Validity, Cost Effectiveness
Peer reviewed Peer reviewed
Friedman, Stephen J.; Ansley, Timothy N. – Journal of Experimental Education, 1990
To investigate the relationship between reading and listening test scores, 3 different sets of listening items accompanied by answer sheets requiring varying amounts of reading were administered to 1,200 students in grades 3 through 8. Listening scores increased as more printed information was added to the answer sheet. (SLD)
Descriptors: Answer Sheets, Comparative Testing, Elementary Education, Elementary School Students
Pages: 1  |  ...  |  157  |  158  |  159  |  160  |  161  |  162  |  163  |  164  |  165  |  ...  |  207