NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 8,026 to 8,040 of 9,530 results Save | Export
Bennett, Randy Elliot; And Others – 1988
This study developed, applied, and evaluated a theory-based method of detecting the underlying causes of differential difficulty. The method was applied to two subgroups taking the Scholastic Aptitude Test-Mathematics (SAT-M), 261 visually impaired students taking Braille forms of the test and 1,985 black students at 3 test administrations. It…
Descriptors: Black Students, Braille, Cluster Analysis, Difficulty Level
Rachor, Robert E.; Gray, George T. – 1996
Two frequently cited guidelines for writing multiple choice test item stems are: (1) the stem can be written in either a question or statement-to-be-completed format; and (2) only positively worded stems should be used. These guidelines were evaluated in a survey of the test item banks of 13 nationally administered examinations in the physician…
Descriptors: Allied Health Personnel, Difficulty Level, High Achievement, Item Banks
Schedl, Mary; And Others – 1996
The issue of what exactly is measured by different types of reading items has been a matter of interest in the field of reading research for many years. Language teaching and testing specialists have raised the question of whether a reading test for foreign students wishing to enter a university in the United States should include questions…
Descriptors: Adults, English (Second Language), Factor Analysis, Factor Structure
Leonard, John D. – 1996
In the context of the development of a possible national mathematics assessment, a study was conducted to determine whether a test item characterization scheme could be created based on a state policy document that serves as the driving force behind large-scale performance assessment. Further considerations were whether such an item…
Descriptors: Cost Effectiveness, Educational Policy, Elementary Secondary Education, Mathematics Education
Bennett, Randy Elliot; And Others – 1991
This study investigated the convergent validity of expert-system scores for four mathematical constructed-response item formats. A five-factor model was proposed comprised of four constructed-response format factors and a Graduate Record Examinations (GRE) General Test quantitative factor. Subjects were drawn from examinees taking a single form of…
Descriptors: College Students, Constructed Response, Correlation, Expert Systems
Chang, Lei – 1996
It was hypothesized that, when compared to the Angoff method (W. H. Angoff, 1971), the Nedelsky method (L. Nedelsky, 1954) for standard setting had lower intrajudge inconsistency, lower cutscores, and lower cutscores especially for items presenting challenges to the judges. These hypotheses were tested and supported in a sample of 22 graduate…
Descriptors: Comparative Analysis, Cutting Scores, Difficulty Level, Distractors (Tests)
Bunderson, C. Victor; And Others – 1988
Educational measurement is undergoing a revolution due to the rapid dissemination of information-processing technology. The recent growth in computing resources and their widespread dissemination in daily life have brought about irreversible changes in educational measurement. Recent developments in computerized measurement are summarized by…
Descriptors: Academic Achievement, Adaptive Testing, Computer Assisted Testing, Educational Assessment
Wang, Xiang Bo – 1996
It has been found that when examinees are allowed to choose a subset of constructed response (CR) items to answer on a test, they tend to choose differently and often perform lower on more popularly chosen items. The purpose of this study was to examine this finding. Using an experiment that incorporated a revised Advanced Placement (AP) Chemistry…
Descriptors: Chemistry, College Students, Constructed Response, Curriculum
Scheuneman, Janice Dowd; And Others – 1997
As part of the research leading to the implementation of computer-based case simulations (CCS) for the licensing examinations of the National Board of Medical Examiners, gender differences in performance were studied for one form consisting of 18 cases. A secondary purpose of the study was to note differences in style or approach that might…
Descriptors: Case Method (Teaching Technique), Case Studies, Cognitive Processes, Computer Simulation
Ohio State Dept. of Education, Columbus. – 1995
This practice test for the Ohio Ninth-grade Proficiency Tests consists of items similar to those that appear on the proficiency test. The writing section contains a prompt that asks the student to write about a hero or heroine. The reading test contains questions based on four reading selections and other reading skill questions not based on a…
Descriptors: Achievement Tests, Citizenship, Grade 9, Graduation Requirements
PDF pending restoration PDF pending restoration
Alberta Dept. of Education, Edmonton. Language Services Branch. – 1995
The French as a Second Language model tests for junior high school instruction presented here were designed to evaluate students' language performance as outlined in the learner expectations of the Alberta (Canada) second language curriculum. They focus on the specific fields of experience of these levels, but may be adapted for local contexts. An…
Descriptors: Behavioral Objectives, Classroom Techniques, Evaluation Criteria, Evaluation Methods
Alberta Dept. of Education, Edmonton. Language Services Branch. – 1994
The French as a Second Language model test (intermediate level) was designed as a criterion-referenced test using established criteria to measure attainment of learner expectations outlined in the Alberta (Canada) second language curriculum. It is intended for use at the junior and senior high school levels. Each test is based on an organizing…
Descriptors: Behavioral Objectives, Classroom Techniques, Criterion Referenced Tests, Evaluation Methods
Meijer, Rob R. – 1994
In person-fit analysis, the object is to investigate whether an item score pattern is improbable given the item score patterns of the other persons in the group or given what is expected on the basis of a test model. In this study, several existing group-based statistics to detect such improbable score patterns were investigated, along with the…
Descriptors: Achievement Tests, Classification, College Students, Cutting Scores
van der Linden, Wim J.; Zwarts, Michel A. – 1994
It is argued that judgments in evaluative research are ultimately subjective, but that good criteria are available to assess their quality. One of these criteria is the robustness of the judgments against incompleteness or uncertainty in the data used to describe the educational system. The use of the robustness criterion is demonstrated through…
Descriptors: Ability, Case Studies, Criteria, Decision Making
Koretz, Daniel; And Others – 1993
Patterns of nonresponse, representing items not reached by test takers or omitted because of difficulty, were addressed for all three age groups (grade 4, grade 8, and grade 12) taking the 1990 National Assessment of Educational Progress (NAEP). The analysis considered nonresponse rates for each item in the seven blocks of items on which reported…
Descriptors: Elementary Secondary Education, Grade 12, Grade 4, Grade 8
Pages: 1  |  ...  |  532  |  533  |  534  |  535  |  536  |  537  |  538  |  539  |  540  |  ...  |  636