NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 2,671 to 2,685 of 3,123 results Save | Export
Peer reviewed Peer reviewed
Salend, Spencer J. – Intervention in School and Clinic, 1995
Test modifications and techniques that teachers can employ to adapt their tests to meet individualized needs of mainstreamed students with disabilities are considered. Suggestions are offered to assist special education teachers in helping general educators design tests; address test reliability, validity, content, and format; and develop…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Learning Problems
Peer reviewed Peer reviewed
Haladyna, Thomas A. – Applied Measurement in Education, 1992
Several multiple-choice item formats are examined in the current climate of test reform. The reform movement is discussed as it affects use of the following formats: (1) complex multiple-choice; (2) alternate choice; (3) true-false; (4) multiple true-false; and (5) the context dependent item set. (SLD)
Descriptors: Cognitive Psychology, Comparative Testing, Context Effect, Educational Change
Peer reviewed Peer reviewed
Caudill, Steven B.; Gropper, Daniel M. – Journal of Economic Education, 1991
Presents a study of the effect of question order on student performance on economics tests. Reports that question order has no statistically significant effect on examination scores, even after including variables that reflect differential human capital characteristics. Concludes that instructors need not worry that some examination versions give…
Descriptors: Economics Education, Educational Research, Higher Education, Human Capital
Peer reviewed Peer reviewed
McKillip, Jack; And Others – Evaluation and Program Planning, 1992
Effects of question length and explicitness of directions (format) were examined for 97 university students answering open-ended consumer satisfaction questions at 9 health promotion workshops. Question format, but not length, was related to the usefulness of answers. Implications for use and construction of open-ended questions are discussed.…
Descriptors: Client Characteristics (Human Services), College Students, Consumer Economics, Evaluation Utilization
Peer reviewed Peer reviewed
Bennett, Randy Elliot; And Others – Journal of Educational Measurement, 1991
The relationship of multiple-choice and free-response items on the College Board's Advanced Placement Computer Science Examination was studied using confirmatory factor analysis. Results with 2 samples of 1,000 high school students suggested that the most parsimonious fit was achieved using a single factor. Implications for construct validity are…
Descriptors: Chi Square, College Entrance Examinations, Comparative Testing, Computer Science
Peer reviewed Peer reviewed
Skaggs, Gary; Lissitz, Robert W. – Journal of Educational Measurement, 1992
The consistency of several item bias detection methods was studied across different test administrations of the same items using data from a mathematics test given to approximately 6,600 eighth grade students in all. The Mantel Haenszel and item-response-theory-based sum-of-squares methods were the most consistent. (SLD)
Descriptors: Comparative Testing, Grade 8, Item Bias, Item Response Theory
Peer reviewed Peer reviewed
Birenbaum, Menucha; And Others – Applied Psychological Measurement, 1992
The effect of multiple-choice (MC) or open-ended (OE) response format on diagnostic assessment of algebra test performance was investigated with 231 eighth and ninth graders in Tel Aviv (Israel) using bug or rule space analysis. Both analyses indicated closer similarity between parallel OE subsets than between stem-equivalent OE and MC subsets.…
Descriptors: Algebra, Comparative Testing, Educational Assessment, Educational Diagnosis
Peer reviewed Peer reviewed
Harris, Abigail M.; Carlton, Sydell T. – Applied Measurement in Education, 1993
Differential item functioning on 6 forms of the Scholastic Aptitude Test was examined for 181,228 male and 198,668 female students focusing on the points tested, the test format, and subject matter in which items are embedded. Implications of the identifiable differences are discussed. (SLD)
Descriptors: College Entrance Examinations, Comparative Analysis, Females, High School Students
Peer reviewed Peer reviewed
DeMars, Christine E. – Applied Measurement in Education, 2000
Studied the effects of test consequences, response formats, gender, and ethnicity on the mathematics and science sections of the Michigan High School Proficiency Test. Results for more than 11,000 students show that students taking constructed response and multiple choice formats performed better under high stakes conditions. Discusses gender and…
Descriptors: Constructed Response, Ethnicity, High School Students, High Schools
Peer reviewed Peer reviewed
Direct linkDirect link
Schoenfeld, Alan H. – Measurement: Interdisciplinary Research and Perspectives, 2007
The authors of this volume's stimulus papers have taken on the challenge of developing measures of teachers' mathematical knowledge for teaching (MKT). This task involves multiple decisions and considerations, including: (1) How does one specify the body of knowledge being assessed? What warrants are offered for those choices?; (2) How does one…
Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research
Peer reviewed Peer reviewed
Direct linkDirect link
Yeh, Stuart S. – Educational Policy, 2006
The No Child Left Behind Act (NCLB) assumes that state-mandated tests provide useful information to school administrators and teachers. However, interviews with administrators and teachers suggest that Minnesota's tests, which are representative of the current generation of state-mandated tests, fail to provide useful information to administrators…
Descriptors: Federal Legislation, Educational Policy, Outcomes of Education, Accountability
Alderson, J. Charles; And Others – 1995
The guide is intended for teachers who must construct language tests and for other professionals who may need to construct, evaluate, or use the results of language tests. Most examples are drawn from the field of English-as-a-Second-Language instruction in the United Kingdom, but the principles and practices described may be applied to the…
Descriptors: Educational Trends, English (Second Language), Interrater Reliability, Language Tests
Smith, Robert L.; Carlson, Alfred B. – 1995
The feasibility of constructing test forms with practically equivalent cut scores using judges' estimates of item difficulty as target "statistical" specifications was investigated. Test forms with equivalent judgmental cut scores (based on judgments of item difficulty) were assembled using items from six operational forms of the…
Descriptors: Cutting Scores, Decision Making, Difficulty Level, Equated Scores
Myerberg, N. James – 1996
The Montgomery County (Maryland) public school system has started using assessments other than multiple-choice tests because it is felt that this will provide school staff with better information about the success of the instructional program. One of the ways assessments can provide better information is by having teachers score student papers.…
Descriptors: Accountability, Achievement Tests, Educational Assessment, Elementary Secondary Education
Donoghue, John R.; Mazzeo, John – 1995
At grades 8 and 12, the 1992 National Assessment of Educational Progress (NAEP) reading assessment contained a small number of 50-minute blocks in addition to the usual 25-minute blocks. To determine whether to incorporate the 50-minute blocks into the operational scaling, this study sought to determine whether the longer blocks measured a…
Descriptors: Chi Square, Goodness of Fit, Grade 12, Grade 8
Pages: 1  |  ...  |  175  |  176  |  177  |  178  |  179  |  180  |  181  |  182  |  183  |  ...  |  209