Showing all 14 results
Peer reviewed
Mo, Ya; Carney, Michele; Cavey, Laurie; Totorica, Tatia – Applied Measurement in Education, 2021
There is a need for assessment items that assess complex constructs but can also be efficiently scored for evaluation of teacher education programs. In an effort to measure the construct of teacher attentiveness in an efficient and scalable manner, we are using exemplar responses elicited by constructed-response item prompts to develop…
Descriptors: Protocol Analysis, Test Items, Responses, Mathematics Teachers
Peer reviewed
Morgan, Grant B.; Moore, Courtney A.; Floyd, Harlee S. – Journal of Psychoeducational Assessment, 2018
Although content validity--how well each item of an instrument represents the construct being measured--is foundational in the development of an instrument, statistical validity is also important to the decisions that are made based on the instrument. The primary purpose of this study is to demonstrate how simulation studies can be used to assist…
Descriptors: Simulation, Decision Making, Test Construction, Validity
Peer reviewed
Schott, G. R.; Bellin, W. – Evaluation & Research in Education, 2001
Developed an approach to account for the impact of item presentation on ensuing constructs in the development of two versions of a self-report measure, the Relational Concept Scale, that was tested with 978 adolescent students in the United Kingdom. Outlines benefits of developing two versions of the scale to protect against presentational bias.…
Descriptors: Adolescents, Foreign Countries, Statistical Bias, Test Construction
Peer reviewed
Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
Results of 96 theoretical/empirical studies were reviewed to see if they support a taxonomy of 43 rules for writing multiple-choice test items. The taxonomy is the result of an analysis of 46 textbooks dealing with multiple-choice item writing. For nearly half of the rules, no research was found. (SLD)
Descriptors: Classification, Literature Reviews, Multiple Choice Tests, Test Construction
Haladyna, Thomas M. – 1999
This book explains writing effective multiple-choice test items and studying responses to items to evaluate and improve them, two topics that are very important in the development of many cognitive tests. The chapters are: (1) "Providing a Context for Multiple-Choice Testing"; (2) "Constructed-Response and Multiple-Choice Item Formats"; (3)…
Descriptors: Constructed Response, Multiple Choice Tests, Test Construction, Test Format
Peer reviewed
Crehan, Kevin; Haladyna, Thomas M. – Journal of Experimental Education, 1991
Two item-writing rules were tested: phrasing stems as questions versus partial sentences; and using the "none-of-the-above" option instead of a specific content option. Results with 228 college students do not support the use of either stem type and provide limited evidence to caution against the "none-of-the-above" option.…
Descriptors: College Students, Higher Education, Multiple Choice Tests, Test Construction
Hambleton, Ronald K.; Patsula, Liane – 2000
Whatever the purpose of test adaptation, questions arise concerning the validity of inferences from such adapted tests. This paper considers several advantages and disadvantages of adapting tests from one language and culture to another. The paper also reviews several sources of error or invalidity associated with adapting tests and suggests ways…
Descriptors: Cross Cultural Studies, Cultural Awareness, Quality of Life, Test Construction
Peer reviewed
Williams, Janet L. – RSR: Reference Services Review, 2000
Discusses the basic concepts of testing and item development and the application of alternative assessments to information literacy content for library instruction. Topics include reliability; validity; statistical analysis; selected response, including checklists, rank order, or simple match; constructed response; essays; and complex assessments.…
Descriptors: Essays, Evaluation Methods, Information Literacy, Library Instruction
Peer reviewed
Silvestrone, Judy M. – New Directions for Teaching and Learning, 2004
Whether in the science or language laboratory, carrying out health care procedures or demonstrating performance arts, faculty can improve skill evaluation through transparency and authenticity in exam construction, format, and grading.
Descriptors: Language Laboratories, Performance Based Assessment, Validity, Reliability
Mott, Michael S.; Halpin, Regina – 1999
The reliability and developmental and concurrent validity of the Writing What You Read (WWYR) rubric, designed for use with paper and pen, for hypermedia-authored narrative productions of students in grades 2 and 3 were studied. Sixty students from 4 classrooms produced hypermedia narratives (interactive multimedia presentations) that were rated…
Descriptors: Comparative Analysis, Computer Assisted Instruction, Computer Software, Elementary School Students
Mott, Michael S.; Hare, R. Dwight – 1999
This study investigated the reliability and developmental and concurrent validity of the Writing What You Read (WWYR) rubric, an instrument originally designed for use with paper-and-pen-created narratives, for hypermedia productions of students in grades 2 and 3. Four teachers guided their students in a 3-month-long hypermedia/process writing…
Descriptors: Comparative Analysis, Computer Assisted Instruction, Computer Software, Elementary Education
Peer reviewed
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Baker, Eva L.; O'Neil, Harold F. – 1985
This paper presents a discussion of outcome assessment that puts into context how measurement has evolved to its present state. Several types of testing and assessment options are considered against a background of validity. Criterion-referenced measurement is discussed extensively in terms of history, field study, identity problems, intellectual…
Descriptors: Criterion Referenced Tests, Educational Assessment, Educational Technology, Elementary Secondary Education
Maihoff, N. A.; Mehrens, Wm. A. – 1985
A comparison is presented of alternate-choice and true-false item forms used in an undergraduate natural science course. The alternate-choice item is a modified two-choice multiple-choice item in which the two responses are included within the question stem. This study (1) compared the difficulty level, discrimination level, reliability, and…
Descriptors: Classroom Environment, College Freshmen, Comparative Analysis, Comparative Testing