NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 6,421 to 6,435 of 16,724 results Save | Export
Peer reviewed Peer reviewed
Reckase, Mark D. – Educational Measurement: Issues and Practice, 1998
Considers what a responsible test developer would do to gain information to support the consequential basis of validity for a test early in the development. How the consequential basis of validity of the program would be monitored and reported during the life of the program is examined. The validity of the ACT Assessment is considered as if it…
Descriptors: Evaluation Methods, Program Evaluation, Test Construction, Validity
Peer reviewed Peer reviewed
Dorans, Neil J.; Holland, Paul W. – Journal of Educational Measurement, 2000
Studied the degree to which equating functions failed to demonstrate population invariance across subpopulations, using two root-mean-square difference measures of the degree to which functions used to link two tests computed on subpopulations differ from the linking function for the whole population. Illustrated the ideas using data from the…
Descriptors: College Entrance Examinations, Equated Scores, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Goldwater, Bram C.; Grabavac, Diana M.; Acker, Loren E. – International Journal of Testing, 2005
Regular testing can serve as an incentive for students to keep up with their readings, but the time and effort involved in composing and grading frequent tests can serve as an equally strong disincentive to time-strapped instructors. We describe a distorted-item (DI) test that simply requires excerpting sentences or phrases from the assigned…
Descriptors: Objective Tests, Test Construction, Testing, Student Attitudes
Peer reviewed Peer reviewed
Direct linkDirect link
Huitzing, Hiddo A. – Journal of Educational Measurement, 2004
In optimal assembly of tests from item banks, linear programming (LP) models have proved to be very useful. Assembly by hand has become nearly impossible, but these LP techniques are able to find the best solutions, given the demands and needs of the test to be assembled and the specifics of the item bank from which it is assembled. However,…
Descriptors: Mathematical Applications, Test Construction, Item Banks, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Oakland, Thomas; Lane, Holly B. – International Journal of Testing, 2004
Issues pertaining to language and reading while developing and adapting tests are examined. Strengths and limitations associated with the use of readability formulas are discussed. Their use should be confined to paragraphs and longer passages, not items. Readability methods that consider both quantitative and qualitative variables and are…
Descriptors: Test Content, Readability, Readability Formulas, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Stein, Mary; Barman, Charles R.; Larrabee, Timothy – Journal of Science Teacher Education, 2007
This article describes the rationale for, and development of, an online instrument that helps identify commonly held science misconceptions. Science Beliefs is a 47-item instrument that targets topics in chemistry, physics, biology, earth science, and astronomy. It utilizes a true or false, along with a written-explanation, format. The true or…
Descriptors: Misconceptions, Scientific Concepts, Chemistry, Physics
Peer reviewed Peer reviewed
Direct linkDirect link
Nick, Sabine; Nather, Christian – Journal of Chemical Education, 2007
In July 2004 the 36th International Chemistry Olympiad was held in Kiel, Germany. Competition for medals included 236 students from 61 countries, accompanied by about 150 teachers and other mentors. During this Olympiad the students performed qualitative and quantitative analyses of a superconductor, based on lanthanum barium cuprate. In the…
Descriptors: Chemistry, Test Construction, Science Tests, Competition
Peer reviewed Peer reviewed
Direct linkDirect link
Zamboanga, Byron L.; Padilla-Walker, Laura M.; Hardy, Sam A.; Thompson, Ross A.; Wang, Sherry C. – Teaching of Psychology, 2007
We examined how academic background and course involvement differentially predicted students' performance on lecture- and text-based exam questions (N = 114; 34% men; 76% freshmen). Results showed that academic background and course involvement predicted performance on lecture-based questions and overall exam performance, whereas academic…
Descriptors: Test Construction, Academic Achievement, Prediction, Teaching Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Attali, Yigal; Powers, Don; Freedman, Marshall; Harrison, Marissa; Obetz, Susan – ETS Research Report Series, 2008
This report describes the development, administration, and scoring of open-ended variants of GRE® Subject Test items in biology and psychology. These questions were administered in a Web-based experiment to registered examinees of the respective Subject Tests. The questions required a short answer of 1-3 sentences, and responses were automatically…
Descriptors: College Entrance Examinations, Graduate Study, Scoring, Test Construction
Bloom, Howard; Zhu, Pei; Jacob, Robin; Raudenbush, Stephen; Martinez, Andres; Lin, Fen – MDRC, 2008
This paper provides practical guidance for researchers who are designing studies that randomize groups to measure the impacts of interventions on children. To do so, the paper: (1) provides new empirical information about the values of parameters that influence the precision of impact estimates (intra-class correlations and R-squares); (2)…
Descriptors: Pilot Projects, Research Methodology, Intervention, Sampling
Peer reviewed Peer reviewed
Direct linkDirect link
Allalouf, Avi; Abramzon, Andrea – Language Assessment Quarterly, 2008
Differential item functioning (DIF) analysis can be used to great advantage in second language (L2) assessments. This study examined the differences in performance on L2 test items between groups from different first language backgrounds and suggested ways of improving L2 assessments. The study examined DIF on L2 (Hebrew) test items for two…
Descriptors: Test Items, Test Format, Second Language Learning, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Do-Hong; Huynh, Huynh – Educational and Psychological Measurement, 2008
The current study compared student performance between paper-and-pencil testing (PPT) and computer-based testing (CBT) on a large-scale statewide end-of-course English examination. Analyses were conducted at both the item and test levels. The overall results suggest that scores obtained from PPT and CBT were comparable. However, at the content…
Descriptors: Reading Comprehension, Computer Assisted Testing, Factor Analysis, Comparative Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Othman, Jazilah; Treagust, David F.; Chandrasegaran, A. L. – International Journal of Science Education, 2008
A thorough understanding of chemical bonding requires familiarity with the particulate nature of matter. In this study, a two-tier multiple-choice diagnostic instrument consisting of ten items (five items involving each of the two concepts) was developed to assess students' understanding of the particulate nature of matter and chemical bonding so…
Descriptors: Chemistry, Foreign Countries, Grade 9, Grade 10
Peer reviewed Peer reviewed
Direct linkDirect link
O'Donnell, Meaghan L.; Creamer, Mark C.; Parslow, Ruth; Elliott, Peter; Holmes, Alexander C. N.; Ellen, Steven; Judson, Rodney; McFarlane, Alexander C.; Silove, Derrick; Bryant, Richard A. – Journal of Consulting and Clinical Psychology, 2008
Posttraumatic stress disorder (PTSD) and major depressive episode (MDE) are frequent and disabling consequences of surviving severe injury. The majority of those who develop these problems are not identified or treated. The aim of this study was to develop and validate a screening instrument that identifies, during hospitalization, adults at high…
Descriptors: Posttraumatic Stress Disorder, Injuries, Patients, Measures (Individuals)
Becker, Craig; Whetstone, Lauren; Glascoff, Mary; Moore, Justin B. – American Journal of Health Education, 2008
Background: Traditional health measurement tools use a pathogenic, or disease origins framework, to assess for the absence of disease or risk factors. Good or positive health, however, is more than the absence of disease and current tools do not reflect this. Purpose: The purpose of this study was to test the psychometric properties of the adult…
Descriptors: Health Education, Health Promotion, Life Satisfaction, Health Conditions
Pages: 1  |  ...  |  425  |  426  |  427  |  428  |  429  |  430  |  431  |  432  |  433  |  ...  |  1115