NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 48 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Muhammad Yoga Prabowo; Sarah Rahmadian – TEFLIN Journal: A publication on the teaching and learning of English, 2023
The outbreak of the COVID-19 pandemic has transformed the educational landscape in a way unseen before. Educational institutions are navigating between offline and online learning worldwide. Computer-based testing is rapidly taking over paper-and-pencil testing as the dominant mode of assessment. In some settings, computer-based and…
Descriptors: English (Second Language), Second Language Learning, Test Format, Language Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yulianto, Ahmad; Pudjitriherwanti, Anastasia; Kusumah, Chevy; Oktavia, Dies – International Journal of Language Testing, 2023
The increasing use of computer-based mode in language testing raises concern over its similarities with and differences from paper-based format. The present study aimed to delineate discrepancies between TOEFL PBT and CBT. For that objective, a quantitative method was employed to probe into scores equivalence, the performance of male-female…
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
McClellan, Catherine; Snyder, Rebecca; Woods-Murphy, Maryann; Basset, Katherine – National Network of State Teachers of the Year, 2018
Great teachers recognize great assessments. As policy and education leaders work to make sure state tests are measuring the problem-solving, writing, and critical-thinking skills students need for success, they should convene and rely on teachers to review test quality and help answer the question: Do the questions on our state test reflect…
Descriptors: Student Evaluation, Educational Quality, Standardized Tests, Test Items
Wah-Mei Kodimer – ProQuest LLC, 2015
The purpose of the proposed study is to add to the literature regarding the assessment of effort and malingering in the field of neuropsychology using the Rey Complex Figure Test (RCFT). The majority of the literature on this measure has been in the specific areas for which the instrument was developed, namely those of visual spatial and…
Descriptors: Psychological Evaluation, Neuropsychology, Recognition (Psychology), Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Rios, Joseph A.; Sireci, Stephen G. – International Journal of Testing, 2014
The International Test Commission's "Guidelines for Translating and Adapting Tests" (2010) provide important guidance on developing and evaluating tests for use across languages. These guidelines are widely applauded, but the degree to which they are followed in practice is unknown. The objective of this study was to perform a…
Descriptors: Guidelines, Translation, Adaptive Testing, Second Languages
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Wang, Huan – ProQuest LLC, 2010
Multiple uses of the same assessment may present challenges for both the design and use of an assessment. Little advice, however, has been given to assessment developers as to how to understand the phenomena of multiple assessment use and meet the challenges these present. Particularly problematic is the case in which an assessment is used for…
Descriptors: Test Use, Testing Programs, Program Effectiveness, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Peer reviewed Peer reviewed
Direct linkDirect link
Leach, Mark M.; Oakland, Thomas – International Journal of Testing, 2007
Ethics codes are designed to protect the public by prescribing behaviors professionals are expected to exhibit. Although test use is universal, albeit reflecting strong Western influences, previous studies that examine the degree issues pertaining to test development and use and that are addressed in ethics codes of national psychological…
Descriptors: Test Use, Test Construction, Psychologists, Psychology
Peer reviewed Peer reviewed
Rest, James; And Others – Journal of Educational Psychology, 1997
The Defining Issues Test (DIT) of moral judgment, a stage preference approach, is compared with the Moral Judgment Test (MJT) widely used in Europe. The MJT is a stage consistency test. Construct validity is defined in terms of seven types of studies, which support use of the stage preference approach. (SLD)
Descriptors: Attitude Measures, Comparative Analysis, Ethics, Moral Values
Peer reviewed Peer reviewed
Downing, Steven M. – Educational Measurement: Issues and Practice, 1992
Research on true-false (TF), multiple-choice, and alternate-choice (AC) tests is reviewed, discussing strengths, weaknesses, and the usefulness in classroom and large-scale testing of each. Recommendations are made for improving use of AC items to overcome some of the problems associated with TF items. (SLD)
Descriptors: Comparative Analysis, Educational Research, Multiple Choice Tests, Objective Tests
Peer reviewed Peer reviewed
Cameron, L. A.; Heywood, J. – College Teaching, 1985
A study of the comparability of results of two techniques of testing, the traditional approach and that of giving the students the questions before the examination, revealed little difference in the two approaches' results, and supports the use of "prior notice" to reduce test anxiety. (MSE)
Descriptors: College Instruction, Comparative Analysis, Higher Education, Preservice Teacher Education
Merz, William R. – 1980
Several methods of assessing test item bias are described, and the concept of fair use of tests is examined. A test item is biased if individuals of equal ability have different probabilities of attaining the item correct. The following seven general procedures used to examine test items for bias are summarized and discussed: (1) analysis of…
Descriptors: Comparative Analysis, Evaluation Methods, Factor Analysis, Mathematical Models
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4