NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 106 to 120 of 771 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Nichols, Paul D.; Williams, Natasha – Educational Measurement: Issues and Practice, 2009
This article has three goals. The first goal is to clarify the role that the consequences of test score use play in validity judgments by reviewing the role that modern writers on validity have ascribed for consequences in supporting validity judgments. The second goal is to summarize current views on who is responsible for collecting evidence of…
Descriptors: Tests, Test Validity, Scores, Data Collection
Peer reviewed Peer reviewed
Direct linkDirect link
Penfield, Randall D. – Educational Researcher, 2010
A growing body of research showing that grade retention serves as an educationally low-quality placement has raised increasing concerns about whether the use of standardized tests in making decisions concerning grade retention conforms to current standards for appropriate and nondiscriminatory test use. This article examines the extent to which…
Descriptors: Test Use, Grade Repetition, Standardized Tests, Learning Readiness
Peer reviewed Peer reviewed
Direct linkDirect link
Fulcher, Glenn; Davidson, Fred – Language Testing, 2009
Just like buildings, tests are designed and built for specific purposes, people, and uses. However, both buildings and tests grow and change over time as the needs of their users change. Sometimes, they are also both used for purposes other than those intended in the original designs. This paper explores architecture as a metaphor for language…
Descriptors: Figurative Language, Language Tests, Measurement Techniques, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Denner, Peter; Norman, Antony; Lin, Shu-Yuan – Educational Assessment, Evaluation and Accountability, 2009
Acknowledging the necessity to establish the fairness and consequential validity of teacher candidate performance assessments when they are used to make high-stakes decisions impacting entry into the profession, we investigated whether there were any adverse results from the use of the Renaissance Teacher Work Sample (TWS) assessment at two…
Descriptors: Work Sample Tests, Test Validity, Test Bias, Preservice Teachers
Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009
Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…
Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
New York State Education Department, 2014
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. The purpose of this report is to document the technical aspects of the 2013-14 NYSAA.…
Descriptors: Alternative Assessment, Educational Assessment, State Departments of Education, Student Evaluation
Mather, Nancy; Bos, Candace – Diagnostique, 1984
Performance of 46 gifted and talented students (7-12 years old) on the Woodcock-Johnson Tests of Cognitive Ability and the Wechsler Intelligence Scale for Children-Revised was compared. Concurrent validity between the two full-scale measures was indicated. Scores on the alternative cluster of Broad Reasoning provided more accurate appraisal of…
Descriptors: Gifted, Talent, Test Use, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items
Peer reviewed Peer reviewed
Simpson, B. Allen – Psychology: A Quarterly Journal of Human Behavior, 1986
Deals with the use of the "lie detector" or "polygraphic test" as a method of detecting deception in industries and law enforcement agencies. Explains what the polygraph is and how it operates. Presents a series of specific arguments for and against the validity of the instrument. Research appears to be inconclusive. (Author/ABB)
Descriptors: Industry, Measurement Equipment, Polygraphs, Test Use
Peer reviewed Peer reviewed
Douglass, Frazier M.; Douglass, Robin – Family Relations, 1994
Authors of 1993 article on Myers-Briggs Type Indicator (MBTI) respond to article by Sherman and Jones (1994) concerning 1993 article. Addresses nine points raised by Sherman and Jones. Concludes that MBTI holds considerable promise as tool in therapy. (NB)
Descriptors: Personality Measures, Reader Response, Test Use, Test Validity
Peer reviewed Peer reviewed
Meyer, Gregory J. – Psychological Assessment, 1997
In reply to criticism of the Rorschach Comprehensive System (CS) by J. Wood, M. Nezworski, and W. Stejskal (1996), this article presents a meta-analysis of published data indicating that the CS has excellent chance-corrected interrater reliability. It is noted that the erroneous assumptions of Wood et al. make their assertions about validity…
Descriptors: Interrater Reliability, Meta Analysis, Test Use, Test Validity
Peer reviewed Peer reviewed
Wood, James M.; Nezworski, M. Teresa; Stejskal, William J. – Psychological Assessment, 1997
G. Meyer (1997) attempts to refute the present authors' criticisms of the interrater reliability of the Rorschach Comprehensive System (CS) but misrepresents their position and offers a flawed meta-analysis in support of his own. Rorschach proponents need to undertake high-quality replicated studies of CS reliability and validity. (SLD)
Descriptors: Interrater Reliability, Meta Analysis, Test Use, Test Validity
Peer reviewed Peer reviewed
Meyer, Gregory J. – Psychological Assessment, 1997
Replies to Wood et al. and documents limitations of their conclusions about the Rorschach Comprehensive System (CS), supporting Meyer's own meta-analysis, which finds adequate interrater reliability for the CS. (SLD)
Descriptors: Interrater Reliability, Meta Analysis, Test Use, Test Validity
Pages: 1  |  ...  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  12  |  ...  |  52