Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017
Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Kane, Michael T. – Journal of Educational Measurement, 2013
This response to the comments contains three main sections, each addressing a subset of the comments. In the first section, I will respond to the comments by Brennan, Haertel, and Moss. All of these comments suggest ways in which my presentation could be extended or improved; I generally agree with their suggestions, so my response to their…
Descriptors: Validity, Test Interpretation, Test Use, Scores
Mattern, Krista D.; Kobrin, Jennifer L.; Camara, Wayne J. – Measurement: Interdisciplinary Research and Perspectives, 2012
As researchers at a testing organization concerned with the appropriate uses and validity evidence for our assessments, we provide an applied perspective related to the issues raised in the focus article. Newton's proposal for elaborating the consensus definition of validity is offered with the intention to reduce the risks of inadequate…
Descriptors: Evidence, Validity, Tests, Testing
Wiliam, Dylan – Measurement: Interdisciplinary Research and Perspectives, 2013
In "How Is Testing Supposed to Improve Schooling?" Edward Haertel has proposed a framework for thinking about the mechanisms by which testing might improve the various educational processes undertaken in schools. The framework seems to the author to be quite general (he uses the word "general" here in its mathematical sense of including all cases)…
Descriptors: Educational Testing, Educational Improvement, Test Results, Test Use
New York State Education Department, 2017
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Shin, Seon-Hi; Slater, Charles L.; Backhoff, Eduardo – Educational Administration Quarterly, 2013
This study compared PISA 2009 student reading literacy scores with principal perceptions across three countries with varying levels of student performance: Korea, Mexico, and the United States. Seventy-five countries participated in PISA 2009, which measured 15-year-old children's reading achievement and principal perceptions. The study explored…
Descriptors: Foreign Countries, Reading Achievement, Principals, Administrator Attitudes
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
Validation research for educational achievement tests is often limited to an examination of intended test score interpretations. This article calls for an expansion of validation research in three dimensions. First, validation must attend to actual test use and its consequences, not just score meaning. Second, validation must attend to unintended…
Descriptors: Educational Testing, Educational Improvement, Test Validity, Achievement Tests
National Council on Measurement in Education, 2012
Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…
Descriptors: State Programs, Integrity, Testing, Test Preparation
Klesch, Heather S. – ProQuest LLC, 2010
The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
Descriptors: Feedback (Response), Test Results, Focus Groups, Educational Testing
Kane, Michael T. – Educational Researcher, 2008
Lissitz and Samuelsen (2007) have proposed an operational definition of "validity" that shifts many of the questions traditionally considered under validity to a separate category associated with the utility of test use. Operational definitions support inferences about how well people perform some kind of task or how they respond to some kind of…
Descriptors: Test Use, Definitions, Validity, Classification
Banken, Joseph A. – Journal of Clinical Psychology, 1985 (peer reviewed)
Investigated the utility of considering Digits Forward (DF) and Digits Backward (DB) as separate components of the Wechsler Adult Intelligence Scale-Revised (WAIS-R) through correlations with other intelligence tests. The findings of significant correlations indicate that although DF and DB tasks are related, the combination of these tasks into a…
Descriptors: Correlation, Intelligence Tests, Psychological Evaluation, Test Interpretation
Megargee, Edwin I. – Psychological Assessment, 1997 (peer reviewed)
A Minnesota Multiphasic Personality Inventory (MMPI) based classification system (E. I. Megargee, 1977) was extended to female prisoners, using MMPI results for 400 inmates. Using revised rules for classifying the original MMPIs and MMPI-2s, 386 women could be classified on both versions, and 87% were classified identically. (SLD)
Descriptors: Classification, Females, Personality Measures, Personality Traits

