Wilcox, Bethany R.; Caballero, Marcos D.; Baily, Charles; Sadaghiani, Homeyra; Chasteen, Stephanie V.; Ryan, Qing X.; Pollock, Steven J. – Physical Review Special Topics - Physics Education Research, 2015
The use of validated conceptual assessments alongside conventional course exams to measure student learning in introductory courses has become standard practice in many physics departments. These assessments provide a more standard measure of certain learning goals, allowing for comparisons of student learning across instructors, semesters,…
Descriptors: Student Evaluation, Physics, Tests, Advanced Courses
Engelhard, George, Jr.; Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2013
In 2012, Edward Haertel received the NCME Career Contributions Award. The focus article for this issue emerged from his address on the topic "How Is Testing Supposed to Improve Schooling?" His focus article provides a discussion of the relationships between testing and schooling in which he issues a call to action to the measurement community to…
Descriptors: Educational Testing, Educational Improvement, Social Action, Test Results
Ho, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2013
In his thoughtful focus article, Haertel (this issue) pushes testing experts to broaden the scope of their validation efforts and to invite scholars from other disciplines to join them. He credits existing validation frameworks for helping the measurement community to identify incomplete or nonexistent validity arguments. However, he notes his…
Descriptors: Educational Testing, Scores, Test Use, Test Validity
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
The author is deeply gratified by the commentators' thoughtful responses and finds almost nothing to disagree with in any of them. Each offers additional insights prompting further reflection. In drawing out just a few common themes, this brief rejoinder omits many important ideas from the individual contributions. As stated in his title, the…
Descriptors: Educational Testing, Educational Improvement, Test Interpretation, Test Use
Hatala, Rose; Cook, David A.; Brydges, Ryan; Hawkins, Richard – Advances in Health Sciences Education, 2015
In order to construct and evaluate the validity argument for the Objective Structured Assessment of Technical Skills (OSATS), based on Kane's framework, we conducted a systematic review. We searched MEDLINE, EMBASE, CINAHL, PsycINFO, ERIC, Web of Science, Scopus, and selected reference lists through February 2013. Working in duplicate, we selected…
Descriptors: Measures (Individuals), Test Validity, Surgery, Skills
Sriram, Rishi – NASPA - Student Affairs Administrators in Higher Education, 2014
When student affairs professionals assess their work, they often employ some type of survey. The use of surveys stems from a desire to objectively measure outcomes, a demand from someone else (e.g., supervisor, accreditation committee) for data, or the feeling that numbers can provide an aura of competence. Although surveys are effective tools for…
Descriptors: Surveys, Test Construction, Student Personnel Services, Test Use
Sanders, Sara – National Technical Assistance Center for the Education of Neglected or Delinquent Children and Youth (NDTAC), 2019
This guide is designed to assist States, agencies, and/or facilities who work with youth who are neglected, delinquent, or at-risk (N or D). The information in the guide will benefit those who are (a) interested in implementing pre-posttests, (b) in the process of identifying an appropriate pre-posttest, or (c) ready to evaluate current testing…
Descriptors: At Risk Students, Delinquency, Pretests Posttests, Testing
Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017
Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Cheng, Liying; Sun, Youyi – Language Assessment Quarterly, 2015
This article draws on Kane's (2006) argument-based validation framework to synthesize evidence derived from a large-scale, mixed-method explanatory study on the impact of the Ontario Secondary School Literacy Test (OSSLT) on second language (L2) students. The purpose of the OSSLT is to ensure that students have acquired the essential reading and…
Descriptors: Foreign Countries, Secondary School Students, Literacy, Reading Tests
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…
Descriptors: Validity, Construct Validity, Tests, Testing
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2013
To efficiently assess multiple psychological constructs and to minimize the burden on respondents, psychologists increasingly use shortened versions of existing tests. However, compared to the longer test, a shorter test version may have a substantial impact on the reliability and the validity of the test scores in psychological research and…
Descriptors: Test Length, Psychological Testing, Test Use, Test Validity
Mattern, Krista D.; Kobrin, Jennifer L.; Camara, Wayne J. – Measurement: Interdisciplinary Research and Perspectives, 2012
As researchers at a testing organization concerned with the appropriate uses and validity evidence for our assessments, we provide an applied perspective related to the issues raised in the focus article. Newton's proposal for elaborating the consensus definition of validity is offered with the intention to reduce the risks of inadequate…
Descriptors: Evidence, Validity, Tests, Testing
Penuel, William R.; Confrey, Jere; Maloney, Alan; Rupp, André A. – Journal of the Learning Sciences, 2014
This article analyzes the design decisions of a team developing diagnostic assessments for a learning trajectory focused on rational number reasoning. The analysis focuses on the design rationale for key decisions about how to develop the cognitive assessments and related validity arguments within a fluid state and national policy context. The…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Construction, Numbers

