Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 12 |
| Since 2007 (last 20 years) | 17 |
Descriptor
| Test Interpretation | 81 |
| Test Use | 81 |
| Scores | 21 |
| Elementary Secondary Education | 20 |
| Test Validity | 16 |
| Higher Education | 13 |
| Testing Problems | 13 |
| Achievement Tests | 12 |
| Intelligence Tests | 12 |
| Standardized Tests | 12 |
| Test Results | 12 |
| More ▼ | |
Source
Author
| Elmore, Patricia B. | 3 |
| Amy Briesch | 2 |
| Bauer, Malcolm I. | 2 |
| Brittany Melo | 2 |
| Davis, W. Alan | 2 |
| Jacqueline M. Caemmerer | 2 |
| Jessica B. Koslouski | 2 |
| Jin, Hui | 2 |
| Moore, John C. | 2 |
| Pressler, Yamina | 2 |
| Sandra M. Chafouleas | 2 |
| More ▼ | |
Publication Type
| Reports - Research | 81 |
| Journal Articles | 44 |
| Speeches/Meeting Papers | 18 |
| Tests/Questionnaires | 6 |
| Information Analyses | 3 |
| Guides - Non-Classroom | 1 |
| Reports - Evaluative | 1 |
| Reports - General | 1 |
Education Level
Audience
| Researchers | 6 |
| Practitioners | 2 |
Location
| Massachusetts | 2 |
| Alabama | 1 |
| Arizona | 1 |
| Canada | 1 |
| China | 1 |
| France | 1 |
| Germany | 1 |
| Hong Kong | 1 |
| Indiana | 1 |
| Kansas | 1 |
| Louisiana | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Education Consolidation… | 1 |
| Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023
Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…
Descriptors: Test Interpretation, Scores, Test Use, Test Validity
Cristan Farmer; Audrey Thurm; Tanvi Das; E. Martina Bebin; Jonathan A. Bernstein; Elizabeth Berry-Kravis; Joseph D. Buxbaum; Charis Eng; Thomas Frazier; Antonio Y. Hardan; Alexander Kolevzon; Darcy A. Krueger; Julian A. Martinez-Agosto; Hope Northrup; Craig M. Powell; Latha Valluripalli Soorya; Joyce Y. Wu; Mustafa Sahin – American Journal on Intellectual and Developmental Disabilities, 2025
Developmental domains, such as cognitive, language, and motor, are key concepts of interest in longitudinal studies of intellectual and developmental disabilities (IDD). Normative scores (e.g., IQ) are often used to operationalize performance on standardized tests of these concepts, but it is the interval-distributed person-ability scores that are…
Descriptors: Cognitive Tests, Intelligence Tests, Cognitive Ability, Intellectual Disability
Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2020
Researchers have documented the impact of rater effects, or raters' tendencies to give different ratings than would be expected given examinee achievement levels, in performance assessments. However, the degree to which rater effects influence person fit, or the reasonableness of test-takers' achievement estimates given their response patterns,…
Descriptors: Performance Based Assessment, Evaluators, Achievement, Influences
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Psychometrics, Validity, Child Development
Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – International Journal of Science Education, 2019
This article provides a validation framework for research on the development and use of science Learning Progressions (LPs). The framework describes how evidence from various sources can be used to establish an interpretive argument and a validity argument at five stages of LP research--development, scoring, generalisation, extrapolation, and use.…
Descriptors: Sequential Approach, Educational Research, Science Education, Validity
Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – Grantee Submission, 2019
This article provides a validation framework for research on the development and use of science Learning Progressions (LPs). The framework describes how evidence from various sources can be used to establish an interpretive argument and a validity argument at five stages of LP research--development, scoring, generalisation, extrapolation, and use.…
Descriptors: Sequential Approach, Educational Research, Science Education, Validity
Iaccarino, Stephanie; von der Embse, Nathaniel; Kilgus, Stephen – Journal of Psychoeducational Assessment, 2019
Detecting mental illness in school students may prevent poor school outcomes. Clinicians often use universal behavioral screeners to identify students at risk for mental illness. This study examined the applicability of Kane's interpretation and use argument (IUA) to the Social, Academic, and Emotional Behavior Risk Screener--Teacher Rating Scale…
Descriptors: Screening Tests, Test Interpretation, Test Use, Mental Disorders
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Usability, Decision Making, Validity
Stone, Elizabeth; Wylie, E. Caroline – ETS Research Report Series, 2019
We describe the summative assessment component within a K-12 assessment program and our development of a validity argument to support its claims with respect to intended uses and interpretations. First, we describe the "Winsight"® assessment program theory of action, a logic model elucidating mechanisms for how use of the assessment…
Descriptors: Summative Evaluation, Educational Assessment, Test Validity, Test Use
Leonard, Jack – Education Policy Analysis Archives, 2018
This paper introduces the new Massachusetts Performance Assessment for Leaders (PAL) and uses critical policy analysis to re-examine the validity evidence (using the 2014 Standards for Educational and Psychological Testing and a theory of multicultural validity) for the use and interpretation of the PAL in regards to emerging school leadership.…
Descriptors: Performance Based Assessment, Test Validity, High Stakes Tests, School Administration
Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017
Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias
Cheng, Liying; Sun, Youyi – Language Assessment Quarterly, 2015
This article draws on Kane's (2006) argument-based validation framework to synthesize evidence derived from a large-scale, mixed-method explanatory study on the impact of the Ontario Secondary School Literacy Test (OSSLT) on second language (L2) students. The purpose of the OSSLT is to ensure that students have acquired the essential reading and…
Descriptors: Foreign Countries, Secondary School Students, Literacy, Reading Tests
Traynor, Anne – Educational Assessment, 2017
Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…
Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum
Jordan, Sally – Computers & Education, 2012
Students were observed directly, in a usability laboratory, and indirectly, by means of an extensive evaluation of responses, as they attempted interactive computer-marked assessment questions that required free-text responses of up to 20 words and as they amended their responses after receiving feedback. This provided more general insight into…
Descriptors: Learner Engagement, Feedback (Response), Evaluation, Test Interpretation
January, Stacy-Ann A.; Ardoin, Scott P.; Christ, Theodore J.; Eckert, Tanya L.; White, Mary Jane – School Psychology Review, 2016
Universal screening in elementary schools often includes administering curriculum-based measurement in reading (CBM-R); but in first grade, nonsense word fluency (NWF) and, to a lesser extent, word identification fluency (WIF) are used because of concerns that CBM-R is too difficult for emerging readers. This study used Kane's argument-based…
Descriptors: Curriculum Based Assessment, Reading Tests, Test Interpretation, Test Use

Peer reviewed
Direct link
