ERIC - Search Results

Showing 1 to 15 of 81 results

Defining Test-Score Interpretation, Use, and Claims: Delphi Study for the Validity Argument

Peer reviewed

Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023

Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…

Descriptors: Test Interpretation, Scores, Test Use, Test Validity

Which Score for What? Operationalizing Standardized Cognitive Test Performance for the Assessment of Change

Peer reviewed

Cristan Farmer; Audrey Thurm; Tanvi Das; E. Martina Bebin; Jonathan A. Bernstein; Elizabeth Berry-Kravis; Joseph D. Buxbaum; Charis Eng; Thomas Frazier; Antonio Y. Hardan; Alexander Kolevzon; Darcy A. Krueger; Julian A. Martinez-Agosto; Hope Northrup; Craig M. Powell; Latha Valluripalli Soorya; Joyce Y. Wu; Mustafa Sahin – American Journal on Intellectual and Developmental Disabilities, 2025

Developmental domains, such as cognitive, language, and motor, are key concepts of interest in longitudinal studies of intellectual and developmental disabilities (IDD). Normative scores (e.g., IQ) are often used to operationalize performance on standardized tests of these concepts, but it is the interval-distributed person-ability scores that are…

Descriptors: Cognitive Tests, Intelligence Tests, Cognitive Ability, Intellectual Disability

Exploring the Impact of Rater Effects on Person Fit in Rater-Mediated Assessments

Peer reviewed

Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2020

Researchers have documented the impact of rater effects, or raters' tendencies to give different ratings than would be expected given examinee achievement levels, in performance assessments. However, the degree to which rater effects influence person fit, or the reasonableness of test-takers' achievement estimates given their response patterns,…

Descriptors: Performance Based Assessment, Evaluators, Achievement, Influences

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024

We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…

Descriptors: Screening Tests, Psychometrics, Validity, Child Development

A Validation Framework for Science Learning Progression Research

Peer reviewed

Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – International Journal of Science Education, 2019

This article provides a validation framework for research on the development and use of science Learning Progressions (LPs). The framework describes how evidence from various sources can be used to establish an interpretive argument and a validity argument at five stages of LP research--development, scoring, generalisation, extrapolation, and use.…

Descriptors: Sequential Approach, Educational Research, Science Education, Validity

A Validation Framework for Science Learning Progression Research

Peer reviewed

Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – Grantee Submission, 2019

Descriptors: Sequential Approach, Educational Research, Science Education, Validity

Interpretation and Use of the Social, Academic, and Emotional Behavior Risk Screener: A Latent Transition Approach

Peer reviewed

Iaccarino, Stephanie; von der Embse, Nathaniel; Kilgus, Stephen – Journal of Psychoeducational Assessment, 2019

Detecting mental illness in school students may prevent poor school outcomes. Clinicians often use universal behavioral screeners to identify students at risk for mental illness. This study examined the applicability of Kane's interpretation and use argument (IUA) to the Social, Academic, and Emotional Behavior Risk Screener--Teacher Rating Scale…

Descriptors: Screening Tests, Test Interpretation, Test Use, Mental Disorders

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024

Descriptors: Screening Tests, Usability, Decision Making, Validity

Building a Validity Argument While Developing and Using an Assessment: A Concurrent Approach for the "Winsight"® Summative Assessment. Research Report. ETS RR-19-26

Peer reviewed

Stone, Elizabeth; Wylie, E. Caroline – ETS Research Report Series, 2019

We describe the summative assessment component within a K-12 assessment program and our development of a validity argument to support its claims with respect to intended uses and interpretations. First, we describe the "Winsight"® assessment program theory of action, a logic model elucidating mechanisms for how use of the assessment…

Descriptors: Summative Evaluation, Educational Assessment, Test Validity, Test Use

The Validity of a Performance Based Assessment for Aspiring School Leaders

Peer reviewed

Leonard, Jack – Education Policy Analysis Archives, 2018

This paper introduces the new Massachusetts Performance Assessment for Leaders (PAL) and uses critical policy analysis to re-examine the validity evidence (using the 2014 Standards for Educational and Psychological Testing and a theory of multicultural validity) for the use and interpretation of the PAL with regard to emerging school leadership.…

Descriptors: Performance Based Assessment, Test Validity, High Stakes Tests, School Administration

A Washback Study of the "Test for English Majors for Grade Eight" (TEM8) in China--From the Perspective of University Program Administrators

Peer reviewed

Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017

Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias

Interpreting the Impact of the Ontario Secondary School Literacy Test on Second Language Students within an Argument-Based Validation Framework

Peer reviewed

Cheng, Liying; Sun, Youyi – Language Assessment Quarterly, 2015

This article draws on Kane's (2006) argument-based validation framework to synthesize evidence derived from a large-scale, mixed-method explanatory study on the impact of the Ontario Secondary School Literacy Test (OSSLT) on second language (L2) students. The purpose of the OSSLT is to ensure that students have acquired the essential reading and…

Descriptors: Foreign Countries, Secondary School Students, Literacy, Reading Tests

Does Test Item Performance Increase with Test-to-Standards Alignment?

Peer reviewed

Traynor, Anne – Educational Assessment, 2017

Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…

Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum

Student Engagement with Assessment and Feedback: Some Lessons from Short-Answer Free-Text E-Assessment Questions

Peer reviewed

Jordan, Sally – Computers & Education, 2012

Students were observed directly, in a usability laboratory, and indirectly, by means of an extensive evaluation of responses, as they attempted interactive computer-marked assessment questions that required free-text responses of up to 20 words and as they amended their responses after receiving feedback. This provided more general insight into…

Descriptors: Learner Engagement, Feedback (Response), Evaluation, Test Interpretation

Evaluating the Interpretations and Use of Curriculum-Based Measurement in Reading and Word Lists for Universal Screening in First and Second Grade

Peer reviewed

January, Stacy-Ann A.; Ardoin, Scott P.; Christ, Theodore J.; Eckert, Tanya L.; White, Mary Jane – School Psychology Review, 2016

Universal screening in elementary schools often includes administering curriculum-based measurement in reading (CBM-R); but in first grade, nonsense word fluency (NWF) and, to a lesser extent, word identification fluency (WIF) are used because of concerns that CBM-R is too difficult for emerging readers. This study used Kane's argument-based…

Descriptors: Curriculum Based Assessment, Reading Tests, Test Interpretation, Test Use
