Showing all 12 results
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…
Descriptors: Construct Validity, Test Theory, Test Use, Affordances
Peer reviewed
PDF on ERIC
Lim Hooi Lian; Wun Thiam Yew – International Journal of Assessment Tools in Education, 2023
Students from elementary to tertiary levels commonly have misunderstandings and difficulties in acquiring various statistical concepts and skills. However, existing statistics assessment frameworks are challenging to put into practice in a classroom setting. The purpose of this research is to develop and validate a statistical thinking assessment tool…
Descriptors: Psychometrics, Grade 7, Middle School Mathematics, Statistics Education
Peer reviewed
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012
The 1999 "Standards for Educational and Psychological Testing" defines validity as the degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests. Although quite explicit, there are ways in which this definition lacks precision, consistency, and clarity. The history of validity has taught us…
Descriptors: Evidence, Validity, Educational Testing, Risk
Peer reviewed
Evers, Arne; Sijtsma, Klaas; Lucassen, Wouter; Meijer, Rob R. – International Journal of Testing, 2010
This article describes the 2009 revision of the Dutch Rating System for Test Quality and presents the results of test ratings spanning almost 30 years. The rating system evaluates the quality of a test on seven criteria: theoretical basis, quality of the testing materials, comprehensiveness of the manual, norms, reliability, construct validity, and…
Descriptors: Rating Scales, Documentation, Educational Quality, Educational Testing
Peer reviewed
Coe, Robert – Research Papers in Education, 2010
Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, because they use the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…
Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards
Camara, Wayne – College Board, 2011
This presentation was given at the 2011 National Conference on Student Assessment (CCSSO). Its focus is how to validate the Common Core State Standards (CCSS) in math and ELA and the subsequent assessments that will be developed by state consortia. The CCSS specify the skills students need to be ready for post-secondary…
Descriptors: College Readiness, Career Readiness, Benchmarking, Student Evaluation
Peer reviewed
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009
This report evaluates the construct coverage of the e-rater[R] scoring engine. The matter of construct coverage depends on whether one defines writing skill in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…
Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests
Peer reviewed
Baumert, Jürgen; Lüdtke, Oliver; Trautwein, Ulrich; Brunner, Martin – Educational Research Review, 2009
Given the relatively high intercorrelations observed between mathematics achievement, reading achievement, and cognitive ability, it has recently been claimed that student assessment studies (e.g., TIMSS, PISA) and intelligence tests measure a single cognitive ability that is practically identical to general intelligence. The present article uses…
Descriptors: Intelligence, Reading Achievement, Mathematics Achievement, Outcomes of Education
Peer reviewed
Lissitz, Robert W.; Samuelsen, Karen – Educational Researcher, 2007
This article raises a number of questions about the current unified theory of test validity that has construct validity at its center. The authors suggest a different way of conceptualizing the problem of establishing validity by considering whether the investigation of a test is internal to the test itself or focuses on constructs…
Descriptors: Vocabulary, Evaluation Research, Construct Validity, Test Validity
Peer reviewed
Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007
This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…
Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills
Peer reviewed
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity