ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	7

Descriptor

Construct Validity	11
Test Construction	11
Testing Problems	11
Test Validity	8
Measurement Techniques	5
Psychometrics	5
Test Items	5
Educational Assessment	4
Evaluation Methods	4
Foreign Countries	4
Language Tests	4
Multiple Choice Tests	4
Content Validity	3
Evaluation Research	3
Inferences	3
Second Language Learning	3
Test Reliability	3
Test Theory	3
Testing	3
Adaptive Testing	2
Culture Fair Tests	2
English (Second Language)	2
Evaluation Problems	2
Evidence	2
Knowledge Base for Teaching	2
More ▼

Source

Measurement:…	2
Review of Research in…	2
Educational Researcher	1
HOW	1
International Journal of…	1
Language Education &…	1

Author

Daniel Ginting	1
Facione, Peter A.	1
Gearhart, Maryl	1
Giraldo, Frank	1
Jingwen Wang	1
Kettler, Ryan J.	1
Kiely, Gerard L.	1
Kulikowich, Jonna M.	1
Messick, Samuel	1
Patrisius Istiarto Djiwandono	1
Scholz, George E.	1
Wainer, Howard	1
Wiliam, Dylan	1
Yi Zou	1
Ying Zheng	1
More ▼

Publication Type

Journal Articles	8
Reports - Evaluative	5
Information Analyses	3
Opinion Papers	2
Reports - Research	2
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	3
Elementary Education	2
Higher Education	1
Postsecondary Education	1

Audience

Location

United Kingdom	2
China	1
Colombia	1
Indonesia	1
United States	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Pearson Test of English…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

A Case Study of Washback and Test Preparation of the New Version of PTE Academic

Peer reviewed
PDF on ERIC

Download full text

Yi Zou; Ying Zheng; Jingwen Wang – International Journal of Language Testing, 2025

The Pearson Test of English Academic (PTE-A), a widely used high-stakes language proficiency test for university admissions and migration purposes, underwent a notable change from a three-hour to a two-hour version in November 2021. The implementation of the new version has prompted inquiries into the washback effects on various stakeholders.…

Descriptors: Testing Problems, Test Preparation, High Stakes Tests, English (Second Language)

Evaluating Research Reports on the Qualities of Tests of English Language Skills in Indonesian Schools: A Systematic Review

Peer reviewed
PDF on ERIC

Download full text

Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025

The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

Designing Language Assessments in Context: Theoretical, Technical, and Institutional Considerations

Peer reviewed
PDF on ERIC

Download full text

Giraldo, Frank – HOW, 2019

The purpose of this article of reflection is to raise awareness of how poor design of language assessments may have detrimental effects, if crucial qualities and technicalities of test design are not met. The article first discusses these central qualities for useful language assessments. Then, guidelines for creating listening assessments, as an…

Descriptors: Test Construction, Consciousness Raising, Language Tests, Second Language Learning

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

The Interplay of Evidence and Consequences in the Validation of Performance Assessments.

Peer reviewed

Messick, Samuel – Educational Researcher, 1994

Authentic and direct assessment of performance and products are examined in light of contrasting functions and purposes with implications for validation, especially those of specialized validity criteria for performance assessment. The roles of positive and negative consequences of validation are underscored, along with the need for evidence of…

Descriptors: Construct Validity, Criteria, Educational Assessment, Evaluation Methods

Exploring ESP and Language Testing.

Download full text

Scholz, George E. – 1993

A discussion of language testing in the context of a program in English for Special Purposes (ESP) focuses on the lack of "fit" between the two areas and makes some recommendations for improvement. It begins with overviews of recent trends in testing and recent issues in ESP. Overlap is seen in two areas: construct and content validity. It is…

Descriptors: Construct Validity, Content Validity, Curriculum Design, English for Special Purposes

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Direct link

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

Assessing Inference Skills.

Download full text

Facione, Peter A. – 1989

Four major problem areas inhibit the standardized assessment of critical thinking (CT): (1) content validity; (2) construct validity; (3) technical jargon; and (4) background knowledge. Practical examples of framing multiple-choice items for assessment are suggested. In the area of content validity, new agreement about the definition of CT now…

Descriptors: Cognitive Measurement, Construct Validity, Content Validity, Critical Thinking

Mathematics Knowledge for Teaching: Questions about Constructs

Peer reviewed

Direct link

Gearhart, Maryl – Measurement: Interdisciplinary Research and Perspectives, 2007

Teacher knowledge has been of theoretical and empirical interest for over two decades, and development of measures is overdue. The researchers represented in this volume have been breaking new ground by developing a measure of mathematical knowledge for teaching (MKT) without guiding precedents, and in the face of differing perspectives on teacher…

Descriptors: Learning Theories, Elementary School Mathematics, Teaching Methods, Construct Validity

Toward Developmental Trajectories: A Commentary on "Assessing Measures of Mathematical Knowledge for Teaching"

Peer reviewed

Direct link

Kulikowich, Jonna M. – Measurement: Interdisciplinary Research and Perspectives, 2007

Operating from multiple literature bases in cognitive psychology, mathematics education, and theoretical and applied psychometrics, Schilling, Hill and their colleagues provide a systemic approach to studying the validity of scores of mathematical knowledge for teaching. This system encompasses an array of task formats and methodologies. The…

Descriptors: Multiple Choice Tests, Learning Theories, Teaching Methods, Construct Validity

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity