ERIC - Search Results

Showing all 15 results

A Case Study of Washback and Test Preparation of the New Version of PTE Academic

Peer reviewed

Yi Zou; Ying Zheng; Jingwen Wang – International Journal of Language Testing, 2025

The Pearson Test of English Academic (PTE-A), a widely used high-stakes language proficiency test for university admissions and migration purposes, underwent a notable change from a three-hour to a two-hour version in November 2021. The implementation of the new version has prompted inquiries into the washback effects on various stakeholders.…

Descriptors: Testing Problems, Test Preparation, High Stakes Tests, English (Second Language)

The Vexing Problem of Validity and the Future of Second Language Assessment

Peer reviewed

Aryadoust, Vahid – Language Testing, 2023

Construct validity and building validity arguments are some of the main challenges facing the language assessment community. The notion of construct validity and validity arguments arose from research in psychological assessment and developed into the gold standard of validation/validity research in language assessment. At a theoretical level,…

Descriptors: Testing Problems, Test Validity, Second Language Learning, Construct Validity

Evaluating Research Reports on the Qualities of Tests of English Language Skills in Indonesian Schools: A Systematic Review

Peer reviewed

Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025

The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveals that a number of evaluation studies have been done by Indonesian scholars in the…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

Validity Evaluation of the Lawson Classroom Test of Scientific Reasoning

Peer reviewed

Bao, Lei; Xiao, Yang; Koenig, Kathleen; Han, Jing – Physical Review Physics Education Research, 2018

In science, technology, engineering, and mathematics education there has been increased emphasis on teaching goals that include not only the learning of content knowledge but also the development of scientific reasoning skills. The Lawson classroom test of scientific reasoning (LCTSR) is a popular assessment instrument for scientific reasoning.…

Descriptors: Science Tests, Science Process Skills, Logical Thinking, Test Validity

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

Consideration and Initiating Structure: Are They Basic Dimensions of Leader Behavior?

Peer reviewed

Tracy, Lane – Social Behavior and Personality, 1987

Conducted two studies to examine validity of scales measuring consideration and initiating structure as basic dimensions of leader behavior. Results indicated that the factors were multidimensional and probably did not represent fundamental dimensions of leader behavior. Found relevant scales of Leader Behavior Description Questionnaire to be…

Descriptors: Behavior, College Students, Construct Validity, Higher Education

Tests of Oral Performance: The Need for Data-based Criteria.

Peer reviewed

Fulcher, Glenn – ELT Journal, 1987

Communicative oral language tests have claimed high content validity, but have also elicited concern that the assessment scales are based on theory with little empirical justification. A new approach to construct validity can be found in discourse analysis, which could lead to the development of new communicative discourse tests in all skills. (CB)

Descriptors: Communicative Competence (Languages), Construct Validity, Discourse Analysis, Language Proficiency

Exploring ESP and Language Testing.

Scholz, George E. – 1993

A discussion of language testing in the context of a program in English for Special Purposes (ESP) focuses on the lack of "fit" between the two areas and makes some recommendations for improvement. It begins with overviews of recent trends in testing and recent issues in ESP. Overlap is seen in two areas: construct and content validity. It is…

Descriptors: Construct Validity, Content Validity, Curriculum Design, English for Special Purposes

Challenges of Simultaneously Defining and Measuring Knowledge for Teaching

Peer reviewed

Alonzo, Alicia C. – Measurement: Interdisciplinary Research and Perspectives, 2007

Schilling et al. (this issue) have done a commendable job in illustrating a comprehensive process of validating assessments of teacher knowledge (and, more broadly, other types of tests as well). On one hand, the concrete illustration of a process that often remains murky and incomplete is profoundly heartening, as it provides a rigorous model for…

Descriptors: Mathematics Education, Teacher Characteristics, Mathematics Instruction, Knowledge Base for Teaching

Measurement Specialists: Testing the Faith--A Reply to Mehrens.

Peer reviewed

Madaus, George F. – Educational Measurement: Issues and Practice, 1986

This reply to William A. Mehrens argues that test validity is the central issue in discussing the appropriate role of tests. It states that the procedures used to establish the validity of tests are inadequate because they depend primarily on content validity and not on construct and criterion validity. (JAZ)

Descriptors: Concurrent Validity, Construct Validity, Cutting Scores, Decision Making

Assessing Inference Skills.

Facione, Peter A. – 1989

Four major problem areas inhibit the standardized assessment of critical thinking (CT): (1) content validity; (2) construct validity; (3) technical jargon; and (4) background knowledge. Practical examples of framing multiple-choice items for assessment are suggested. In the area of content validity, new agreement about the definition of CT now…

Descriptors: Cognitive Measurement, Construct Validity, Content Validity, Critical Thinking

Mathematics Knowledge for Teaching: Questions about Constructs

Peer reviewed

Gearhart, Maryl – Measurement: Interdisciplinary Research and Perspectives, 2007

Teacher knowledge has been of theoretical and empirical interest for over two decades, and development of measures is overdue. The researchers represented in this volume have been breaking new ground by developing a measure of mathematical knowledge for teaching (MKT) without guiding precedents, and in the face of differing perspectives on teacher…

Descriptors: Learning Theories, Elementary School Mathematics, Teaching Methods, Construct Validity

Toward Developmental Trajectories: A Commentary on "Assessing Measures of Mathematical Knowledge for Teaching"

Peer reviewed

Kulikowich, Jonna M. – Measurement: Interdisciplinary Research and Perspectives, 2007

Operating from multiple literature bases in cognitive psychology, mathematics education, and theoretical and applied psychometrics, Schilling, Hill and their colleagues provide a systemic approach to studying the validity of scores of mathematical knowledge for teaching. This system encompasses an array of task formats and methodologies. The…

Descriptors: Multiple Choice Tests, Learning Theories, Teaching Methods, Construct Validity

Assessing L2 Sociolinguistic Competence: In Search of Support from Pragmatic Theories.

Zuskin, Robin D. – 1993

Second language tests claiming to assess communicative competence are widespread, despite the vague nature of the construct. Sociolinguistic or intercultural competence is gradually gaining attention in the classroom, but testing has not kept pace, partly because of difficulty in defining the related skills. An opinion is that speech act theory…

Descriptors: Communicative Competence (Languages), Construct Validity, Intercultural Communication, Language Proficiency

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity
