Yi Zou; Ying Zheng; Jingwen Wang – International Journal of Language Testing, 2025
The Pearson Test of English Academic (PTE-A), a widely used high-stakes language proficiency test for university admissions and migration purposes, underwent a notable change from a three-hour to a two-hour version in November 2021. The implementation of the new version has prompted inquiries into the washback effects on various stakeholders.…
Descriptors: Testing Problems, Test Preparation, High Stakes Tests, English (Second Language)
Aryadoust, Vahid – Language Testing, 2023
Construct validity and building validity arguments are some of the main challenges facing the language assessment community. The notion of construct validity and validity arguments arose from research in psychological assessment and developed into the gold standard of validation/validity research in language assessment. At a theoretical level,…
Descriptors: Testing Problems, Test Validity, Second Language Learning, Construct Validity
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveals that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Bao, Lei; Xiao, Yang; Koenig, Kathleen; Han, Jing – Physical Review Physics Education Research, 2018
In science, technology, engineering, and mathematics education there has been increased emphasis on teaching goals that include not only the learning of content knowledge but also the development of scientific reasoning skills. The Lawson classroom test of scientific reasoning (LCTSR) is a popular assessment instrument for scientific reasoning.…
Descriptors: Science Tests, Science Process Skills, Logical Thinking, Test Validity
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
[Peer reviewed] Tracy, Lane – Social Behavior and Personality, 1987
Conducted two studies to examine validity of scales measuring consideration and initiating structure as basic dimensions of leader behavior. Results indicated that the factors were multidimensional and probably did not represent fundamental dimensions of leader behavior. Found relevant scales of Leader Behavior Description Questionnaire to be…
Descriptors: Behavior, College Students, Construct Validity, Higher Education
[Peer reviewed] Fulcher, Glenn – ELT Journal, 1987
Communicative oral language tests have claimed high content validity, but have also elicited concern that the assessment scales are based on theory with little empirical justification. A new approach to construct validity can be found in discourse analysis, which could lead to the development of new communicative discourse tests in all skills. (CB)
Descriptors: Communicative Competence (Languages), Construct Validity, Discourse Analysis, Language Proficiency
Scholz, George E. – 1993
A discussion of language testing in the context of a program in English for Special Purposes (ESP) focuses on the lack of "fit" between the two areas and makes some recommendations for improvement. It begins with overviews of recent trends in testing and recent issues in ESP. Overlap is seen in two areas: construct and content validity. It is…
Descriptors: Construct Validity, Content Validity, Curriculum Design, English for Special Purposes
Alonzo, Alicia C. – Measurement: Interdisciplinary Research and Perspectives, 2007
Schilling et al. (this issue) have done a commendable job in illustrating a comprehensive process of validating assessments of teacher knowledge (and, more broadly, other types of tests as well). On one hand, the concrete illustration of a process that often remains murky and incomplete is profoundly heartening, as it provides a rigorous model for…
Descriptors: Mathematics Education, Teacher Characteristics, Mathematics Instruction, Knowledge Base for Teaching
[Peer reviewed] Madaus, George F. – Educational Measurement: Issues and Practice, 1986
This reply to William A. Mehrens argues that test validity is the central issue in discussing the appropriate role of tests. It states that the procedures used to establish the validity of tests are inadequate because they depend primarily on content validity and not on construct and criterion validity. (JAZ)
Descriptors: Concurrent Validity, Construct Validity, Cutting Scores, Decision Making
Facione, Peter A. – 1989
Four major problem areas inhibit the standardized assessment of critical thinking (CT): (1) content validity; (2) construct validity; (3) technical jargon; and (4) background knowledge. Practical examples of framing multiple-choice items for assessment are suggested. In the area of content validity, new agreement about the definition of CT now…
Descriptors: Cognitive Measurement, Construct Validity, Content Validity, Critical Thinking
Gearhart, Maryl – Measurement: Interdisciplinary Research and Perspectives, 2007
Teacher knowledge has been of theoretical and empirical interest for over two decades, and development of measures is overdue. The researchers represented in this volume have been breaking new ground by developing a measure of mathematical knowledge for teaching (MKT) without guiding precedents, and in the face of differing perspectives on teacher…
Descriptors: Learning Theories, Elementary School Mathematics, Teaching Methods, Construct Validity
Kulikowich, Jonna M. – Measurement: Interdisciplinary Research and Perspectives, 2007
Operating from multiple literature bases in cognitive psychology, mathematics education, and theoretical and applied psychometrics, Schilling, Hill and their colleagues provide a systemic approach to studying the validity of scores of mathematical knowledge for teaching. This system encompasses an array of task formats and methodologies. The…
Descriptors: Multiple Choice Tests, Learning Theories, Teaching Methods, Construct Validity
Zuskin, Robin D. – 1993
Second language tests claiming to assess communicative competence are widespread, despite the vague nature of the construct. Sociolinguistic or intercultural competence is gradually gaining attention in the classroom, but testing has not kept pace, partly because of difficulty in defining the related skills. An opinion is that speech act theory…
Descriptors: Communicative Competence (Languages), Construct Validity, Intercultural Communication, Language Proficiency
Wainer, Howard; Kiely, Gerard L. – 1986
Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity

