ERIC - Search Results

Showing 1 to 15 of 18 results

Updating a Time-Series of Survey Questions: The Case of Abortion Attitudes in the General Social Survey

Peer reviewed

Cowan, Sarah K.; Hout, Michael; Perrett, Stuart – Sociological Methods & Research, 2024

Long-running surveys need a systematic way to reflect social change and to keep items relevant to respondents, especially when they ask about controversial subjects, or they threaten the items' validity. We propose a protocol for updating measures that preserves content and construct validity. First, substantive experts articulate the current and…

Descriptors: Surveys, Public Opinion, Social Attitudes, Pregnancy

Continual Improvement of a Student Evaluation of Teaching over Seven Semesters at a State University

Peer reviewed

Rates, Christopher; Liu, Xiufeng; Vanzile-Tamzen, Carol; Morreale, Cathleen – AERA Online Paper Repository, 2017

In the fall of 2014, the University at Buffalo created a new universal Student Evaluation of Teaching (SET). The purpose of the present study was to establish the construct validity of SET items. Rasch analyses of data from 7 semesters (N=203,194 students) revealed problems with item fit indices and threshold distances. Changes to items and…

Descriptors: Student Evaluation of Teacher Performance, State Universities, College Students, Teacher Effectiveness

The Underlying Structure of Foreign Language Anxiety in Integrated Speaking Assessment: A Mixed Methods Study

Peer reviewed

Lee, Kwangmin; Ye, Yafei – Language Education & Assessment, 2021

The aim of this mixed methods study is to identify the underlying structure of the construct of Foreign Language Anxiety in integrated listening-to-speak tasks. First, the analysis of the qualitative interviews with six postsecondary ESL learners reveals that anxiety for integrated speaking stems from four different factors: "listening,"…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Factor Analysis

Flaws in Flynn Effect Research with the Wechsler Scales

Peer reviewed

Weiss, Lawrence G.; Gregoire, Jacques; Zhu, Jianjun – Journal of Psychoeducational Assessment, 2016

Many Flynn effect (FE) studies compare scores across different editions of Wechsler's IQ tests. When construct changes are introduced by the test developers in the new edition, however, the presumed generational effects are difficult to untangle from changes due to test content. To remove this confound, we use the same edition of Wechsler…

Descriptors: Generational Differences, Intelligence Tests, Comparative Analysis, Scores

The Development and Validation of the Test Of Astronomy STandards (TOAST)

Peer reviewed

Slater, Stephanie J. – Journal of Astronomy & Earth Sciences Education, 2014

The Test Of Astronomy STandards (TOAST) is a comprehensive assessment instrument designed to measure students' general astronomy content knowledge. Built upon the research embedded within a generation of astronomy assessments designed to measure single concepts, the TOAST is appropriate to measure across an entire astronomy course. The TOAST's…

Descriptors: Astronomy, Academic Standards, Science Tests, Test Content

Development of an Instrument for Assessing Senior High School Students' Preferred and Perceived Laboratory Classroom Environment

Peer reviewed

Hsiao, Chien-Hua; Wu, Ying-Tien; Lin, Chung-Yen; Wong, Terrence William; Fu, Hsieh-Hai; Yeh, Ting-Kuang; Chang, Chung-Yen – Learning Environments Research, 2014

This study aimed to develop an instrument, named the inquiry-based laboratory classroom environment instrument (ILEI), for assessing senior high-school science students' preferred and perceived laboratory environment. A total of 262 second-year students, from a senior-high school in Taiwan, were recruited for this study. Four stages were included…

Descriptors: Test Construction, Science Laboratories, Inquiry, Science Instruction

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

The Development and Initial Psychometric Evaluation of the Korean Career Stress Inventory for College Students

Peer reviewed

Choi, Bo Young; Park, Heerak; Nam, Suk Kyung; Lee, Jayoung; Cho, Daeyeon; Lee, Sang Min – Career Development Quarterly, 2011

The purpose of this study was to develop a Korean Career Stress Inventory (KCSI), which is designed to measure Korean college students' experiences and symptoms of career stress. Even though there have been numerous scales related to career issues, few scales measure the career stress construct and its dimensions. Factor structure, internal…

Descriptors: College Students, Factor Structure, Psychometrics, Stress Variables

Systematic Criterion-Referenced Test Development in an English-Language Program

Kumazawa, Takaaki – ProQuest LLC, 2011

Although classroom assessment is one of the most frequent practices carried out by teachers in all educational programs, limited research has been conducted to investigate the dependability and validity of criterion-referenced tests (CRTs). The main purpose of this study is to develop a criterion-referenced test for first-year Japanese university…

Descriptors: Criterion Referenced Tests, Test Construction, Test Validity, English (Second Language)

Reconsidering Issues in Validity Theory

Peer reviewed

Gorin, Joanna S. – Educational Researcher, 2007

Lissitz and Samuelsen (2007) propose a new framework for validity theory and terminology, emphasizing a shift in theory and practice toward issues of test content rather than constructs. The author of this article argues that several of Lissitz and Samuelsen's critiques of validity theory focus on previously considered, but subsequently discarded,…

Descriptors: Test Content, Test Validity, Construct Validity, Test Construction

The Central Role of Content Representation in Test Validity.

Sireci, Stephen G. – 1995

The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…

Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis

Does the Text Matter in a Multiple-Choice Test of Comprehension? The Case for the Construct Validity of TOEFL's Minitalks.

Peer reviewed

Kostin, Irene; Freedle, Roy – Language Testing, 1999

A study investigated whether examinees taking the Test of English as a Foreign Language (TOEFL) attended to the text passages in the "minitalks" when answering the multiple-choice items (n=337) testing listening comprehension. Results support the construct validity of the minitalks, and also allow comparison between reading and listening…

Descriptors: Construct Validity, English (Second Language), Language Tests, Listening Comprehension

The Test-Taking Behaviors, Knowledge and Perceptions of Adult Basic Education Students on a Standardized Reading Comprehension Test.

Schierloh, Jane M. – 1993

A qualitative study investigated the test-taking behaviors, knowledge, and perceptions of 20 urban, adult basic education students reading at third to fifth grade equivalency levels. The entire reading comprehension subtest of the Test of Adult Basic Education, levels E and M, was administered under standardized conditions. A combination of…

Descriptors: Adult Basic Education, Construct Validity, Reading Comprehension, Scores

Using Subject-Matter Experts to Assess Content Representation: An MDS Analysis.

Peer reviewed

Sireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995

An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Geisinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)

Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity

Does Quantity Equal Quality?: The Relationship between Length of Response and Scores on the SAT Essay

Peer reviewed

Kobrin, Jennifer L.; Deng, Hui; Shaw, Emily J. – Journal of Applied Testing Technology, 2007

This study was designed to address two frequent criticisms of the SAT essay--that essay length is the best predictor of scores, and that there is an advantage in using more "sophisticated" examples as opposed to personal experience. The study was based on 2,820 essays from the first three administrations of the new SAT. Each essay was…

Descriptors: Testing Programs, Computer Assisted Testing, Construct Validity, Writing Skills
