ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	4

Descriptor

Construct Validity	13
Test Format	13
Test Validity	13
Factor Analysis	5
Test Construction	5
Test Items	5
Multiple Choice Tests	4
Psychometrics	4
Second Language Learning	4
Test Reliability	4
English	3
Foreign Countries	3
Language Tests	3
Science Tests	3
Undergraduate Students	3
College Entrance Examinations	2
Comparative Analysis	2
Comparative Testing	2
Content Validity	2
Difficulty Level	2
Goodness of Fit	2
High Stakes Tests	2
Higher Education	2
Middle Schools	2
Scores	2

Source

Annual Review of Applied…	1
Canadian Modern Language…	1
College Board	1
Educational and Psychological…	1
International Journal of…	1
Language Assessment Quarterly	1
Language Testing	1
Research in Science and…	1

Publication Type

Reports - Research	8
Journal Articles	7
Speeches/Meeting Papers	4
Reports - Evaluative	3
Information Analyses	1
Non-Print Media	1
Reference Materials - General	1

Education Level

Higher Education	3
Postsecondary Education	2
High Schools	1
Secondary Education	1

Audience

Practitioners	1
Researchers	1
Teachers	1

Location

Canada	1
Singapore	1
Tanzania	1

Assessments and Surveys

Embedded Figures Test	2
National Assessment of…	1
Piers Harris Childrens Self…	1
SAT (College Admission Test)	1

Showing all 13 results

Evaluating the Explanation Inference of a High-Stakes French Listening Test: An Argument-Based Perspective

Peer reviewed

Arias, Angel; Blais, Jean-Guy – Canadian Modern Language Review, 2023

This article draws on argument-based validation to gather and evaluate construct-related evidence (i.e., the explanation inference) of a high-stakes test. The data stemmed from the listening component of a French test used for immigration to Canada through the province of Quebec. An expert panel with varied backgrounds in applied linguistics…

Descriptors: French, Listening Comprehension Tests, Second Language Learning, High Stakes Tests

The Singapore-Cambridge General Certificate of Education Advanced-Level General Paper Examination

Peer reviewed

Hassan, Nurul Huda; Shih, Chih-Min – Language Assessment Quarterly, 2013

This article describes and reviews the Singapore-Cambridge General Certificate of Education Advanced Level General Paper (GP) examination. As a written test that is administered to preuniversity students, the GP examination is internationally recognised and accepted by universities and employers as proof of English competence. In this article, the…

Descriptors: Foreign Countries, College Entrance Examinations, English (Second Language), Writing Tests

Test Review: The Modern Language Aptitude Test (Paper-and-Pencil Version)

Peer reviewed

Sasaki, Miyuki – Language Testing, 2012

The Modern Language Aptitude Test (Paper-and-Pencil Version, henceforth, the MLAT) measures "an individual's ability to learn a foreign language." It targets English-speaking adults (over Grade 9) who are literate. The test has only one form, which has not changed since it was first published by the Psychological Corporation in 1959. The test can…

Descriptors: Aptitude Tests, Test Reviews, Rewards, Acoustics

Adapting Item Format for Cultural Effects in Translated Tests: Cultural Effects on Construct Validity of the Chinese Versions of the MBTI

Peer reviewed

Osterlind, Steven J.; Miao, Danmin; Sheng, Yanyan; Chia, Rosina C. – International Journal of Testing, 2004

This study investigated the interaction between different cultural groups and item type, and the ensuing effect on construct validity for a psychological inventory, the Myers-Briggs Type Indicator (MBTI, Form G). The authors analyzed 94 items from 2 Chinese-translated versions of the MBTI (Form G) for factorial differences among groups of…

Descriptors: Test Format, Undergraduate Students, Cultural Differences, Test Validity

The Construct Validity of an Examination Designed to Test Practical Ability in Biology.

Peer reviewed

Brown, C. R.; Njabili, A. F. – Research in Science and Technological Education, 1989

Explores the concept of construct validity in the context of a public examination. Multitrait-multimethod analysis and factor analysis were used. Discusses the analyses in terms of theory versus practical components and experimental versus observational investigation in practical components. Sample items for the practical examinations are…

Descriptors: Biology, Construct Validity, Factor Analysis, Foreign Countries

Testing the Dimensionality of the Piers-Harris Children's Self-Concept Scale.

Peer reviewed

Benson, Jeri; Rentsch, Joan – Educational and Psychological Measurement, 1988

Confirmatory factor analysis techniques assessed several structural models that have been reported regarding the construct validity of the Piers-Harris Children's Self-Concept Scale. Responses of 885 Black, White, and Hispanic students in grades three-six suggest that the scale's construct validity is a function of content and manner of phrasing.…

Descriptors: Black Students, Child Development, Construct Validity, Elementary Education

Measurement Characteristics of the Finding Embedded Figures Test with Middle School Students.

Melancon, Janet G.; Thompson, Bruce – 1989

Classical measurement theory was used to investigate the measurement (psychometric) characteristics of both parts of the Finding Embedded Figures Test (FEFT) administered in either a "no guessing" supply format or a multiple-choice selection format to undergraduate college students or to middle school students. Three issues were…

Descriptors: Comparative Testing, Construct Validity, Higher Education, Junior High School Students

Validity of Translations of a Cosmetology Licensure Examination.

Bolton, David L.; And Others – 1989

A study was conducted to assess the validity of translations of two different forms of a licensing examination for cosmetologists in Florida to ensure that both Spanish and English candidates have equal chances of being licensed. The LISREL computer program was used to test the equivalence of factor structure, units of measurement, and standard…

Descriptors: Construct Validity, Cosmetology, English, Factor Analysis

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity

The Effect of Using Different Weights for Multiple-Choice and Free-Response Item Sections

Hendrickson, Amy; Patterson, Brian; Melican, Gerald – College Board, 2008

Presented at the annual meeting of the National Council on Measurement in Education (NCME) in New York in March 2008. This presentation explores how different item weighting can affect the effective weights, validity coefficients, and test reliability of composite scores among test takers.

Descriptors: Multiple Choice Tests, Test Format, Test Validity, Test Reliability

Assessing Speaking.

Peer reviewed

Turner, Jean – Annual Review of Applied Linguistics, 1998

This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…

Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews

Measurement Characteristics of the Finding Embedded Figures Test in "Speed" versus "Power" Administrations.

Melancon, Janet G.; Thompson, Bruce – 1990

Classical measurement theory was used to investigate measurement characteristics of both parts of the Finding Embedded Figures Test (FEFT) when the test was: administered in either a "no guessing" supply format or a multiple-choice selection format; administered to either undergraduate college students or middle school students; and…

Descriptors: Comparative Testing, Construct Validity, Guessing (Tests), Higher Education

Relationships between Test Specifications, Item Responses, Task Demands, and Item Attributes in a Large-Scale Science Assessment.

Park, Chung; Allen, Nancy L. – 1994

This study is part of continuing research into the meaning of future National Assessment of Educational Progress (NAEP) science scales. In this study, the test framework, as examined by NAEP's consensus process, and attributes of the items, identified by science experts, cognitive scientists, and measurement specialists, are examined. Preliminary…

Descriptors: Communication (Thought Transfer), Comparative Analysis, Construct Validity, Content Validity

Author

Melancon, Janet G.	2
Thompson, Bruce	2
Allen, Nancy L.	1
Arias, Angel	1
Benson, Jeri	1
Blais, Jean-Guy	1
Bolton, David L.	1
Brown, C. R.	1
Chia, Rosina C.	1
Hassan, Nurul Huda	1
Hendrickson, Amy	1
Kiely, Gerard L.	1
Melican, Gerald	1
Miao, Danmin	1
Njabili, A. F.	1
Osterlind, Steven J.	1
Park, Chung	1
Patterson, Brian	1
Rentsch, Joan	1
Sasaki, Miyuki	1
Sheng, Yanyan	1
Shih, Chih-Min	1
Turner, Jean	1
Wainer, Howard	1