Showing all 13 results
Peer reviewed
Arias, Angel; Blais, Jean-Guy – Canadian Modern Language Review, 2023
This article draws on argument-based validation to gather and evaluate construct-related evidence (i.e., the explanation inference) of a high-stakes test. The data stemmed from the listening component of a French test used for immigration to Canada through the province of Quebec. An expert panel with varied backgrounds in applied linguistics…
Descriptors: French, Listening Comprehension Tests, Second Language Learning, High Stakes Tests
Peer reviewed
Hassan, Nurul Huda; Shih, Chih-Min – Language Assessment Quarterly, 2013
This article describes and reviews the Singapore-Cambridge General Certificate of Education Advanced Level General Paper (GP) examination. As a written test that is administered to preuniversity students, the GP examination is internationally recognised and accepted by universities and employers as proof of English competence. In this article, the…
Descriptors: Foreign Countries, College Entrance Examinations, English (Second Language), Writing Tests
Peer reviewed
Sasaki, Miyuki – Language Testing, 2012
The Modern Language Aptitude Test (Paper-and-Pencil Version, henceforth, the MLAT) measures "an individual's ability to learn a foreign language." It targets English-speaking adults (over Grade 9) who are literate. The test has only one form, which has not changed since it was first published by the Psychological Corporation in 1959. The test can…
Descriptors: Aptitude Tests, Test Reviews, Rewards, Acoustics
Peer reviewed
Osterlind, Steven J.; Miao, Danmin; Sheng, Yanyan; Chia, Rosina C. – International Journal of Testing, 2004
This study investigated the interaction between different cultural groups and item type, and the ensuing effect on construct validity for a psychological inventory, the Myers-Briggs Type Indicator (MBTI, Form G). The authors analyzed 94 items from 2 Chinese-translated versions of the MBTI (Form G) for factorial differences among groups of…
Descriptors: Test Format, Undergraduate Students, Cultural Differences, Test Validity
Peer reviewed
Brown, C. R.; Njabili, A. F. – Research in Science and Technological Education, 1989
Explores the concept of construct validity in the context of a public examination. Multitrait-multimethod analysis and factor analysis were used. Discusses the analyses in terms of theory versus practical components and experimental versus observational investigation in practical components. Sample items for the practical examinations are…
Descriptors: Biology, Construct Validity, Factor Analysis, Foreign Countries
Peer reviewed
Benson, Jeri; Rentsch, Joan – Educational and Psychological Measurement, 1988
Confirmatory factor analysis techniques assessed several structural models that have been reported regarding the construct validity of the Piers-Harris Children's Self-Concept Scale. Responses of 885 Black, White, and Hispanic students in grades three-six suggest that the scale's construct validity is a function of content and manner of phrasing.…
Descriptors: Black Students, Child Development, Construct Validity, Elementary Education
Melancon, Janet G.; Thompson, Bruce – 1989
Classical measurement theory was used to investigate the measurement (psychometric) characteristics of both parts of the Finding Embedded Figures Test (FEFT) administered in either a "no guessing" supply format or a multiple-choice selection format to undergraduate college students or to middle school students. Three issues were…
Descriptors: Comparative Testing, Construct Validity, Higher Education, Junior High School Students
Bolton, David L.; And Others – 1989
A study was conducted to assess the validity of translations of two different forms of a licensing examination for cosmetologists in Florida to ensure that both Spanish and English candidates have equal chances of being licensed. The LISREL computer program was used to test the equivalence of factor structure, units of measurement, and standard…
Descriptors: Construct Validity, Cosmetology, English, Factor Analysis
Wainer, Howard; Kiely, Gerard L. – 1986
Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity
Hendrickson, Amy; Patterson, Brian; Melican, Gerald – College Board, 2008
Presented at the Annual Meeting of the National Council on Measurement in Education (NCME) in New York in March 2008. This presentation explores how different item weighting can affect the effective weights, validity coefficients, and test reliability of composite scores among test takers.
Descriptors: Multiple Choice Tests, Test Format, Test Validity, Test Reliability
Peer reviewed
Turner, Jean – Annual Review of Applied Linguistics, 1998
This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…
Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews
Melancon, Janet G.; Thompson, Bruce – 1990
Classical measurement theory was used to investigate measurement characteristics of both parts of the Finding Embedded Figures Test (FEFT) when the test was: administered in either a "no guessing" supply format or a multiple-choice selection format; administered to either undergraduate college students or middle school students; and…
Descriptors: Comparative Testing, Construct Validity, Guessing (Tests), Higher Education
Park, Chung; Allen, Nancy L. – 1994
This study is part of continuing research into the meaning of future National Assessment of Educational Progress (NAEP) science scales. In this study, the test framework, as examined by NAEP's consensus process, and attributes of the items, identified by science experts, cognitive scientists, and measurement specialists, are examined. Preliminary…
Descriptors: Communication (Thought Transfer), Comparative Analysis, Construct Validity, Content Validity