Lee, Guemin; Lee, Won-Chan – Applied Measurement in Education, 2016
The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…
Descriptors: Test Format, Multidimensional Scaling, Item Response Theory, Equated Scores
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013
Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…
Descriptors: Sample Size, Test Length, Correlation, Test Format
Peer reviewed: Robin, Frederic; Sireci, Stephen G.; Hambleton, Ronald K. – International Journal of Testing, 2003
Illustrates how multidimensional scaling (MDS) and differential item functioning (DIF) procedures can be used to evaluate the equivalence of different language versions of an examination. Presents examples of structural differences and DIF across languages. (SLD)
Descriptors: Item Bias, Licensing Examinations (Professions), Multidimensional Scaling, Multilingual Materials
Sireci, Stephen G.; Bastari, B. – 1998
In many cross-cultural research studies, assessment instruments are translated or adapted for use in multiple languages. However, it cannot be assumed that different language versions of an assessment are equivalent across languages. A fundamental issue to be addressed is the comparability or equivalence of the construct measured by each language…
Descriptors: Construct Validity, Cross Cultural Studies, Evaluation Methods, Multidimensional Scaling
Peer reviewed: Dawis, Rene V. – Journal of Counseling Psychology, 1987
Discusses design, development, and evaluation of scales used in counseling psychology research. Describes methods of scale construction including the Thurstone, Q-sort, rank-order methods, Likert, semantic differential, Guttman, Rasch, and external criterion methods. Presents ways of evaluating newly developed scales. Discusses measurement versus…
Descriptors: Counseling, Measures (Individuals), Multidimensional Scaling, Psychology
Peer reviewed: Christiansen, Neil D.; And Others – Educational and Psychological Measurement, 1996
The usefulness of examining the structural validity of scores on multidimensional measures using nested hierarchical model comparisons was evaluated in 2 studies using the Social Problem Solving Inventory (SPSI) with samples of 464 and 216 undergraduates. Results support the conceptual model of the SPSI. (SLD)
Descriptors: Comparative Analysis, Construct Validity, Higher Education, Interpersonal Relationship
Sireci, Stephen G.; Gonzalez, Eugenio J. – 2003
International comparative educational studies make use of test instruments originally developed in English by international panels of experts, but that are ultimately administered in the language of instruction of the students. The comparability of the different language versions of these assessments is a critical issue in validating the…
Descriptors: Academic Achievement, Comparative Analysis, Difficulty Level, International Education
Sireci, Stephen G.; Khaliq, Shameem Nyla – 2002
Many students in the United States who are required to take educational tests are not fully proficient in English. To address this problem, a state-mandated testing program created dual language English-Spanish versions of some of their tests. In this study, the psychometric properties of the English and dual language versions of a fourth-grade…
Descriptors: Item Bias, Language Proficiency, Limited English Speaking, Multidimensional Scaling
Peer reviewed: Diekhoff, George M. – Journal of Educational Psychology, 1983
Alternatives to multidimensional scaling analysis of numerical relationship judgments in extracting information about students' knowledge were investigated. Undergraduate students completed a multiple choice test over knowledge of individual concepts, an essay test covering relationships among concepts, and a relationship judgment test that…
Descriptors: Cognitive Ability, Cognitive Tests, Correlation, Essay Tests
Peer reviewed: Rocklin, Thomas – Applied Measurement in Education, 1992
College students rated dissimilarity of pairs of common test item formats. A multidimensional scaling model with individual differences fit to data from 111 students suggested that they used 2 dimensions to distinguish among the formats, 1 separating supply from selection items and 1 based on the number of options. (SLD)
Descriptors: Academic Ability, Academic Achievement, College Students, Higher Education
Reckase, Mark D.; And Others – 1989
The purpose of the paper is to determine whether test forms of the Mathematics Usage Test (AAP Math) of the American College Testing Program are parallel in a multidimensional sense. The AAP Math is an achievement test of mathematics concepts acquired by high school students by the end of their third year. To determine the dimensionality of the…
Descriptors: Achievement Tests, Factor Analysis, High School Students, High Schools
Yao, Lihua; Schwarz, Richard D. – Applied Psychological Measurement, 2006
Multidimensional item response theory (IRT) models have been proposed for better understanding the dimensional structure of data or to define diagnostic profiles of student learning. A compensatory multidimensional two-parameter partial credit model (M-2PPC) for constructed-response items is presented that is a generalization of those proposed to…
Descriptors: Models, Item Response Theory, Markov Processes, Monte Carlo Methods
Sireci, Stephen G.; Fitzgerald, Cyndy; Xing, Dehui – 1998
Adapting credentialing examinations for international uses involves translating tests for use in multiple languages. This paper explores methods for evaluating construct equivalence and item equivalence across different language versions of a test. These methods were applied to four different language versions (English, French, German, and…
Descriptors: Credentials, Engineers, Factor Analysis, Foreign Countries
Sykes, Robert C.; And Others – 1992
The sources of multidimensionality found in several different forms of a licensure examination were studied. The relationship between one source of multidimensionality, differential item functioning (DIF) (or factors producing DIF), and content characteristics was explored in an attempt to isolate aspects of training or curriculum that could…
Descriptors: Factor Analysis, Factor Structure, Health Personnel, Higher Education

