Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 4 |
Descriptor
| Comparative Testing | 15 |
| Multidimensional Scaling | 15 |
| Item Response Theory | 7 |
| Item Analysis | 4 |
| Factor Analysis | 3 |
| Factor Structure | 3 |
| Foreign Countries | 3 |
| Psychometrics | 3 |
| Rating Scales | 3 |
| Research Methodology | 3 |
| Test Reliability | 3 |
| More ▼ | |
Source
Author
| Marsh, Herbert W. | 2 |
| Aldhafri, Said | 1 |
| Azen, Razia | 1 |
| Bandalos, Deborah | 1 |
| Barisa, Mark | 1 |
| Benson, Jeri | 1 |
| Bhola, Dennison S. | 1 |
| Byrne, Barbara M. | 1 |
| DeMars, Christine E. | 1 |
| Gitomer, Drew H. | 1 |
| Haertel, Edward H. | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 11 |
| Journal Articles | 8 |
| Reports - Evaluative | 4 |
| Speeches/Meeting Papers | 3 |
Education Level
| Elementary Education | 2 |
| Elementary Secondary Education | 2 |
| Higher Education | 2 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Self Description Questionnaire | 2 |
| Computer Anxiety Scale | 1 |
What Works Clearinghouse Rating
Megan Kuhfeld; James Soland – Annenberg Institute for School Reform at Brown University, 2020
A huge portion of what we know about how humans develop, learn, behave, and interact is based on survey data. Researchers use longitudinal growth modeling to understand the development of students on psychological and social-emotional learning constructs across elementary and middle school. In these designs, students are typically administered a…
Descriptors: Elementary School Students, Middle School Students, Social Emotional Learning, Measurement Techniques
Klassen, Robert M.; Aldhafri, Said; Mansfield, Caroline F.; Purwanto, Edy; Siu, Angela F. Y.; Wong, Marina W.; Woods-McConney, Amanda – Journal of Experimental Education, 2012
This study explored the validity of the Utrecht Work Engagement Scale in a sample of 853 practicing teachers from Australia, Canada, China (Hong Kong), Indonesia, and Oman. The authors used multigroup confirmatory factor analysis to test the factor structure and measurement invariance across settings, after which they examined the relationships…
Descriptors: Job Satisfaction, Factor Structure, Measures (Individuals), Factor Analysis
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
Shin, Tacksoo – Asia Pacific Education Review, 2007
This study introduces three growth modeling techniques: latent growth modeling (LGM), hierarchical linear modeling (HLM), and longitudinal profile analysis via multidimensional scaling (LPAMS). It compares the multilevel growth parameter estimates and potential predictor effects obtained using LGM, HLM, and LPAMS. The purpose of this multilevel…
Descriptors: Multidimensional Scaling, Academic Achievement, Structural Equation Models, Causal Models
Turner, Carol J.; Smith, Jeffrey K. – Measurement and Evaluation in Guidance, 1982
Used aggregate ratings of teacher behavior as data for a multitrait-multimethod validity analysis. Scaled ratings using Rasch latent trait scaling model and traditional scaling techniques. Compared Rasch-scaled multitrait-multimethod matrix to the traditionally scaled multitrait-multimethod matrix. Results showed Rasch scaling resulted in higher…
Descriptors: Children, Comparative Testing, Data Analysis, Elementary Education
Peer reviewedBandalos, Deborah; Benson, Jeri – Educational and Psychological Measurement, 1990
The Computer Anxiety Scale was tested for invariance over the grouping conditions of males/females and graduate/undergraduate status. Subjects included 187 undergraduates and 188 graduates; analyses were conducted on 136 males and 236 females. Results indicate that the construct of computer anxiety appears to be multidimensional with highly…
Descriptors: Anxiety, Comparative Testing, Computers, Factor Structure
Walker, Cindy M.; Azen, Razia; Schmitt, Thomas – Educational and Psychological Measurement, 2006
It is believed by some that most tests are multidimensional, meaning that they measure more than one underlying construct. The primary objective of this study is to illustrate how variations in the secondary ability distribution affect the statistical detection of dimensionality and to demonstrate the difference between substantive and statistical…
Descriptors: Multidimensional Scaling, Item Response Theory, Comparative Testing, Statistical Analysis
Peer reviewedGitomer, Drew H.; Yamamoto, Kentaro – Journal of Educational Measurement, 1991
A model integrating latent trait and latent class theories in characterizing individual performance on the basis of qualitative understanding is presented. This HYBRID model is illustrated through experiments with 119 Air Force technicians taking a paper-and-pencil test and 136 Air Force technicians taking a computerized test. (SLD)
Descriptors: Comparative Testing, Computer Assisted Testing, Educational Assessment, Item Response Theory
DeMars, Christine E. – Online Submission, 2005
Several methods for estimating item response theory scores for multiple subtests were compared. These methods included two multidimensional item response theory models: a bi-factor model where each subtest was a composite score based on the primary trait measured by the set of tests and a secondary trait measured by the individual subtest, and a…
Descriptors: Item Response Theory, Multidimensional Scaling, Correlation, Scoring Rubrics
Wang, Wen-Chung; Wilson, Mark; Shih, Ching-Lin – Journal of Educational Measurement, 2006
This study presents the random-effects rating scale model (RE-RSM) which takes into account randomness in the thresholds over persons by treating them as random-effects and adding a random variable for each threshold in the rating scale model (RSM) (Andrich, 1978). The RE-RSM turns out to be a special case of the multidimensional random…
Descriptors: Item Analysis, Rating Scales, Item Response Theory, Monte Carlo Methods
Ho, Andrew D.; Haertel, Edward H. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2006
Problems of scale typically arise when comparing test score trends, gaps, and gap trends across different tests. To overcome some of these difficulties, we can express the difference between the observed test performance of two groups with graphs or statistics that are metric-free (i.e., invariant under positive monotonic transformations of the…
Descriptors: Testing Programs, Test Results, Comparative Testing, Multidimensional Scaling
Barisa, Mark; And Others – 1991
The validity of the Self Concept as a Learner Revised (SCALR) inventory was studied. The construct issue of academic self-concept was explored by comparing the SCALR to the academic portion of the Multidimensional Self Concept Scale (MSCS). The SCALR contains 44 items (four scales of 11 items each). The SCALR version for grades 7 through 12 was…
Descriptors: Cognitive Style, Comparative Testing, Construct Validity, Correlation
Marsh, Herbert W.; And Others – 1990
The purposes of the present investigation are to evaluate a new, adaptive procedure for assessing multiple dimensions of self-concept for children younger than 8 years and to examine related theoretical issues. The multidimensional, hierarchical structure of self-concept is well established for older children, but there is a paucity of research…
Descriptors: Child Development, Comparative Testing, Factor Structure, Foreign Countries
Marsh, Herbert W.; Byrne, Barbara M. – 1990
Self/other agreement between self-concept ratings by the individual and self-concepts inferred by significant others is of theoretical and practical importance, but the review by J. S. Shrauger and T. J. Schoeneman (1979) found no evidence for such agreement. In the present investigation, the Self Description Questionnaire III (SDQIII) was…
Descriptors: Comparative Testing, Cross Cultural Studies, Factor Analysis, Foreign Countries
Steinberg, Wendy J. – 1990
The purpose of this study was to examine the nature and degree of differences in expert versus novice knowledge structures, both before and after training, when judging the similarity of multiple-choice test items within a statistics and test theory (STT) domain. Subjects were employees of the Testing Division of the New York State Department of…
Descriptors: Adults, Cognitive Structures, Comparative Testing, Government Employees

Direct link
