ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	4

Descriptor

Comparative Testing	15
Multidimensional Scaling	15
Item Response Theory	7
Item Analysis	4
Factor Analysis	3
Factor Structure	3
Foreign Countries	3
Psychometrics	3
Rating Scales	3
Research Methodology	3
Test Reliability	3
Test Validity	3
Undergraduate Students	3
Achievement Gains	2
Construct Validity	2
Correlation	2
Elementary School Students	2
Evaluation Methods	2
Goodness of Fit	2
Higher Education	2
Methods Research	2
Multitrait Multimethod…	2
Predictor Variables	2
Scoring Rubrics	2
Self Concept Measures	2
More ▼

Source

Educational and Psychological…	3
Journal of Educational…	2
Annenberg Institute for…	1
Asia Pacific Education Review	1
Journal of Experimental…	1
Measurement and Evaluation in…	1
National Center for Research…	1
Online Submission	1

Publication Type

Reports - Research	11
Journal Articles	8
Reports - Evaluative	4
Speeches/Meeting Papers	3

Education Level

Elementary Education	2
Elementary Secondary Education	2
Higher Education	2
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Australia	3
Canada	2
California	1
China	1
Hong Kong	1
Indonesia	1
Netherlands	1
New York	1
Oman	1
Tennessee	1

Laws, Policies, & Programs

Assessments and Surveys

Self Description Questionnaire	2
Computer Anxiety Scale	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Estimating Student Growth on Psychological and Social-Emotional Constructs: A Comparison of Multiple Scoring Approaches. EdWorkingPaper No. 20-193

Download full text

Megan Kuhfeld; James Soland – Annenberg Institute for School Reform at Brown University, 2020

A huge portion of what we know about how humans develop, learn, behave, and interact is based on survey data. Researchers use longitudinal growth modeling to understand the development of students on psychological and social-emotional learning constructs across elementary and middle school. In these designs, students are typically administered a…

Descriptors: Elementary School Students, Middle School Students, Social Emotional Learning, Measurement Techniques

Teachers' Engagement at Work: An International Validation Study

Peer reviewed

Direct link

Klassen, Robert M.; Aldhafri, Said; Mansfield, Caroline F.; Purwanto, Edy; Siu, Angela F. Y.; Wong, Marina W.; Woods-McConney, Amanda – Journal of Experimental Education, 2012

This study explored the validity of the Utrecht Work Engagement Scale in a sample of 853 practicing teachers from Australia, Canada, China (Hong Kong), Indonesia, and Oman. The authors used multigroup confirmatory factor analysis to test the factor structure and measurement invariance across settings, after which they examined the relationships…

Descriptors: Job Satisfaction, Factor Structure, Measures (Individuals), Factor Analysis

Setting the Response Time Threshold Parameter to Differentiate Solution Behavior from Rapid-Guessing Behavior

Peer reviewed

Direct link

Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007

This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…

Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory

Comparison of Three Growth Modeling Techniques in the Multilevel Analysis of Longitudinal Academic Achievement Scores: Latent Growth Modeling, Hierarchical Linear Modeling, and Longitudinal Profile Analysis via Multidimensional Scaling

Peer reviewed
PDF on ERIC

Download full text

Direct link

Shin, Tacksoo – Asia Pacific Education Review, 2007

This study introduces three growth modeling techniques: latent growth modeling (LGM), hierarchical linear modeling (HLM), and longitudinal profile analysis via multidimensional scaling (LPAMS). It compares the multilevel growth parameter estimates and potential predictor effects obtained using LGM, HLM, and LPAMS. The purpose of this multilevel…

Descriptors: Multidimensional Scaling, Academic Achievement, Structural Equation Models, Causal Models

Traditional versus Rasch Scaling of Aggregate Data in the Multitrait-Multimethod Matrix.

Turner, Carol J.; Smith, Jeffrey K. – Measurement and Evaluation in Guidance, 1982

Used aggregate ratings of teacher behavior as data for a multitrait-multimethod validity analysis. Scaled ratings using Rasch latent trait scaling model and traditional scaling techniques. Compared Rasch-scaled multitrait-multimethod matrix to the traditionally scaled multitrait-multimethod matrix. Results showed Rasch scaling resulted in higher…

Descriptors: Children, Comparative Testing, Data Analysis, Elementary Education

Testing the Factor Structure Invariance of a Computer Attitude Scale over Two Grouping Conditions.

Peer reviewed

Bandalos, Deborah; Benson, Jeri – Educational and Psychological Measurement, 1990

The Computer Anxiety Scale was tested for invariance over the grouping conditions of males/females and graduate/undergraduate status. Subjects included 187 undergraduates and 188 graduates; analyses were conducted on 136 males and 236 females. Results indicate that the construct of computer anxiety appears to be multidimensional with highly…

Descriptors: Anxiety, Comparative Testing, Computers, Factor Structure

Statistical versus Substantive Dimensionality: The Effect of Distributional Differences on Dimensionality Assessment Using DIMTEST

Peer reviewed

Direct link

Walker, Cindy M.; Azen, Razia; Schmitt, Thomas – Educational and Psychological Measurement, 2006

It is believed by some that most tests are multidimensional, meaning that they measure more than one underlying construct. The primary objective of this study is to illustrate how variations in the secondary ability distribution affect the statistical detection of dimensionality and to demonstrate the difference between substantive and statistical…

Descriptors: Multidimensional Scaling, Item Response Theory, Comparative Testing, Statistical Analysis

Performance Modeling That Integrates Latent Trait and Class Theory.

Peer reviewed

Gitomer, Drew H.; Yamamoto, Kentaro – Journal of Educational Measurement, 1991

A model integrating latent trait and latent class theories in characterizing individual performance on the basis of qualitative understanding is presented. This HYBRID model is illustrated through experiments with 119 Air Force technicians taking a paper-and-pencil test and 136 Air Force technicians taking a computerized test. (SLD)

Descriptors: Comparative Testing, Computer Assisted Testing, Educational Assessment, Item Response Theory

Scoring Subscales Using Multidimensional Item Response Theory Models

Download full text

DeMars, Christine E. – Online Submission, 2005

Several methods for estimating item response theory scores for multiple subtests were compared. These methods included two multidimensional item response theory models: a bi-factor model where each subtest was a composite score based on the primary trait measured by the set of tests and a secondary trait measured by the individual subtest, and a…

Descriptors: Item Response Theory, Multidimensional Scaling, Correlation, Scoring Rubrics

Modeling Randomness in Judging Rating Scales with a Random-Effects Rating Scale Model

Peer reviewed

Direct link

Wang, Wen-Chung; Wilson, Mark; Shih, Ching-Lin – Journal of Educational Measurement, 2006

This study presents the random-effects rating scale model (RE-RSM) which takes into account randomness in the thresholds over persons by treating them as random-effects and adding a random variable for each threshold in the rating scale model (RSM) (Andrich, 1978). The RE-RSM turns out to be a special case of the multidimensional random…

Descriptors: Item Analysis, Rating Scales, Item Response Theory, Monte Carlo Methods

Metric-Free Measures of Test Score Trends and Gaps with Policy-Relevant Examples. CSE Report 665

Download full text

Ho, Andrew D.; Haertel, Edward H. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2006

Problems of scale typically arise when comparing test score trends, gaps, and gap trends across different tests. To overcome some of these difficulties, we can express the difference between the observed test performance of two groups with graphs or statistics that are metric-free (i.e., invariant under positive monotonic transformations of the…

Descriptors: Testing Programs, Test Results, Comparative Testing, Multidimensional Scaling

The Self Concept as a Learner Inventory: A Cross-Validation Study.

Download full text

Barisa, Mark; And Others – 1991

The validity of the Self Concept as a Learner Revised (SCALR) inventory was studied. The construct issue of academic self-concept was explored by comparing the SCALR to the academic portion of the Multidimensional Self Concept Scale (MSCS). The SCALR contains 44 items (four scales of 11 items each). The SCALR version for grades 7 through 12 was…

Descriptors: Cognitive Style, Comparative Testing, Construct Validity, Correlation

Self-Concepts of Young Children Aged 5 to 8: Their Measurement and Multidimensional Structure.

Download full text

Marsh, Herbert W.; And Others – 1990

The purposes of the present investigation are to evaluate a new, adaptive procedure for assessing multiple dimensions of self-concept for children younger than 8 years and to examine related theoretical issues. The multidimensional, hierarchical structure of self-concept is well established for older children, but there is a paucity of research…

Descriptors: Child Development, Comparative Testing, Factor Structure, Foreign Countries

Do We See Ourselves as Others Infer: A Comparison of Self-Other Agreement on Multiple Dimensions of Self-Concept from Two Continents.

Download full text

Marsh, Herbert W.; Byrne, Barbara M. – 1990

Self/other agreement between self-concept ratings by the individual and self-concepts inferred by significant others is of theoretical and practical importance, but the review by J. S. Shrauger and T. J. Schoeneman (1979) found no evidence for such agreement. In the present investigation, the Self Description Questionnaire III (SDQIII) was…

Descriptors: Comparative Testing, Cross Cultural Studies, Factor Analysis, Foreign Countries

Differences between Novice and Expert Knowledge Structure, Pre- and Post-Training, in a Statistics and Test Theory Domain.

Download full text

Steinberg, Wendy J. – 1990

The purpose of this study was to examine the nature and degree of differences in expert versus novice knowledge structures, both before and after training, when judging the similarity of multiple-choice test items within a statistics and test theory (STT) domain. Subjects were employees of the Testing Division of the New York State Department of…

Descriptors: Adults, Cognitive Structures, Comparative Testing, Government Employees

Marsh, Herbert W.	2
Aldhafri, Said	1
Azen, Razia	1
Bandalos, Deborah	1
Barisa, Mark	1
Benson, Jeri	1
Bhola, Dennison S.	1
Byrne, Barbara M.	1
DeMars, Christine E.	1
Gitomer, Drew H.	1
Haertel, Edward H.	1
Ho, Andrew D.	1
James Soland	1
Klassen, Robert M.	1
Kong, Xiaojing J.	1
Mansfield, Caroline F.	1
Megan Kuhfeld	1
Purwanto, Edy	1
Schmitt, Thomas	1
Shih, Ching-Lin	1
Shin, Tacksoo	1
Siu, Angela F. Y.	1
Smith, Jeffrey K.	1
Steinberg, Wendy J.	1
More ▼