Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 4 |
Descriptor
| Evaluation Methods | 7 |
| Multidimensional Scaling | 7 |
| Test Construction | 7 |
| Psychometrics | 3 |
| Item Response Theory | 2 |
| Scores | 2 |
| Scoring Rubrics | 2 |
| Test Items | 2 |
| Test Reliability | 2 |
| Test Validity | 2 |
| Accuracy | 1 |
| More ▼ | |
Source
| Educational Assessment | 2 |
| Applied Psychological… | 1 |
| Assessment in Education:… | 1 |
| Journal of Educational… | 1 |
| Psychological Assessment | 1 |
Author
| Amery D. Wu | 1 |
| Darkes, Jack | 1 |
| Ferrini-Mundy, Joan | 1 |
| Floden, Robert E. | 1 |
| Galaczi, Evelina D. | 1 |
| Geisinger, Kurt F. | 1 |
| Goldman, Mark S. | 1 |
| Green, Anthony | 1 |
| Hubbard, Chris | 1 |
| Jake Stone | 1 |
| Lee, Minji K. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 6 |
| Reports - Research | 4 |
| Reports - Descriptive | 2 |
| Speeches/Meeting Papers | 2 |
| Reports - Evaluative | 1 |
Education Level
| Higher Education | 2 |
| Elementary Secondary Education | 1 |
| Postsecondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Reckase, Mark D.; McCrory, Raven; Floden, Robert E.; Ferrini-Mundy, Joan; Senk, Sharon L. – Educational Assessment, 2015
Numerous researchers have suggested that there are multiple mathematical knowledge and skill areas needed by teachers in order for them to be effective teachers of mathematics: knowledge of the mathematics that are the goals of instruction, advanced mathematics beyond the instructional material, and mathematical knowledge that is specific to what…
Descriptors: Algebra, Knowledge Base for Teaching, Multidimensional Scaling, Psychometrics
Galaczi, Evelina D.; ffrench, Angela; Hubbard, Chris; Green, Anthony – Assessment in Education: Principles, Policy & Practice, 2011
The process of constructing assessment scales for performance testing is complex and multi-dimensional. As a result, a number of different approaches, both empirically and intuitively based, are open to developers. In this paper we outline the approach taken in the revision of a set of assessment scales used with speaking tests, and present the…
Descriptors: Speech Communication, Research Methodology, Foreign Countries, Statistical Analysis
Goldman, Mark S.; Darkes, Jack – Psychological Assessment, 2004
Despite several decades of activity, alcohol expectancy research has yet to merge measurement approaches with developing memory theory. This article offers an expectancy assessment approach built on a conceptualization of expectancy as an information processing network. The authors began with multidimensional scaling models of expectancy space,…
Descriptors: Memory, Information Processing, Multidimensional Scaling, Expectation
Peer reviewedSireci, Stephen G.; Geisinger, Kurt F. – Applied Psychological Measurement, 1995
An expanded version of the method of content evaluation proposed by S. G. Sireci and K. F. Giesinger (1992) was evaluated with respect to a national licensure examination and a nationally standardized social studies achievement test. Two groups of 15 subject-matter experts rated the similarity and content relevance of the items. (SLD)
Descriptors: Achievement Tests, Cluster Analysis, Construct Validity, Content Validity
Stacks, Don W.; And Others – 1983
A study provided the initial test of a multidimensional instrument based on the idea that syntactic language choice might predict writing apprehension. The test measured six factors: (1) blank page paralysis, (2) general affect toward writing, (3) positive/negative business affect, (4) alternative modes, (5) attitude toward writing competence, and…
Descriptors: Business Communication, College Students, Evaluation Methods, Higher Education

Direct link
