ERIC Number: EJ789904
Record Type: Journal
Publication Date: 2008-Mar
Pages: 17
Abstractor: Author
ISBN: N/A
ISSN: ISSN-0033-3123
EISSN: N/A
Available Date: N/A
Optimal Partitioning of a Data Set Based on the "p"-Median Model
Brusco, Michael J.; Kohn, Hans-Friedrich
Psychometrika, v73 n1 p89-105 Mar 2008
Although the "K"-means algorithm for minimizing the within-cluster sums of squared deviations from cluster centroids is perhaps the most common method for applied cluster analyses, a variety of other criteria are available. The "p"-median model is an especially well-studied clustering problem that requires the selection of "p" objects to serve as cluster centers. The objective is to choose the cluster centers such that the sum of the Euclidean distances (or some other dissimilarity measure) of objects assigned to each center is minimized. Using 12 data sets from the literature, we demonstrate that a three-stage procedure consisting of a greedy heuristic, Lagrangian relaxation, and a branch-and-bound algorithm can produce globally optimal solutions for "p"-median problems of nontrivial size (several hundred objects, five or more variables, and up to 10 clusters). We also report the results of an application of the "p"-median model to an empirical data set from the telecommunications industry.
Descriptors: Telecommunications, Item Response Theory, Multivariate Analysis, Heuristics, Mathematical Models
Springer. 233 Spring Street, New York, NY 10013. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-348-4505; e-mail: service-ny@springer.com; Web site: http://www.springerlink.com.bibliotheek.ehb.be
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: Researchers
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: N/A