Parallel WaveCluster: A linear scaling parallel clustering algorithm implementation with application to very large datasets
No Thumbnail Available
Date
2011
Journal Title
Journal ISSN
Volume Title
Publisher
Academic Press inc Elsevier Science
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
A linear scaling parallel clustering algorithm implementation and its application to very large datasets for cluster analysis is reported. WaveCluster is a novel clustering approach based on wavelet transforms. Despite this approach has an ability to detect clusters of arbitrary shapes in an efficient way, it requires considerable amount of time to collect results for large sizes of multi-dimensional datasets. We propose the parallel implementation of the WaveCluster algorithm based on the message passing model for a distributed-memory multiprocessor system. In the proposed method, communication among processors and memory requirements are kept at minimum to achieve high efficiency. We have conducted the experiments on a dense dataset and a sparse dataset to measure the algorithm behavior appropriately. Our results obtained from performed experiments demonstrate that developed parallel WaveCluster algorithm exposes high speedup and scales linearly with the increasing number of processors. (C) 2011 Elsevier Inc. All rights reserved.
Description
Yildirim, Ahmet Artu/0000-0001-6555-765X; Ozdogan, Cem/0000-0002-9644-0013
Keywords
Cluster Analysis, Wavecluster Algorithm, Parallel Wavecluster
Turkish CoHE Thesis Center URL
Fields of Science
Citation
Yıldırım, A.A., Özdoğan, C. (2011). Parallel WaveCluster: A linear scaling parallel clustering algorithm implementation with application to very large datasets. Journal of Parallel and Distributed Computing, 71(7), 955-962. http://dx.doi.org/10.1016/j.jpdc.2011.03.007
WoS Q
Q2
Scopus Q
Q1
Source
Volume
71
Issue
7
Start Page
955
End Page
962