ACM SIGMOD Anthology VLDB dblp.uni-trier.de

WaveCluster: A Wavelet Based Clustering Approach for Spatial Data in Very Large Databases.

Gholamhosein Sheikholeslami, Surojit Chatterjee, Aidong Zhang: WaveCluster: A Wavelet Based Clustering Approach for Spatial Data in Very Large Databases. VLDB J. 8(3-4): 289-304(2000)
@article{DBLP:journals/vldb/SheikholeslamiCZ00,
  author    = {Gholamhosein Sheikholeslami and
               Surojit Chatterjee and
               Aidong Zhang},
  title     = {WaveCluster: A Wavelet Based Clustering Approach for Spatial
               Data in Very Large Databases},
  journal   = {VLDB J.},
  volume    = {8},
  number    = {3-4},
  year      = {2000},
  pages     = {289-304},
  ee        = {db/journals/vldb/SheikholeslamiCZ00.html},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

Many applications require the management of spatial data in a multidimensional feature space. Clustering large spatial databases is an important problem, which tries to find the densely populated regions in the feature space to be used in data mining, knowledge discovery, or efficient information retrieval. A good clustering approach should be efficient and detect clusters of arbitrary shape. It must be insensitive to the noise (outliers) and the order of input data. We propose WaveCluster, a novel clustering approach based on wavelet transforms, which satisfies all the above requirements. Using the multiresolution property of wavelet transforms, we can effectively identify arbitrarily shaped clusters at different degrees of detail. We also demonstrate that WaveCluster is highly efficient in terms of time complexity. Experimental results on very large datasets are presented, which show the efficiency and effectiveness of the proposed approach compared to the other recent clustering methods.

Copyright © 2000 by Springer, Berlin, Heidelberg. Permission to make digital or hard copies of the abstract is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice along with the full citation.


Online Edition (Springer)

Citation Page

ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 5 Issue 2, JACM, VLDB-J, POS, ..." and ... DVD Version: Load ACM SIGMOD Anthology DVD 2" and ...

References

[AF97]
...
[AGGR98]
Rakesh Agrawal, Johannes Gehrke, Dimitrios Gunopulos, Prabhakar Raghavan: Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications. SIGMOD Conference 1998: 94-105 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[BR95]
...
[COM95]
...
[EKSX96]
Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. KDD 1996: 226-231 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[EKSX98]
Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu: Clustering for Mining in Large Spatial Databases. KI 12(1): 18-24(1998) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Gor81]
...
[HJS94]
M. L. Hilton, Björn D. Jawerth, A. Sengupta: Compressing Still and Moving Images with Wavelets. Multimedia Syst. 2(5): 218-227(1994) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Hor88]
...
[JFS95]
...
[JM95]
Ramesh Jain, S. N. Murthy, Luong Tran, Shankar Chatterjee: Similarity Measures for Image Databases. Storage and Retrieval for Image and Video Databases (SPIE) 1995: 58-65 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Knu98]
...
[KR90]
L. Kaufman, P. J. Rousseeuw: Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley 1990
CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Mal89a]
...
[Mal89b]
...
[NH94]
Raymond T. Ng, Jiawei Han: Efficient and Effective Clustering Methods for Spatial Data Mining. VLDB 1994: 144-155 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[NS80]
David Nassimi, Sartaj Sahni: Finding Connected Components and Connected Ones on a Mesh-Connected Parallel Computer. SIAM J. Comput. 9(4): 744-757(1980) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[Ope77]
...
[OT81]
...
[PFG97]
...
[SC94]
...
[Sch92]
...
[SCZ98]
Gholamhosein Sheikholeslami, Surojit Chatterjee, Aidong Zhang: WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases. VLDB 1998: 428-439 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[SN96]
...
[SV82]
Yossi Shiloach, Uzi Vishkin: An O(log n) Parallel Connectivity Algorithm. J. Algorithms 3(1): 57-67(1982) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[SZ97]
...
[SZB97]
...
[URB97]
...
[Vai93]
...
[WYM97]
Wei Wang, Jiong Yang, Richard R. Muntz: STING: A Statistical Information Grid Approach to Spatial Data Mining. VLDB 1997: 186-195 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[XMKS98]
Xiaowei Xu, Martin Ester, Hans-Peter Kriegel, Jörg Sander: A Distribution-Based Clustering Algorithm for Mining in Large Spatial Databases. ICDE 1998: 324-331 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[YCSZ98]
...
[ZM97]
...
[ZRL96]
Tian Zhang, Raghu Ramakrishnan, Miron Livny: BIRCH: An Efficient Data Clustering Method for Very Large Databases. SIGMOD Conference 1996: 103-114 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

Copyright © Mon Dec 21 22:09:49 2009 by Michael Ley (ley@uni-trier.de)