Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front Comput Sci Chin    2011, Vol. 5 Issue (3) : 268-278    https://doi.org/10.1007/s11704-011-0023-0
RESEARCH ARTICLE
An improved spectral clustering algorithm based on random walk
Xianchao ZHANG(), Quanzeng YOU
School of Software, Dalian University of Technology, Dalian 116623, China
 Download: PDF(925 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

The construction process for a similarity matrix has an important impact on the performance of spectral clustering algorithms. In this paper, we propose a random walk based approach to process the Gaussian kernel similarity matrix. In this method, the pair-wise similarity between two data points is not only related to the two points, but also related to their neighbors. As a result, the new similarity matrix is closer to the ideal matrix which can provide the best clustering result. We give a theoretical analysis of the similarity matrix and apply this similarity matrix to spectral clustering. We also propose a method to handle noisy items which may cause deterioration of clustering performance. Experimental results on real-world data sets show that the proposed spectral clustering algorithm significantly outperforms existing algorithms.

Keywords spectral clustering      random walk      probability transition matrix      matrix perturbation     
Corresponding Author(s): ZHANG Xianchao,Email:xczhang@dlut.edu.cn   
Issue Date: 05 September 2011
 Cite this article:   
Xianchao ZHANG,Quanzeng YOU. An improved spectral clustering algorithm based on random walk[J]. Front Comput Sci Chin, 2011, 5(3): 268-278.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-011-0023-0
https://academic.hep.com.cn/fcs/EN/Y2011/V5/I3/268
1 Ng A Y, Jordan M I, Weiss Y. On spectral clustering: analysis and an algorithm. In: Proceedings of Advances in Neural Information Pressing Systems 14 . 2001, 849–856
2 Wang F, Zhang C S, Shen H C, Wang J D. Semi-supervised classification using linear neighborhood propagation. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. . 2006, 160–167
3 Wang F, Zhang C S. Robust self-tuning semi-supervised learning. Neurocomputing , 2006, 70(16-18): 2931–2939
doi: 10.1016/j.neucom.2006.11.004
4 Kamvar S D, Klein D, Manning C D. Spectral learning. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence . 2003, 561–566
5 Lu Z D, Carreira-Perpiňán M A. Constrained spectral clustering through affinity propagation. In: Proceedings of 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition . 2008, 1–8
6 Meila M, Shi J. A random walks view of spectral segmentation. In: Proceedings of 8th International Workshop on Artificial Intelligence and Statistics . 2001
7 Azran A, Ghahramani Z. Spectral methods for automatic multiscale data clustering. In: Proceedings of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition . 2006, 190–197
8 Meila M. The multicut lemma. UW Statistics Technical Report 417 , 2001
9 Shi J, Malik J. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2000, 22(8): 888–905
doi: 10.1109/34.868688
10 Hagen L, Kahng A B. New spectral methods for ratio cut partitioning and clustering. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems , 1992, 11(9): 1074–1085
doi: 10.1109/43.159993
11 Ding C H Q, He X F, Zha H Y, Gu M, Simon H D. A min-max cut algorithm for graph partitioning and data clustering. In: Proceedings of 1st IEEE International Conference on Data Mining , 2001, 107–114
doi: 10.1109/ICDM.2001.989507
12 von Luxburg U. A tutorial on spectral clustering. Statistics and Computing , 2007, 17(4): 395–416
doi: 10.1007/s11222-007-9033-z
13 Zelnik-Manor L, Perona P. Self-tuning spectral clustering. In. Proceedings of Advances in Neural Information Processing Systems 17 . 2004, 1601–1608
14 Huang T, Yang C. Matrix Analysis with Applications. Beijing: Scientific Publishing House, 2007 (in Chinese)
15 Lovász L, Lov L, Erdos O. Random walks on graphs: a survey. Combinatorics , 1993, 2: 353–398
16 Gong C H. Matrix Theory and Applications. Beijing: Scientific Publishing House, 2007 (in Chinese)
17 Tian Z, Li X B, Ju Y W. Spectral clustering based on matrix perturbation theory. Science in China Series F: Information Sciences , 2007, 50(1): 63–81
doi: 10.1007/s11432-007-0007-8
18 Fouss F, Pirotte A, Renders J, Saerens M. Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation. IEEE Transactions on Knowledge and Data Engineering , 2007, 19(3): 355–369
doi: 10.1109/TKDE.2007.46
19 Banerjee A, Dhillon I, Ghosh J, Sra S. Generative model-based clustering of directional data. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . 2003, 19–28
doi: 10.1145/956750.956757
20 Wang L, Leckie C. Ramamohanarao K, Bezdek J C. Approximate spectral clustering. In: Proceedings of 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining . 2009, 134–146
21 Fowlkes C, Belongie S, Chung F, Malik J. Spectral grouping using the Nystr?m method. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(2): 214–225
doi: 10.1109/TPAMI.2004.1262185
22 Puzicha J, Belongie S. Model-based halftoning for color image segmentation. In: Proceedings of 15th International Conference on Pattern Recognition . 2000, 629–632
doi: 10.1109/ICPR.2000.903624
23 Puzicha J, Held M, Ketterer J, Buhmann J M, Fellner D W. On spatial quantization of color images. IEEE Transactions on Image Processing , 2000, 9(4): 666–682
doi: 10.1109/83.841942
[1] Chengyu WANG, Guomin ZHOU, Xiaofeng HE, Aoying ZHOU. NERank+: a graph-based approach for entity ranking in document collections[J]. Front. Comput. Sci., 2018, 12(3): 504-517.
[2] Heng CHEN, Hai JIN, Feng ZHAO. PSG: a two-layer graph model for document summarization[J]. Front. Comput. Sci., 2014, 8(1): 119-130.
[3] Jianhua JIA, Bingxiang LIU, Licheng JIAO. Soft spectral clustering ensemble applied to image segmentation[J]. Front Comput Sci Chin, 2011, 5(1): 66-78.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed