Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

邮发代号 80-970

2019 Impact Factor: 1.275

Frontiers of Computer Science in China  2011, Vol. 5 Issue (3): 268-278   https://doi.org/10.1007/s11704-011-0023-0
  RESEARCH ARTICLE 本期目录
An improved spectral clustering algorithm based on random walk
An improved spectral clustering algorithm based on random walk
Xianchao ZHANG(), Quanzeng YOU
School of Software, Dalian University of Technology, Dalian 116623, China
 全文: PDF(925 KB)   HTML
Abstract

The construction process for a similarity matrix has an important impact on the performance of spectral clustering algorithms. In this paper, we propose a random walk based approach to process the Gaussian kernel similarity matrix. In this method, the pair-wise similarity between two data points is not only related to the two points, but also related to their neighbors. As a result, the new similarity matrix is closer to the ideal matrix which can provide the best clustering result. We give a theoretical analysis of the similarity matrix and apply this similarity matrix to spectral clustering. We also propose a method to handle noisy items which may cause deterioration of clustering performance. Experimental results on real-world data sets show that the proposed spectral clustering algorithm significantly outperforms existing algorithms.

Key wordsspectral clustering    random walk    probability transition matrix    matrix perturbation
收稿日期: 2010-03-23      出版日期: 2011-09-05
Corresponding Author(s): ZHANG Xianchao,Email:xczhang@dlut.edu.cn   
 引用本文:   
. An improved spectral clustering algorithm based on random walk[J]. Frontiers of Computer Science in China, 2011, 5(3): 268-278.
Xianchao ZHANG, Quanzeng YOU. An improved spectral clustering algorithm based on random walk. Front Comput Sci Chin, 2011, 5(3): 268-278.
 链接本文:  
https://academic.hep.com.cn/fcs/CN/10.1007/s11704-011-0023-0
https://academic.hep.com.cn/fcs/CN/Y2011/V5/I3/268
1 Ng A Y, Jordan M I, Weiss Y. On spectral clustering: analysis and an algorithm. In: Proceedings of Advances in Neural Information Pressing Systems 14 . 2001, 849–856
2 Wang F, Zhang C S, Shen H C, Wang J D. Semi-supervised classification using linear neighborhood propagation. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. . 2006, 160–167
3 Wang F, Zhang C S. Robust self-tuning semi-supervised learning. Neurocomputing , 2006, 70(16-18): 2931–2939
doi: 10.1016/j.neucom.2006.11.004
4 Kamvar S D, Klein D, Manning C D. Spectral learning. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence . 2003, 561–566
5 Lu Z D, Carreira-Perpiňán M A. Constrained spectral clustering through affinity propagation. In: Proceedings of 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition . 2008, 1–8
6 Meila M, Shi J. A random walks view of spectral segmentation. In: Proceedings of 8th International Workshop on Artificial Intelligence and Statistics . 2001
7 Azran A, Ghahramani Z. Spectral methods for automatic multiscale data clustering. In: Proceedings of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition . 2006, 190–197
8 Meila M. The multicut lemma. UW Statistics Technical Report 417 , 2001
9 Shi J, Malik J. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2000, 22(8): 888–905
doi: 10.1109/34.868688
10 Hagen L, Kahng A B. New spectral methods for ratio cut partitioning and clustering. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems , 1992, 11(9): 1074–1085
doi: 10.1109/43.159993
11 Ding C H Q, He X F, Zha H Y, Gu M, Simon H D. A min-max cut algorithm for graph partitioning and data clustering. In: Proceedings of 1st IEEE International Conference on Data Mining , 2001, 107–114
doi: 10.1109/ICDM.2001.989507
12 von Luxburg U. A tutorial on spectral clustering. Statistics and Computing , 2007, 17(4): 395–416
doi: 10.1007/s11222-007-9033-z
13 Zelnik-Manor L, Perona P. Self-tuning spectral clustering. In. Proceedings of Advances in Neural Information Processing Systems 17 . 2004, 1601–1608
14 Huang T, Yang C. Matrix Analysis with Applications. Beijing: Scientific Publishing House, 2007 (in Chinese)
15 Lovász L, Lov L, Erdos O. Random walks on graphs: a survey. Combinatorics , 1993, 2: 353–398
16 Gong C H. Matrix Theory and Applications. Beijing: Scientific Publishing House, 2007 (in Chinese)
17 Tian Z, Li X B, Ju Y W. Spectral clustering based on matrix perturbation theory. Science in China Series F: Information Sciences , 2007, 50(1): 63–81
doi: 10.1007/s11432-007-0007-8
18 Fouss F, Pirotte A, Renders J, Saerens M. Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation. IEEE Transactions on Knowledge and Data Engineering , 2007, 19(3): 355–369
doi: 10.1109/TKDE.2007.46
19 Banerjee A, Dhillon I, Ghosh J, Sra S. Generative model-based clustering of directional data. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . 2003, 19–28
doi: 10.1145/956750.956757
20 Wang L, Leckie C. Ramamohanarao K, Bezdek J C. Approximate spectral clustering. In: Proceedings of 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining . 2009, 134–146
21 Fowlkes C, Belongie S, Chung F, Malik J. Spectral grouping using the Nystr?m method. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(2): 214–225
doi: 10.1109/TPAMI.2004.1262185
22 Puzicha J, Belongie S. Model-based halftoning for color image segmentation. In: Proceedings of 15th International Conference on Pattern Recognition . 2000, 629–632
doi: 10.1109/ICPR.2000.903624
23 Puzicha J, Held M, Ketterer J, Buhmann J M, Fellner D W. On spatial quantization of color images. IEEE Transactions on Image Processing , 2000, 9(4): 666–682
doi: 10.1109/83.841942
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed