Please wait a minute...
Frontiers of Computer Science

ISSN 2095-2228

ISSN 2095-2236(Online)

CN 10-1014/TP

Postal Subscription Code 80-970

2018 Impact Factor: 1.129

Front Comput Sci    2013, Vol. 7 Issue (2) : 204-213    https://doi.org/10.1007/s11704-013-1076-z
RESEARCH ARTICLE
A semi-supervised hierarchical approach: two-dimensional clustering of microarray gene expression data
R PRISCILLA(), S SWAMYNATHAN
Department of Information and Science and Technology, The College of Engineering, Guindy Campus, Anna University, Chennai 600025, India
 Download: PDF(957 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

Micro array technologies have become a widespread research technique for biomedical researchers to assess tens of thousands of gene expression values simultaneously in a single experiment. Micro array data analysis for biological discovery requires computational tools. In this research a novel two-dimensional hierarchical clustering is presented. From the review, it is evident that the previous research works have used clustering which have been applied in gene expression data to create only one cluster for a gene that leads to biological complexity. This is mainly because of the nature of proteins and their interactions. Since proteins normally interact with different groups of proteins in order to serve different biological roles, the genes that produce these proteins are therefore expected to co express with more than one group of genes. This constructs that in micro array gene expression data, a gene may makes its presence in more than one cluster. In this research, multi-level micro array clustering, performed in two dimensions by the proposed two-dimensional hierarchical clustering technique can be used to represent the existence of genes in one or more clusters consistent with the nature of the gene and its attributes and prevent biological complexities.

Keywords clustering      hierarchical clustering      supervised clustering      overlapping clustering     
Corresponding Author(s): PRISCILLA R,Email:prisci_durai@yahoo.co.in   
Issue Date: 01 April 2013
 Cite this article:   
R PRISCILLA,S SWAMYNATHAN. A semi-supervised hierarchical approach: two-dimensional clustering of microarray gene expression data[J]. Front Comput Sci, 2013, 7(2): 204-213.
 URL:  
https://academic.hep.com.cn/fcs/EN/10.1007/s11704-013-1076-z
https://academic.hep.com.cn/fcs/EN/Y2013/V7/I2/204
1 Liang J, Kachalo S. Computational analysis of microarray gene expression profiles: clustering, classification, and beyond. Chemometrics and Intelligent Laboratory Systems , 2002, 62(2): 199-216
doi: 10.1016/S0169-7439(02)00014-X
2 Khlopova N S, Glazko V I, Glazko T T. Differentiation of gene expression profiles data for liver and kidney of pigs. World Academy of Science, Engineering and Technology , 2009, 31: 263-266
3 Sarmah C, Samarasinghe S, Kulasiri D, Catchpoole D. A simple affymetrix ratio-transformation method yields comparable expression level quantifications with cdna data. International Journal of Biological and Life Sciences , 2012, 8(3): 157-162
4 Gruzdz A, Ihnatowicz A, Siddiqi J, Akhgar B. Mining genes relations in microarray data combined with ontology in colon cancer automated diagnosis system. World Academy of Science, Engineering and Technology , 2008(16): 920-28
5 Cvek U, Trutschl M, Randolph Stone I, Syed Z, Clifford J, Sabichi A. Multidimensional visualization tools for analysis of expression data. World Academy of Science, Engineering and Technology , 2009(30): 281-89
6 Kim S, Choi T, Bae J. Fuzzy types clustering for microarray data. International Journal of Computational Intelligence , 2006, 2(1): 12-15
7 Wu X, Chen Y, Brooks B, Su Y. The local maximum clustering method and its application in microarray gene expression data analysis. EURASIP Journal on Advances in Signal Processing , 1900, 2004(1): 53-63
doi: 10.1155/S1110865704309145
8 Kim S, Lee J, Bae J. Iterative clustering algorithm for analyzing temporal patterns of gene expression. World Academy of Science, Engineering and Technology , 2007(4): 502-505
9 Chen G, Jaradat S, Banerjee N, Tanaka T, Ko M, Zhang M. Evaluation and comparison of clustering algorithms in analyzing ES cell gene expression data. Statistica Sinica , 2002, 12(1): 241-262
10 Qin Z. Clustering microarray gene expression data using weighted chinese restaurant process. Bioinformatics , 2006, 22(16): 1988-1997
doi: 10.1093/bioinformatics/btl284
11 Lee M, Kim Y, Kim Y, Lee Y, Yoon H. An ant-based clustering system for knowledge discovery in DNA chip analysis data. In: Proceedings of World Academy of Science, Engineering and Technology . 2007, (5): 261-266
12 Wang R, Scharenbroich L, Hart C, Wold B, Mjolsness E. Clustering analysis of microarray gene expression data by splitting algorithm. Journal of Parallel and Distributed Computing , 2003, 63(7): 692-706
doi: 10.1016/S0743-7315(03)00085-6
13 Kalocsai P, Shams S. Visualization and analysis of gene expression data. Journal of the Association for Laboratory Automation , 1999, 4(5): 58-61
doi: 10.1016/S1535-5535(04)00033-4
14 Van Der Laan M, Pollard K. A new algorithm for hybrid clustering of gene expression data with visualization and the bootstrap. Journal of Statistical Planning and Inference , 2003, 117: 275-303
doi: 10.1016/S0378-3758(02)00388-9
15 Do J, Choi D, others. Clustering approaches to identifying gene expression patterns from DNA microarray data. Molecules and Cells , 2008, 25(2): 279
16 Trepalin S, Yarkov A. Hierarchical clustering of large databases and classification of antibiotics at high noise levels. Algorithms , 2008, 1(2): 183-200
doi: 10.3390/a1020183
17 Tuncbag N, Haliloglu T, Keskin O. Correspondence between function and interaction in protein interaction network of saccaromyces cerevisiae. International Journal of Biological and Medical Sciences , 2006, 1(3): 167-174
18 Kim S, Hamasaki T. Evaluation of clustering based on preprocessing in gene expression data. International Journal of Biological, Biomedical and Medical Sciences , 2008, 3(1): 48-53
19 Valarmathie P, Srinath M, Ravichandran T, Dinakaran K. Hybrid fuzzy C-means clustering technique for gene expression data. International Journal of Research and Reviews in Applied Sciences , 2009, 1(1): 33-37
20 Dey L, Mukhopadhyay A. Microarray gene expression data clustering using PSO based K-means algorithm. UACEE International Journal of Computer Science and its Applications , 2009, 1(1): 232-236
21 Mar J, Wells C, Quackenbush J. Defining an informativeness metric for clustering gene expression data. Bioinformatics , 2011, 27(8): 1094-1100
doi: 10.1093/bioinformatics/btr074
22 Jing L, Ng M, Zeng T. Novel hybrid method for gene selection and cancer prediction. World Academy of Science, Engineering and Technology , 2010 (38): 482-489
23 ALL/AML datasets. http://www.broadinstitute.org/cancer/software/ genepattern/datasets/
24 Larsen B, Aone C. Fast and effective text mining using linear-time document clustering. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining . 1999, 16-22
25 Steinbach M, Karypis G, Kumar V, others. A comparison of document clustering techniques. In: KDD Workshop on Text Mining . 2000, 525-526
26 Yin X, Chen S, Hu E, Zhang D. Semi-supervised clustering with metric learning: an adaptive kernel method. Pattern Recognition , 2010, 43(4): 1320-1333
doi: 10.1016/j.patcog.2009.11.005
27 Alfred R. Summarizing relational data using semi-supervised genetic algorithm-based clustering techniques. Journal of Computer Science , 2010, 6(7): 775-784
doi: 10.3844/jcssp.2010.775.784
[1] Panthadeep BHATTACHARJEE, Pinaki MITRA. A survey of density based clustering algorithms[J]. Front. Comput. Sci., 2021, 15(1): 151308-.
[2] Zhihan JIANG, Yan LIU, Xiaoliang FAN, Cheng WANG, Jonathan LI, Longbiao CHEN. Understanding urban structures and crowd dynamics leveraging large-scale vehicle mobility data[J]. Front. Comput. Sci., 2020, 14(5): 145310-.
[3] Xibin DONG, Zhiwen YU, Wenming CAO, Yifan SHI, Qianli MA. A survey on ensemble learning[J]. Front. Comput. Sci., 2020, 14(2): 241-258.
[4] Hui XUE, Sen LI, Xiaohong CHEN, Yunyun WANG. A maximum margin clustering algorithm based on indefinite kernels[J]. Front. Comput. Sci., 2019, 13(4): 813-827.
[5] Ratha PECH, Dong HAO, Hong CHENG, Tao ZHOU. Enhancing subspace clustering based on dynamic prediction[J]. Front. Comput. Sci., 2019, 13(4): 802-812.
[6] Gaoqi HE, Qi CHEN, Dongxu JIANG, Yubo YUAN, Xingjian LU. Physical-barrier detection based collective motion analysis[J]. Front. Comput. Sci., 2019, 13(2): 426-436.
[7] Alampallam Ramaswamy VASUDEVAN,Subramanian SELVAKUMAR. Local outlier factor and stronger one class classifier based hierarchical model for detection of attacks in network intrusion detection dataset[J]. Front. Comput. Sci., 2016, 10(4): 755-766.
[8] Chenggang Clarence YAN,Hongtao XIE,Bing ZHANG,Yanping MA,Qiong DAI,Yizhi LIU. Fast approximate matching of binary codes with distinctive bits[J]. Front. Comput. Sci., 2015, 9(5): 741-750.
[9] Zhikun CHEN,Shuqiang YANG,Shuang TAN,Li HE,Hong YIN,Ge ZHANG. A new fragment re-allocation strategy for NoSQL database systems[J]. Front. Comput. Sci., 2015, 9(1): 111-127.
[10] Lu LIU,Tao PENG. Clustering-based topical Web crawling using CFu-tree guided by link-context[J]. Front. Comput. Sci., 2014, 8(4): 581-595.
[11] Qingliang MIAO,Qiudan LI,Daniel ZENG,Yao MENG,Shu ZHANG,Hao YU. Entity attribute discovery and clustering from online reviews[J]. Front. Comput. Sci., 2014, 8(2): 279-288.
[12] Yaobin HE, Haoyu TAN, Wuman LUO, Shengzhong FENG, Jianping FAN. MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data[J]. Front. Comput. Sci., 2014, 8(1): 83-99.
[13] Ran LIU, Wenjian LUO, Lihua YUE. Classifying and clustering in negative databases[J]. Front Comput Sci, 2013, 7(6): 864-874.
[14] Yuwei WANG, Yuanchun ZHOU, Ying LIU, Ze LUO, Danhuai GUO, Jing SHAO, Fei TAN, Liang WU, Jianhui LI, Baoping YAN. A grid-based clustering algorithm for wild bird distribution[J]. Front Comput Sci, 2013, 7(4): 475-485.
[15] Lai KANG, Lingda WU, Yee-Hong YANG. A novel unsupervised approach for multilevel image clustering from unordered image collection[J]. Front Comput Sci, 2013, 7(1): 69-82.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed