Please wait a minute...
Frontiers of Electrical and Electronic Engineering

ISSN 2095-2732

ISSN 2095-2740(Online)

CN 10-1028/TM

Front Elect Electr Eng    2012, Vol. 7 Issue (4) : 357-366    https://doi.org/10.1007/s11460-012-0207-x
RESEARCH ARTICLE
Co-regulated gene module detection for time series gene expression data
Wanwan TANG(), Rui LI, Shao LI, Yanda LI
MOE Key Laboratory of Bioinformatics, Bioinformatics Division, Tsinghua National Laboratory of Information Science and Technology / Department of Automation, Tsinghua University, Beijing 100084, China
 Download: PDF(411 KB)   HTML
 Export: BibTeX | EndNote | Reference Manager | ProCite | RefWorks
Abstract

It is important to detect interaction effect of multiple genes during certain biological process. In this paper, we proposed, from systems biology perspective, the concept of co-regulated gene module, which consists of genes that are regulated by the same regulator(s). Given a time series gene expression data, a hidden Markov model-based Bayesian model was developed to calculate the likelihood of the observed data, assuming the co-regulated gene modules are known. We further developed a Gibbs sampling strategy that is integrated with reversible jump Markov chain Monte Carlo to obtain the posterior probabilities of the co-regulated gene modules. Simulation study validated the proposed method. When compared with two existing methods, the proposed approach significantly outperformed the conventional methods.

Keywords co-regulated gene module      Bayesian      hidden Markov model      Markov chain Monte Carlo     
Corresponding Author(s): TANG Wanwan,Email:tww05@mails.tsinghua.edu.cn   
Issue Date: 05 December 2012
 Cite this article:   
Wanwan TANG,Rui LI,Shao LI, et al. Co-regulated gene module detection for time series gene expression data[J]. Front Elect Electr Eng, 2012, 7(4): 357-366.
 URL:  
https://academic.hep.com.cn/fee/EN/10.1007/s11460-012-0207-x
https://academic.hep.com.cn/fee/EN/Y2012/V7/I4/357
Fig.1  Relationship between genes and their regulators. Regulated genes are contained in CrGMs and the unregulated ones are outside the modules. Each CrGM has a hidden regulator.
Fig.1  Relationship between genes and their regulators. Regulated genes are contained in CrGMs and the unregulated ones are outside the modules. Each CrGM has a hidden regulator.
Fig.1  Relationship between genes and their regulators. Regulated genes are contained in CrGMs and the unregulated ones are outside the modules. Each CrGM has a hidden regulator.
Fig.1  Relationship between genes and their regulators. Regulated genes are contained in CrGMs and the unregulated ones are outside the modules. Each CrGM has a hidden regulator.
Fig.1  Relationship between genes and their regulators. Regulated genes are contained in CrGMs and the unregulated ones are outside the modules. Each CrGM has a hidden regulator.
Fig.2  Relationship between increments of genes and states of the regulator in a CrGM in certain biologic process
Fig.2  Relationship between increments of genes and states of the regulator in a CrGM in certain biologic process
Fig.2  Relationship between increments of genes and states of the regulator in a CrGM in certain biologic process
Fig.2  Relationship between increments of genes and states of the regulator in a CrGM in certain biologic process
Fig.2  Relationship between increments of genes and states of the regulator in a CrGM in certain biologic process
modelordernumber of genesnumber of CrGMsnumber of genes in CrGM 1number of genes in CrGM 2
Model 11202510
Model 211002510
Model 32202510
Model 421002510
Tab.1  Four time series microarray models with different characteristics
modelCrGMordertransition matrix
Model 1 / 2CrGM 11[0100.050.050.90.90.050.05]
CrGM 21[0.050.90.051/31/31/30.90.050.05]
Model 3 / 4CrGM 12[0.90.050.051/31/31/30.050.050.91/31/31/30.050.90.051/31/31/30.90.050.051/31/31/30.050.050.9]
CrGM 22[1/31/31/30.050.90.051/31/31/30.050.050.91/31/31/30.90.050.051/31/31/30.050.90.051/31/31/3]
Tab.2  Transition matrices
state value123
CrGM 1Gene 1-0.40.80
Gene 20.80.40
Gene 30.4-0.4-0.8
Gene 40-0.40.8
Gene 50.80.6-0.8
CrGM 2Gene 11.40.2-1.4
Gene 20.61.20.6,
Gene 31.2-1-1.4
Gene 4-1-1.2-0.2
Gene 5-0.2-0.61.2
Gene 6-0.2-1-1.4
Gene 7-1-1.41
Gene 8-1.40.6-0.2
Gene 9-10.6-0.6
Gene 10-1.20.21.2
Tab.3  Mean of the Gaussian distribution for emission matrices
Fig.3  Posterior probabilities of genes that belong to certain modules. For each time series microarray models, the posterior probabilities of genes that belong to CrGMs are plotted. Posterior probabilities of all the other genes are all zero. For all the four models, counting backward in the gene list, the 1st to the 10th genes are in the same CrGM and the 11th to the 15th genes are in the same CrGM. (a) Model 1; (b) Model 2; (c) Model 3; (d) Model 4.
Fig.3  Posterior probabilities of genes that belong to certain modules. For each time series microarray models, the posterior probabilities of genes that belong to CrGMs are plotted. Posterior probabilities of all the other genes are all zero. For all the four models, counting backward in the gene list, the 1st to the 10th genes are in the same CrGM and the 11th to the 15th genes are in the same CrGM. (a) Model 1; (b) Model 2; (c) Model 3; (d) Model 4.
Fig.3  Posterior probabilities of genes that belong to certain modules. For each time series microarray models, the posterior probabilities of genes that belong to CrGMs are plotted. Posterior probabilities of all the other genes are all zero. For all the four models, counting backward in the gene list, the 1st to the 10th genes are in the same CrGM and the 11th to the 15th genes are in the same CrGM. (a) Model 1; (b) Model 2; (c) Model 3; (d) Model 4.
Fig.3  Posterior probabilities of genes that belong to certain modules. For each time series microarray models, the posterior probabilities of genes that belong to CrGMs are plotted. Posterior probabilities of all the other genes are all zero. For all the four models, counting backward in the gene list, the 1st to the 10th genes are in the same CrGM and the 11th to the 15th genes are in the same CrGM. (a) Model 1; (b) Model 2; (c) Model 3; (d) Model 4.
Fig.3  Posterior probabilities of genes that belong to certain modules. For each time series microarray models, the posterior probabilities of genes that belong to CrGMs are plotted. Posterior probabilities of all the other genes are all zero. For all the four models, counting backward in the gene list, the 1st to the 10th genes are in the same CrGM and the 11th to the 15th genes are in the same CrGM. (a) Model 1; (b) Model 2; (c) Model 3; (d) Model 4.
Fig.4  Comparison of CrgMODE, -means, and WGCNA.
Fig.4  Comparison of CrgMODE, -means, and WGCNA.
Fig.4  Comparison of CrgMODE, -means, and WGCNA.
Fig.4  Comparison of CrgMODE, -means, and WGCNA.
Fig.4  Comparison of CrgMODE, -means, and WGCNA.
1 Mootha V K, Lindgren C M, Eriksson K F, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstr?le M, Laurila E, Houstis N, Daly M J, Patterson N, Mesirov J P, Golub T R, Tamayo P, Spiegelman B, Lander E S, Hirschhorn J N, Altshuler D, Groop L C. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nature Genetics , 2003, 34(3): 267–273
doi: 10.1038/ng1180 pmid:12808457
2 Hartwell L H, Hopfield J J, Leibler S, Murray A W. From molecular to modular cell biology. Nature , 1999, 402(6761 Suppl): C47–C52
doi: 10.1038/35011540 pmid:10591225
3 Wang L, Zhang B, Wolfinger R D, Chen X. An integrated approach for the analysis of biological pathways using mixed models. PLoS Genetics , 2008, 4(7): e1000115
4 Gu J, Chen Y, Li S, Li Y. Identification of responsive gene modules by network-based gene clustering and extending: application to inflammation and angiogenesis. BMC Systems Biology , 2010, 4(1): 47
doi: 10.1186/1752-0509-4-47 pmid:20406493
5 Eisen M B, Spellman P T, Brown P O, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences of the United States of America , 1998, 95(25): 14863–14868
doi: 10.1073/pnas.95.25.14863 pmid:9843981
6 Tavazoie S, Hughes J D, Campbell M J, Cho R J, Church G M. Systematic determination of genetic network architecture. Nature Genetics , 1999, 22(3): 281–285
doi: 10.1038/10343 pmid:10391217
7 Tamayo P, Slonim D, Mesirov J, Zhu Q, Kitareewan S, Dmitrovsky E, Lander E S, Golub T R. Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. Proceedings of the National Academy of Sciences of the United States of America , 1999, 96(6): 2907–2912
doi: 10.1073/pnas.96.6.2907 pmid:10077610
8 Carter S L, Brechbühler C M, Griffin M, Bond A T. Gene co-expression network topology provides a framework for molecular characterization of cellular state. Bioinformatics , 2004, 20(14): 2242–2250
doi: 10.1093/bioinformatics/bth234 pmid:15130938
9 Davidson G S, Wylie B N, Boyack K W. Cluster stability and the use of noise in interpretation of clustering. In: Proceedings of IEEE Symposium on Information Visualization 2001 . 2001, 23–30
10 Elo L L, J?rvenp?? H, Oresic M, Lahesmaa R, Aittokallio T. Systematic construction of gene coexpression networks with applications to human T helper cell differentiation process. Bioinformatics , 2007, 23(16): 2096–2103
doi: 10.1093/bioinformatics/btm309 pmid:17553854
11 Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE , 1989, 77(2): 257–286
doi: 10.1109/5.18626
12 Baum L E, Petrie T, Soules G, Weiss N. A Maximization Technique Occurring in Statistical Analysis of Probabilistic Functions of Markov Chains. Annals of Mathematical Statistics , 1970, 41(1): 164–171
doi: 10.1214/aoms/1177697196
13 Green P J. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika , 1995, 82(4): 711–732
doi: 10.1093/biomet/82.4.711
14 Strauss D J. Clustering Algorithms — Hartigan, JA. Biometrics , 1975, 31(3): 793
doi: 10.2307/2529577
15 Oldham M C, Horvath S, Geschwind D H. Conservation and evolution of gene coexpression networks in human and chimpanzee brains. Proceedings of the National Academy of Sciences of the United States of America , 2006, 103(47): 17973–17978
doi: 10.1073/pnas.0605938103 pmid:17101986
16 Horvath S, Zhang B, Carlson M, Lu K V, Zhu S, Felciano R M, Laurance M F, Zhao W, Qi S, Chen Z, Lee Y, Scheck A C, Liau L M, Wu H, Geschwind D H, Febbo P G, Kornblum H I, Cloughesy T F, Nelson S F, Mischel P S. Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target. Proceedings of the National Academy of Sciences of the United States of America , 2006, 103(46): 17402–17407
doi: 10.1073/pnas.0608396103 pmid:17090670
17 Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics and Molecular Biology , 2005, 4: Article17
18 Tang W, Wu X, Jiang R, Li Y. Epistatic module detection for case-control studies: a Bayesian model with a Gibbs sampling strategy. PLoS Genetics , 2009, 5(5): e1000464
doi: 10.1371/journal.pgen.1000464 pmid:19412524
[1] Lincong WANG, Shuxue ZOU, Yao WANG. Algorithmic challenges in structure-based drug design and NMR structural biology[J]. Front Elect Electr Eng, 2012, 7(1): 69-84.
[2] Lei XU. On essential topics of BYY harmony learning: Current status, challenging issues, and gene analysis applications[J]. Front Elect Electr Eng, 2012, 7(1): 147-196.
[3] Penghui WANG, Lei SHI, Lan DU, Hongwei LIU, Lei XU, Zheng BAO. Radar HRRP statistical recognition with temporal factor analysis by automatic Bayesian Ying-Yang harmony learning[J]. Front Elect Electr Eng Chin, 2011, 6(2): 300-317.
[4] Zaihu PANG, Shikui TU, Dan SU, Xihong WU, Lei XU. Discriminative training of GMM-HMM acoustic model by RPCL learning[J]. Front Elect Electr Eng Chin, 2011, 6(2): 283-290.
[5] Shikui TU, Lei XU. Parameterizations make different model selections: Empirical findings from factor analysis[J]. Front Elect Electr Eng Chin, 2011, 6(2): 256-274.
[6] Lei SHI, Shikui TU, Lei XU. Learning Gaussian mixture with automatic model selection: A comparative study on three Bayesian related approaches[J]. Front Elect Electr Eng Chin, 2011, 6(2): 215-244.
[7] Payam S. RAHMDEL, Minh Nhut NGUYEN, Liying ZHENG. Optimization of fuzzy CMAC using evolutionary Bayesian Ying-Yang learning[J]. Front Elect Electr Eng Chin, 2011, 6(2): 208-214.
[8] Lei XU. Codimensional matrix pairing perspective of BYY harmony learning: hierarchy of bilinear systems, joint decomposition of data-covariance, and applications of network biology[J]. Front Elect Electr Eng Chin, 2011, 6(1): 86-119.
[9] Lei XU. Bayesian Ying-Yang system, best harmony learning, and five action circling[J]. Front Elect Electr Eng Chin, 2010, 5(3): 281-328.
[10] Hao WU, Xihong WU, Huisheng CHI, . Product HMM-based training method for acoustic model with multiple-size units[J]. Front. Electr. Electron. Eng., 2010, 5(1): 65-71.
[11] SONG Xin, WANG Jinkuan, HAN Yinghua, WANG Han. Robust adaptive beamforming algorithm based on Bayesian approach[J]. Front. Electr. Electron. Eng., 2008, 3(4): 435-439.
[12] QIANG Lei, XIAO Tian-yuan. Cooperated Bayesian algorithm for distributed scheduling problem[J]. Front. Electr. Electron. Eng., 2006, 1(3): 251-254.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed