Transcriptome analyses of insect cells to facilitate baculovirus-insect expression
Kai Yu1,2, Yang Yu1,2, Xiaoyan Tang1,2, Huimin Chen1,2, Junyu Xiao2,3,*, Xiao-Dong Su1,2,*
1. Biodynamic Optical Imaging Center, School of Life Science, Peking University, Beijing 100871, China 2. State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing 100871, China 3. Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
The High Five cell line (BTI-TN-5B1-4), isolated from the cabbage looper, Trichoplusia ni, is an insect cell line widely used for baculovirus-mediated recombinant protein expression. Despite its widespread application in industry and academic laboratories, the genomic background of this cell line remains unclear. Here we sequenced the transcriptome of High Five cells and assembled 25,234 transcripts. Codon usage analysis showed that High Five cells have a robust codon usage capacity and are therefore suitable for expressing proteins of both eukaryotic and prokaryotic origin. Genes involved in glycosylation were profiled in our study, providing guidance for engineering glycosylated proteins in insect cells. We also predicted signal peptides for transcripts with high expression abundance in both the High Five and Sf21 cell lines, and these results have important implications for optimizing the expression levels of some secretory and membrane proteins.
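As an illustration of what a codon usage analysis computes, the following minimal sketch tallies codon frequencies (per thousand codons) over a set of coding sequences. It is not the pipeline used in this study: the function names `codon_counts` and `codon_usage_per_thousand` and the toy sequences are hypothetical, and the sketch assumes each input is already an in-frame coding sequence starting at its first base.

```python
from collections import Counter


def codon_counts(cds):
    """Count codons in one coding sequence (assumed in frame from index 0)."""
    cds = cds.upper().replace("U", "T")      # accept RNA or DNA alphabets
    usable = len(cds) - len(cds) % 3         # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    # Skip codons containing ambiguous bases such as N
    return Counter(c for c in codons if set(c) <= set("ACGT"))


def codon_usage_per_thousand(sequences):
    """Pool codon counts over all sequences and scale to frequency per 1000."""
    total = Counter()
    for seq in sequences:
        total.update(codon_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {codon: 1000.0 * count / n for codon, count in total.items()}


# Toy input (hypothetical sequences, not data from this study)
toy_cds = ["ATGGCTGCTAAATAA", "ATGGCCAAGTAA"]
usage = codon_usage_per_thousand(toy_cds)
for codon in sorted(usage):
    print(codon, round(usage[codon], 1))
```

In practice, a table like this would be computed from the predicted coding regions of the assembled transcripts and compared with the codon usage of the gene to be expressed.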