Quantitative Biology
Cover Story   2016, Volume 4 Issue 4
Single-cell RNA sequencing (scRNA-seq) is an emerging technology that enables high resolution detection of heterogeneities between cells. One important application of scRNA-seq data is to detect differential expression (DE) of genes between different types or subtypes of cells or between cells of different conditions. Currently, many researchers st [Detail] ...
   Online First

Administered by

, Volume 4 Issue 4

For Selected: View Abstracts Toggle Thumbnails
RESEARCH ARTICLE
Differential expression analyses for single-cell RNA-Seq: old questions on new data
Zhun Miao, Xuegong Zhang
Quant. Biol.. 2016, 4 (4): 243-260.  
https://doi.org/10.1007/s40484-016-0089-7

Author Summary   Abstract   HTML   PDF (3763KB)

Background: Single-cell RNA sequencing (scRNA-seq) is an emerging technology that enables high resolution detection of heterogeneities between cells. One important application of scRNA-seq data is to detect differential expression (DE) of genes. Currently, some researchers still use DE analysis methods developed for bulk RNA-Seq data on single-cell data, and some new methods for scRNA-seq data have also been developed. Bulk and single-cell RNA-seq data have different characteristics. A systematic evaluation of the two types of methods on scRNA-seq data is needed.

Results: In this study, we conducted a series of experiments on scRNA-seq data to quantitatively evaluate 14 popular DE analysis methods, including both of traditional methods developed for bulk RNA-seq data and new methods specifically designed for scRNA-seq data. We obtained observations and recommendations for the methods under different situations.

Conclusions: DE analysis methods should be chosen for scRNA-seq data with great caution with regard to different situations of data. Different strategies should be taken for data with different sample sizes and/or different strengths of the expected signals. Several methods for scRNA-seq data show advantages in some aspects, and DEGSeq tends to outperform other methods with respect to consistency, reproducibility and accuracy of predictions on scRNA-seq data.

Differential expression (DE) analysis is to find the genes whose expression values are significantly different among the groups of samples compared. Gene expression values could be measured by bulk RNA sequencing (RNA-seq) or single-cell RNA sequencing (scRNA-seq), which is emerging recently and could get the expression values of individual cell, while DE analysis methods designed for bulk RNA-seq are still commonly used on scRNA-seq data. We found that, since the characteristics of the two kinds of data are quite different, different DE analysis methods should be carefully chosen with regard to different situations of data when applied to scRNA-seq.
Figures and Tables | References | Supplementary Material | Related Articles | Metrics
On the possibility of identifying human subjects using behavioural complexity analyses
Petr Kloucek, Armin von Gunten
Quant. Biol.. 2016, 4 (4): 261-269.  
https://doi.org/10.1007/s40484-016-0088-8

Author Summary   Abstract   HTML   PDF (997KB)

Background: Identification of human subjects using a geometric approach to complexity analysis of behavioural data is designed to provide a basis for a more precise diagnosis leading towards personalised medicine.

Methods: The approach is based on capturing behavioural time-series that can be characterized by a fractional dimension using non-invasive longer-time acquisitions of heart rate, perfusion, blood oxygenation, skin temperature, relative movement and steps frequency. The geometry based approach consists in the analysis of the area and centroid of convex hulls encapsulating the behavioural data represented in Euclidian index spaces based on the scaling properties of the self-similar normally distributed behavioural time-series of the above mentioned quantities.

Results: An example demonstrating the presented approach of behavioural fingerprinting is provided using sensory data of eight healthy human subjects based on approximately fifteen hours of data acquisition. Our results show that healthy subjects can be factorized to different similarity groups based on a particular choice of a convex hull in the corresponding Euclidian space. One of the results indicates that healthy subjects share only a small part of the convex hull pertaining to a highly trained individual from the geometric comparison point of view. Similarly, the presented pair-wise individual geometric similarity measure indicates large differences among the subjects suggesting the possibility of neuro-fingerprinting.

Conclusions: Recently introduced multi-channel body-attached sensors provide a possibility to acquire behavioural time-series that can be mathematically analysed to obtain various objective measures of behavioural patterns yielding behavioural diagnoses favouring personalised treatments of, e.g., neuropathologies or aging.

The presented communication is based on the assumption that complexity indexing of behavioral patterns provides objective and quantitative measurements of disorders as well as response to drugs and other treatments. The concept itself stems from and relies on mathematical frameworks yielding macroscopic characterizations of these patterns based on their microscopic dynamical properties. The results show that a longer-time use of novel non-disruptive multichannel body attached sensors can provide time-series that can be used to characterize humans and their behavioural patterns.
Figures and Tables | References | Related Articles | Metrics
A resistant method for landmark-based analysis of individual asymmetry in two dimensions
Sebastián Torcida, Paula Gonzalez, Federico Lotto
Quant. Biol.. 2016, 4 (4): 270-282.  
https://doi.org/10.1007/s40484-016-0086-x

Author Summary   Abstract   HTML   PDF (1236KB)

Background: Symmetry of biological structures can be thought as the repetition of their parts in different positions and orientations. Asymmetry analyses, therefore, focuses on identifying and measuring the location and extent of symmetry departures in such structures. In the context of geometric morphometrics, a key step when studying morphological variation is the estimation of the symmetric shape. The standard procedure uses the least-squares Procrustes superimposition, which by averaging shape differences often underestimates the symmetry departures thus leading to an inaccurate description of the asymmetry pattern. Moreover, the corresponding asymmetry values are neither geometrically intuitive nor visually perceivable.

Methods: In this work, a resistant method for landmark-based asymmetry analysis of individual bilateral symmetric structures in 2D is introduced. A geometrical derivation of this new approach is offered, while its advantages in comparison with the standard method are examined and discussed through a few illustrative examples.

Results: Experimental tests on both artificial and real data show that asymmetry is more effectively measured by using the resistant method because the underlying symmetric shape is better estimated. Therefore, the most asymmetric (respectively symmetric) landmarks are better determined through their large (respectively small) residuals. The percentage of asymmetry that is accounted for by each landmark is an additional revealing measure the new method offers which agrees with the displayed results while helping in their biological interpretation.

Conclusions: The resistant method is a useful exploratory tool for analyzing shape asymmetry in 2D, and it might be the preferable method whenever a non homogeneous deformation of bilateral symmetric structures is possible. By offering a more detailed and rather exhaustive explanation of the asymmetry pattern, this new approach will hopefully contribute to improve the quality of biological or developmental inferences.

A resistant method for studying individual shape asymmetry in two dimensions is introduced, which uses the geometry of data to estimate the underlying symmetric shape. Unlike the classical least-squares approach, it is shown show that asymmetry is more accurately measured when a resistant method is used instead; this helps symmetry departures to be more easily understood. The percentage of asymmetry accounted for by each landmark can also be computed in the process, providing an objective basis for a comprehensive characterization of asymmetry. Overall, the resistant method turns out to be a useful exploratory tool whenever a non homogeneous deformation of bilateral symmetric structures is possible.
Figures and Tables | References | Related Articles | Metrics
REVIEW
Applications of integrative OMICs approaches to gene regulation studies
Jing Qin, Bin Yan, Yaohua Hu, Panwen Wang, Junwen Wang
Quant. Biol.. 2016, 4 (4): 283-301.  
https://doi.org/10.1007/s40484-016-0085-y

Author Summary   Abstract   HTML   PDF (1231KB)

Background: Functional genomics employs dozens of OMICs technologies to explore the functions of DNA, RNA and protein regulators in gene regulation processes. Despite each of these technologies being powerful tools on their own, like the parable of blind men and an elephant, any one single technology has a limited ability to depict the complex regulatory system. Integrative OMICS approaches have emerged and become an important area in biology and medicine. It provides a precise and effective way to study gene regulations.

Results: This article reviews current popular OMICs technologies, OMICs data integration strategies, and bioinformatics tools used for multi-dimensional data integration. We highlight the advantages of these methods, particularly in elucidating molecular basis of biological regulatory mechanisms.

Conclusions: To better understand the complexity of biological processes, we need powerful bioinformatics tools to integrate these OMICs data. Integrating multi-dimensional OMICs data will generate novel insights into system-level gene regulations and serves as a foundation for further hypothesis-driven research.

Dozens of OMICs technologies have been developed to dissect the functions of DNA, RNA and protein in gene regulation. Each of these technologies is powerful on their own. However, like the parable of blind men and an elephant, any single technology has a limited ability to depict the whole complex regulatory system. Integrative approaches have emerged and become an important area in biology and medicine. It provides a precise and effective way to elucidate the regulatory mechanisms. This article reviews the popular OMICs technologies, OMICs data integration strategies, and the bioinformatics tools used for multi-dimensional data integration.
Figures and Tables | References | Related Articles | Metrics
Computational inference of physical spatial organization of eukaryotic genomes
Bingxiang Xu, Zhihua Zhang
Quant. Biol.. 2016, 4 (4): 302-309.  
https://doi.org/10.1007/s40484-016-0082-1

Author Summary   Abstract   HTML   PDF (317KB)

Background: Chromosomes are packed in the cell’s nucleus, and chromosomal conformation is critical to nearly all intranuclear biological reactions, including gene transcription and DNA replication. Nevertheless, chromosomal conformation is largely a mystery in terms of its formation and the regulatory machinery that accesses it.

Results: Thanks to recent technological developments, we can now probe chromatin interaction in substantial detail, boosting research interest in modeling genome spatial organization. Here, we review the current computational models that simulate chromosome dynamics, and explain the physical and topological properties of chromosomal conformation, as inferred from these newly generated data.

Conclusion: Novel models shall be developed to address questions beyond averaged structure in the near further.

Genome is always working in the 3D space of the nucleus, and its 3D structure is critical for gene regulation. We review the computational methods that rebuild genome 3D structures from high throughput technologies, such as Hi-C. We also discuss the pros and cons in current methods and possible further directions in the field.
Figures and Tables | References | Related Articles | Metrics
A survey on biomarker identification based on molecular networks
Guanghui Zhu, Xing-Ming Zhao, Jun Wu
Quant. Biol.. 2016, 4 (4): 310-319.  
https://doi.org/10.1007/s40484-016-0084-z

Author Summary   Abstract   HTML   PDF (424KB)

Background: Identifying biomarkers for accurate diagnosis and prognosis of diseases is important for the prevention of disease development. The molecular networks that describe the functional relationships among molecules provide a global view of the complex biological systems. With the molecular networks, the molecular mechanisms underlying diseases can be unveiled, which helps identify biomarkers in a systematic way.

Results: In this survey, we report the recent progress on identifying biomarkers based on the topology of molecular networks, and we categorize those biomarkers into three groups, including node biomarkers, edge biomarkers and network biomarkers. These distinct types of biomarkers can be detected under different conditions depending on the data available.

Conclusions: The biomarkers identified based on molecular networks can provide more accurate diagnosis and prognosis. The pros and cons of different types of biomarkers as well as future directions to improve the methods for identifying biomarkers are also discussed.

The biomarkers identified based on molecular networks can be of help for accurate diagnosis and prognosis in a more systematic way. In this survey, three types of network based biomarkers are introduced, including node biomarkers, edge biomarkers and network biomarkers. The computational approaches for detecting the network based biomarkers are also introduced.
Figures and Tables | References | Related Articles | Metrics
Performance measures in evaluating machine learning based bioinformatics predictors for classifications
Yasen Jiao, Pufeng Du
Quant. Biol.. 2016, 4 (4): 320-330.  
https://doi.org/10.1007/s40484-016-0081-2

Author Summary   Abstract   HTML   PDF (289KB)

Background: Many existing bioinformatics predictors are based on machine learning technology. When applying these predictors in practical studies, their predictive performances should be well understood. Different performance measures are applied in various studies as well as different evaluation methods. Even for the same performance measure, different terms, nomenclatures or notations may appear in different context.

Results: We carried out a review on the most commonly used performance measures and the evaluation methods for bioinformatics predictors.

Conclusions: It is important in bioinformatics to correctly understand and interpret the performance, as it is the key to rigorously compare performances of different predictors and to choose the right predictor.

It is important in bioinformatics to correctly understand and interpret the performance, as it is the key to rigorously compare performances of different predictors and to choose the right predictor. We present a comprehensive review to the performance measures in evaluating bioinformatics predictors, especially, the classification predictors.
Figures and Tables | References | Related Articles | Metrics
7 articles