↓ Skip to main content

Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO

Overview of attention for article published in BMC Bioinformatics, February 2017
Altmetric Badge

Mentioned by

twitter
2 X users

Citations

dimensions_citation
38 Dimensions

Readers on

mendeley
112 Mendeley
citeulike
1 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO
Published in
BMC Bioinformatics, February 2017
DOI 10.1186/s12859-017-1515-1
Pubmed ID
Authors

Yiming Zuo, Yi Cui, Guoqiang Yu, Ruijiang Li, Habtom W. Ressom

Abstract

Conventional differential gene expression analysis by methods such as student's t-test, SAM, and Empirical Bayes often searches for statistically significant genes without considering the interactions among them. Network-based approaches provide a natural way to study these interactions and to investigate the rewiring interactions in disease versus control groups. In this paper, we apply weighted graphical LASSO (wgLASSO) algorithm to integrate a data-driven network model with prior biological knowledge (i.e., protein-protein interactions) for biological network inference. We propose a novel differentially weighted graphical LASSO (dwgLASSO) algorithm that builds group-specific networks and perform network-based differential gene expression analysis to select biomarker candidates by considering their topological differences between the groups. Through simulation, we showed that wgLASSO can achieve better performance in building biologically relevant networks than purely data-driven models (e.g., neighbor selection, graphical LASSO), even when only a moderate level of information is available as prior biological knowledge. We evaluated the performance of dwgLASSO for survival time prediction using two microarray breast cancer datasets previously reported by Bild et al. and van de Vijver et al. Compared with the top 10 significant genes selected by conventional differential gene expression analysis method, the top 10 significant genes selected by dwgLASSO in the dataset from Bild et al. led to a significantly improved survival time prediction in the independent dataset from van de Vijver et al. Among the 10 genes selected by dwgLASSO, UBE2S, SALL2, XBP1 and KIAA0922 have been confirmed by literature survey to be highly relevant in breast cancer biomarker discovery study. Additionally, we tested dwgLASSO on TCGA RNA-seq data acquired from patients with hepatocellular carcinoma (HCC) on tumors samples and their corresponding non-tumorous liver tissues. Improved sensitivity, specificity and area under curve (AUC) were observed when comparing dwgLASSO with conventional differential gene expression analysis method. The proposed network-based differential gene expression analysis algorithm dwgLASSO can achieve better performance than conventional differential gene expression analysis methods by integrating information at both gene expression and network topology levels. The incorporation of prior biological knowledge can lead to the identification of biologically meaningful genes in cancer biomarker studies.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 112 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 112 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 27 24%
Researcher 18 16%
Student > Bachelor 15 13%
Student > Master 10 9%
Student > Doctoral Student 5 4%
Other 14 13%
Unknown 23 21%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 22 20%
Agricultural and Biological Sciences 17 15%
Computer Science 16 14%
Medicine and Dentistry 10 9%
Engineering 8 7%
Other 14 13%
Unknown 25 22%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 24 August 2017.
All research outputs
#18,531,724
of 22,953,506 outputs
Outputs from BMC Bioinformatics
#6,341
of 7,308 outputs
Outputs of similar age
#312,077
of 422,694 outputs
Outputs of similar age from BMC Bioinformatics
#113
of 148 outputs
Altmetric has tracked 22,953,506 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.
So far Altmetric has tracked 7,308 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 5th percentile – i.e., 5% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 422,694 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 15th percentile – i.e., 15% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 148 others from the same source and published within six weeks on either side of this one. This one is in the 14th percentile – i.e., 14% of its contemporaries scored the same or lower than it.