↓ Skip to main content

RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach

Overview of attention for article published in BMC Bioinformatics, February 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Above-average Attention Score compared to outputs of the same age and source (51st percentile)

Mentioned by

twitter
3 X users

Citations

dimensions_citation
178 Dimensions

Readers on

mendeley
198 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach
Published in
BMC Bioinformatics, February 2017
DOI 10.1186/s12859-017-1561-8
Pubmed ID
Authors

Xiaoyong Pan, Hong-Bin Shen

Abstract

RNAs play key roles in cells through the interactions with proteins known as the RNA-binding proteins (RBP) and their binding motifs enable crucial understanding of the post-transcriptional regulation of RNAs. How the RBPs correctly recognize the target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict the RNA-protein binding sites from the rapidly growing multi-resource data, e.g. sequence, structure, their domain specific features and formats have posed significant computational challenges. One of current difficulties is that the cross-source shared common knowledge is at a higher abstraction level beyond the observed data, resulting in a low efficiency of direct integration of observed data across domains. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into the meaningful binding motifs is a topic worth of further investigation. In viewing of these challenges, we propose a deep learning-based framework (iDeep) by using a novel hybrid convolutional neural network and deep belief network to predict the RBP interaction sites and motifs on RNAs. This new protocol is featured by transforming the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated. To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6%. Besides the overall enhanced prediction performance, the convolutional neural network module embedded in iDeep is also able to automatically capture the interpretable binding motifs for RBPs. Large-scale experiments demonstrate that these mined binding motifs agree well with the experimentally verified results, suggesting iDeep is a promising approach in the real-world applications. The iDeep framework not only can achieve promising performance than the state-of-the-art predictors, but also easily capture interpretable binding motifs. iDeep is available at http://www.csbio.sjtu.edu.cn/bioinf/iDeep.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 198 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Canada 1 <1%
Unknown 197 99%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 49 25%
Researcher 30 15%
Student > Master 21 11%
Student > Bachelor 16 8%
Student > Postgraduate 7 4%
Other 26 13%
Unknown 49 25%
Readers by discipline Count As %
Computer Science 49 25%
Biochemistry, Genetics and Molecular Biology 40 20%
Agricultural and Biological Sciences 23 12%
Engineering 9 5%
Medicine and Dentistry 4 2%
Other 13 7%
Unknown 60 30%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 3. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 17 April 2017.
All research outputs
#13,307,934
of 22,957,478 outputs
Outputs from BMC Bioinformatics
#4,020
of 7,307 outputs
Outputs of similar age
#157,676
of 310,863 outputs
Outputs of similar age from BMC Bioinformatics
#66
of 141 outputs
Altmetric has tracked 22,957,478 research outputs across all sources so far. This one is in the 41st percentile – i.e., 41% of other outputs scored the same or lower than it.
So far Altmetric has tracked 7,307 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 42nd percentile – i.e., 42% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 310,863 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 48th percentile – i.e., 48% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 141 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 51% of its contemporaries.