Report for: RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach

Title	RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach
Published in	BMC Bioinformatics, February 2017
DOI	10.1186/s12859-017-1561-8
Pubmed ID	28245811
Authors	Xiaoyong Pan, Hong-Bin Shen
Abstract	RNAs play key roles in cells through interactions with proteins known as RNA-binding proteins (RBPs), and their binding motifs enable crucial understanding of the post-transcriptional regulation of RNAs. How RBPs correctly recognize their target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict RNA-protein binding sites from rapidly growing multi-resource data, e.g. sequence and structure, their domain-specific features and formats have posed significant computational challenges. One current difficulty is that the common knowledge shared across sources sits at a higher abstraction level than the observed data, so direct integration of observed data across domains is inefficient. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into meaningful binding motifs is a topic worthy of further investigation. In view of these challenges, we propose a deep learning-based framework (iDeep) that uses a novel hybrid convolutional neural network and deep belief network to predict RBP interaction sites and motifs on RNAs. This new protocol transforms the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated.
To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets. Our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor, and that through cross-domain knowledge integration at an abstraction level, iDeep outperforms the state-of-the-art predictors by 6%. Besides the overall enhanced prediction performance, the convolutional neural network module embedded in iDeep is also able to automatically capture interpretable binding motifs for RBPs. Large-scale experiments demonstrate that these mined binding motifs agree well with experimentally verified results, suggesting iDeep is a promising approach for real-world applications. The iDeep framework not only achieves better performance than the state-of-the-art predictors but also easily captures interpretable binding motifs. iDeep is available at http://www.csbio.sjtu.edu.cn/bioinf/iDeep.
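
The abstract's claim that a convolutional module can capture binding motifs can be illustrated with a minimal sketch. This is not the iDeep implementation: the function names, the hand-set filter, and the toy sequence are all assumptions made for illustration. It only shows the general idea that sliding a filter over a one-hot encoded RNA sequence produces a high score wherever the sequence matches the pattern the filter encodes.

```python
import numpy as np

ALPHABET = "ACGU"

def one_hot(seq):
    """Encode an RNA string as a (len(seq), 4) one-hot matrix."""
    idx = [ALPHABET.index(ch) for ch in seq]
    return np.eye(4)[idx]

def scan(seq, filt):
    """Slide a (k, 4) filter along the sequence; return one score per window."""
    x = one_hot(seq)
    k = filt.shape[0]
    return np.array([np.sum(x[i:i + k] * filt) for i in range(len(seq) - k + 1)])

# Hypothetical filter that rewards the 4-mer "UGCA".
# The weights are set by hand here; in a trained network they would be learned.
motif_filter = one_hot("UGCA")

scores = scan("AAUGCAUGG", motif_filter)  # toy sequence, not real data
best = int(np.argmax(scores))             # start index of the best-matching window
print(best, scores[best])
```

In a trained network the filter weights are real-valued, so the highest-scoring windows across many sequences can be aligned and summarized as a motif; that step is omitted here.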

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output.

Geographical breakdown

Country	Count	As %
United States	1	33%
France	1	33%
Unknown	1	33%

Demographic breakdown

Type	Count	As %
Members of the public	2	67%
Scientists	1	33%

Mendeley readers

The data shown below were compiled from readership statistics for 198 Mendeley readers of this research output.

Geographical breakdown

Country	Count	As %
Canada	1	<1%
Unknown	197	99%

Demographic breakdown

Readers by professional status	Count	As %
Student > Ph. D. Student	49	25%
Researcher	30	15%
Student > Master	21	11%
Student > Bachelor	16	8%
Student > Postgraduate	7	4%
Other	26	13%
Unknown	49	25%

Readers by discipline	Count	As %
Computer Science	49	25%
Biochemistry, Genetics and Molecular Biology	40	20%
Agricultural and Biological Sciences	23	12%
Engineering	9	5%
Medicine and Dentistry	4	2%
Other	13	7%
Unknown	60	30%

Attention Score in Context

This research output has an Altmetric Attention Score of 3. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 17 April 2017.

Context	Rank	Out of
All research outputs	#13,307,934	22,957,478 outputs
Outputs from BMC Bioinformatics	#4,020	7,307 outputs
Outputs of similar age	#157,676	310,863 outputs
Outputs of similar age from BMC Bioinformatics	#66	141 outputs

Altmetric has tracked 22,957,478 research outputs across all sources so far. This one is in the 41st percentile – i.e., 41% of other outputs scored the same or lower than it.

So far Altmetric has tracked 7,307 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 42nd percentile – i.e., 42% of its peers scored the same or lower than it.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 310,863 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 48th percentile – i.e., 48% of its contemporaries scored the same or lower than it.

We're also able to compare this research output to 141 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 51% of its contemporaries.
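
The link between the ranks and the percentiles quoted above can be checked with a rough calculation. The sketch below is an approximation, not Altmetric's method: it assumes the percentile is simply the share of outputs ranked below this one and ignores ties, which the page does not explain. The function name `approx_percentile` is hypothetical.

```python
def approx_percentile(rank, total):
    """Share of outputs ranked below `rank`, as a percentage (ties ignored)."""
    return 100.0 * (total - rank) / total

# (rank, total) pairs copied from this report.
contexts = {
    "All research outputs": (13_307_934, 22_957_478),
    "Outputs from BMC Bioinformatics": (4_020, 7_307),
    "Outputs of similar age": (157_676, 310_863),
    "Outputs of similar age from BMC Bioinformatics": (66, 141),
}

for name, (rank, total) in contexts.items():
    print(f"{name}: about {approx_percentile(rank, total):.1f}%")
```

The rank-only estimates land within a few points of the percentiles reported on this page but do not match them exactly, so the reported figures should be treated as authoritative.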
