↓ Skip to main content

Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome

Overview of attention for article published in BMC Bioinformatics, May 2016
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (84th percentile)
  • High Attention Score compared to outputs of the same age and source (86th percentile)

Mentioned by

blogs
1 blog
twitter
9 X users

Citations

dimensions_citation
29 Dimensions

Readers on

mendeley
54 Mendeley
citeulike
1 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome
Published in
BMC Bioinformatics, May 2016
DOI 10.1186/s12859-016-1067-9
Pubmed ID
Authors

Risa Kawaguchi, Hisanori Kiryu

Abstract

RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .

X Demographics

X Demographics

The data shown below were collected from the profiles of 9 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 54 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Cuba 1 2%
United States 1 2%
Unknown 52 96%

Demographic breakdown

Readers by professional status Count As %
Researcher 13 24%
Student > Ph. D. Student 12 22%
Student > Bachelor 7 13%
Student > Master 6 11%
Professor > Associate Professor 3 6%
Other 6 11%
Unknown 7 13%
Readers by discipline Count As %
Agricultural and Biological Sciences 24 44%
Biochemistry, Genetics and Molecular Biology 13 24%
Computer Science 6 11%
Medicine and Dentistry 2 4%
Engineering 2 4%
Other 1 2%
Unknown 6 11%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 11. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 17 June 2016.
All research outputs
#2,773,519
of 22,869,263 outputs
Outputs from BMC Bioinformatics
#919
of 7,296 outputs
Outputs of similar age
#45,787
of 298,725 outputs
Outputs of similar age from BMC Bioinformatics
#14
of 101 outputs
Altmetric has tracked 22,869,263 research outputs across all sources so far. Compared to these this one has done well and is in the 87th percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 7,296 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one has done well, scoring higher than 87% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 298,725 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 84% of its contemporaries.
We're also able to compare this research output to 101 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 86% of its contemporaries.