Report for: Breaking the computational barriers of pairwise genome comparison

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	Breaking the computational barriers of pairwise genome comparison
Published in	BMC Bioinformatics, August 2015
DOI	10.1186/s12859-015-0679-9
Pubmed ID	26260162
Authors	Oscar Torreno, Oswaldo Trelles
Abstract	Conventional pairwise sequence comparison software algorithms are being used to process much larger datasets than they were originally designed for. This can result in processing bottlenecks that limit software capabilities or prevent full use of the available hardware resources. Overcoming the barriers that limit the efficient computational analysis of large biological sequence datasets by retrofitting existing algorithms or by creating new applications represents a major challenge for the bioinformatics community. We have developed C libraries for pairwise sequence comparison within diverse architectures, ranging from commodity systems to high performance and cloud computing environments. Exhaustive tests were performed using different datasets of closely- and distantly-related sequences that span from small viral genomes to large mammalian chromosomes. The tests demonstrated that our solution is capable of generating high quality results with a linear-time response and controlled memory consumption, being comparable or faster than the current state-of-the-art methods. We have addressed the problem of pairwise and all-versus-all comparison of large sequences in general, greatly increasing the limits on input data size. The approach described here is based on a modular out-of-core strategy that uses secondary storage to avoid reaching memory limits during the identification of High-scoring Segment Pairs (HSPs) between the sequences under comparison. Software engineering concepts were applied to avoid intermediate result re-calculation, to minimise the performance impact of input/output (I/O) operations and to modularise the process, thus enhancing application flexibility and extendibility. Our computationally-efficient approach allows tasks such as the massive comparison of complete genomes, evolutionary event detection, the identification of conserved synteny blocks and inter-genome distance calculations to be performed more effectively.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 14 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
Germany	3	21%
United States	2	14%
United Kingdom	1	7%
Norway	1	7%
Sweden	1	7%
Australia	1	7%
Italy	1	7%
Unknown	4	29%

Demographic breakdown

Type	Count	As %
Scientists	10	71%
Members of the public	4	29%

Mendeley readers

The data shown below were compiled from readership statistics for 54 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Brazil	3	6%
Philippines	1	2%
Sweden	1	2%
United States	1	2%
Unknown	48	89%

Demographic breakdown

Readers by professional status	Count	As %
Researcher	14	26%
Student > Master	10	19%
Student > Ph. D. Student	8	15%
Student > Bachelor	6	11%
Professor	3	6%
Other	7	13%
Unknown	6	11%

Readers by discipline	Count	As %
Agricultural and Biological Sciences	16	30%
Computer Science	12	22%
Biochemistry, Genetics and Molecular Biology	6	11%
Engineering	3	6%
Immunology and Microbiology	2	4%
Other	5	9%
Unknown	10	19%

Attention Score in Context

This research output has an Altmetric Attention Score of 7. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 14 August 2015.

All research outputs

#5,232,473

of 25,706,302 outputs

Outputs from BMC Bioinformatics

#1,803

of 7,735 outputs

Outputs of similar age

#59,779

of 276,605 outputs

Outputs of similar age from BMC Bioinformatics

#33

of 116 outputs

Altmetric has tracked 25,706,302 research outputs across all sources so far. Compared to these this one has done well and is in the 79th percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.

So far Altmetric has tracked 7,735 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.5. This one has done well, scoring higher than 76% of its peers.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 276,605 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 78% of its contemporaries.

We're also able to compare this research output to 116 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 70% of its contemporaries.

Breaking the computational barriers of pairwise genome comparison

About this Attention Score

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context