Report for: CMSA: a heterogeneous CPU/GPU computing system for multiple similar RNA/DNA sequence alignment

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	CMSA: a heterogeneous CPU/GPU computing system for multiple similar RNA/DNA sequence alignment
Published in	BMC Bioinformatics, June 2017
DOI	10.1186/s12859-017-1725-6
Pubmed ID	28646874
Authors	Xi Chen, Chen Wang, Shanjiang Tang, Ce Yu, Quan Zou
Abstract	The multiple sequence alignment (MSA) is a classic and powerful technique for sequence analysis in bioinformatics. With the rapid growth of biological datasets, MSA parallelization becomes necessary to keep its running time in an acceptable level. Although there are a lot of work on MSA problems, their approaches are either insufficient or contain some implicit assumptions that limit the generality of usage. First, the information of users' sequences, including the sizes of datasets and the lengths of sequences, can be of arbitrary values and are generally unknown before submitted, which are unfortunately ignored by previous work. Second, the center star strategy is suited for aligning similar sequences. But its first stage, center sequence selection, is highly time-consuming and requires further optimization. Moreover, given the heterogeneous CPU/GPU platform, prior studies consider the MSA parallelization on GPU devices only, making the CPUs idle during the computation. Co-run computation, however, can maximize the utilization of the computing resources by enabling the workload computation on both CPU and GPU simultaneously. This paper presents CMSA, a robust and efficient MSA system for large-scale datasets on the heterogeneous CPU/GPU platform. It performs and optimizes multiple sequence alignment automatically for users' submitted sequences without any assumptions. CMSA adopts the co-run computation model so that both CPU and GPU devices are fully utilized. Moreover, CMSA proposes an improved center star strategy that reduces the time complexity of its center sequence selection process from O(mn (2)) to O(mn). The experimental results show that CMSA achieves an up to 11× speedup and outperforms the state-of-the-art software. CMSA focuses on the multiple similar RNA/DNA sequence alignment and proposes a novel bitmap based algorithm to improve the center star strategy. We can conclude that harvesting the high performance of modern GPU is a promising approach to accelerate multiple sequence alignment. Besides, adopting the co-run computation model can maximize the entire system utilization significantly. The source code is available at https://github.com/wangvsa/CMSA .

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 4 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
Sweden	1	25%
Australia	1	25%
Unknown	2	50%

Demographic breakdown

Type	Count	As %
Scientists	3	75%
Members of the public	1	25%

Mendeley readers

The data shown below were compiled from readership statistics for 27 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Unknown	27	100%

Demographic breakdown

Readers by professional status	Count	As %
Student > Master	5	19%
Student > Bachelor	5	19%
Other	2	7%
Student > Ph. D. Student	2	7%
Professor	1	4%
Other	4	15%
Unknown	8	30%

Readers by discipline	Count	As %
Computer Science	9	33%
Biochemistry, Genetics and Molecular Biology	5	19%
Agricultural and Biological Sciences	3	11%
Arts and Humanities	1	4%
Medicine and Dentistry	1	4%
Other	1	4%
Unknown	7	26%

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 26 June 2017.

All research outputs

#13,900,658

of 23,577,761 outputs

Outputs from BMC Bioinformatics

#4,306

of 7,418 outputs

Outputs of similar age

#162,399

of 317,126 outputs

Outputs of similar age from BMC Bioinformatics

#58

of 115 outputs

Altmetric has tracked 23,577,761 research outputs across all sources so far. This one is in the 39th percentile – i.e., 39% of other outputs scored the same or lower than it.

So far Altmetric has tracked 7,418 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 38th percentile – i.e., 38% of its peers scored the same or lower than it.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 317,126 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 46th percentile – i.e., 46% of its contemporaries scored the same or lower than it.

We're also able to compare this research output to 115 others from the same source and published within six weeks on either side of this one. This one is in the 47th percentile – i.e., 47% of its contemporaries scored the same or lower than it.

CMSA: a heterogeneous CPU/GPU computing system for multiple similar RNA/DNA sequence alignment

About this Attention Score

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context