SAG-QC: quality control of single amplified genome information by subtracting non-target sequences based on sequence compositions

Overview of attention for article published in BMC Bioinformatics, March 2017

Altmetric Badge

Citations

dimensions_citation: 5 Dimensions

Readers on

mendeley: 24 Mendeley

Summary Dimensions citations

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	SAG-QC: quality control of single amplified genome information by subtracting non-target sequences based on sequence compositions
Published in	BMC Bioinformatics, March 2017
DOI	10.1186/s12859-017-1572-5
Pubmed ID	28259144
Authors	Toru Maruyama, Tetsushi Mori, Keisuke Yamagishi, Haruko Takeyama
Abstract	Whole genome amplification techniques have enabled the analysis of unexplored genomic information by sequencing of single-amplified genomes (SAGs). Whole genome amplification of single bacteria is currently challenging because contamination often occurs in experimental processes. Thus, to increase the confidence in the analyses of sequenced SAGs, bioinformatics approaches that identify and exclude non-target sequences from SAGs are required. Since currently reported approaches utilize sequence information in public databases, they have limitations when new strains are the targets of interest. Here, we developed a software SAG-QC that identify and exclude non-target sequences independent of database. In our method, "no template control" sequences acquired during WGA were used. We calculated the probability that a sequence was derived from contaminants by comparing k-mer compositions with the no template control sequences. Based on the results of tests using simulated SAG datasets, the accuracy of our method for predicting non-target sequences was higher than that of currently reported techniques. Subsequently, we applied our tool to actual SAG datasets and evaluated the accuracy of the predictions. Our method works independently of public sequence information for distinguishing SAGs from non-target sequences. This method will be effective when employed against SAG sequences of unexplored strains and we anticipate that it will contribute to the correct interpretation of SAGs.

View on publisher site Alert me about new mentions

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 24 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Unknown	24	100%

Demographic breakdown

Readers by professional status	Count	As %
Researcher	4	17%
Student > Master	4	17%
Student > Bachelor	3	13%
Student > Ph. D. Student	3	13%
Student > Doctoral Student	2	8%
Other	2	8%
Unknown	6	25%

Readers by discipline	Count	As %
Biochemistry, Genetics and Molecular Biology	4	17%
Agricultural and Biological Sciences	4	17%
Immunology and Microbiology	3	13%
Medicine and Dentistry	2	8%
Computer Science	1	4%
Other	3	13%
Unknown	7	29%