Title |
MetaCluster-TA: taxonomic annotation for metagenomic data based on assembly-assisted binning
|
---|---|
Published in |
BMC Genomics, January 2014
|
DOI | 10.1186/1471-2164-15-s1-s12 |
Pubmed ID | |
Authors |
Yi Wang, Henry Chi Ming Leung, Siu Ming Yiu, Francis Yuk Lun Chin |
Abstract |
Taxonomic annotation of reads is an important problem in metagenomic analysis. Existing annotation tools, which rely on the approach of aligning each read to the taxonomic structure, are unable to annotate many reads efficiently and accurately as reads (~100 bp) are short and most of them come from unknown genomes. Previous work has suggested assembling the reads to make longer contigs before annotation. More reads/contigs can be annotated as a longer contig (in Kbp) can be aligned to a taxon even if it is from an unknown species as long as it contains a conserved region of that taxon. Unfortunately existing metagenomic assembly tools are not mature enough to produce long enough contigs. Binning tries to group reads/contigs of similar species together. Intuitively, reads in the same group (cluster) should be annotated to the same taxon and these reads altogether should cover a significant portion of the genome alleviating the problem of short contigs if the quality of binning is high. However, no existing work has tried to use binning results to help solve the annotation problem. This work explores this direction. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 2 | 25% |
Canada | 2 | 25% |
Cameroon | 1 | 13% |
Germany | 1 | 13% |
Sweden | 1 | 13% |
Unknown | 1 | 13% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 6 | 75% |
Members of the public | 2 | 25% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Germany | 2 | 2% |
Estonia | 2 | 2% |
Canada | 2 | 2% |
United States | 2 | 2% |
Sweden | 1 | <1% |
Brazil | 1 | <1% |
China | 1 | <1% |
France | 1 | <1% |
Spain | 1 | <1% |
Other | 1 | <1% |
Unknown | 103 | 88% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 27 | 23% |
Student > Ph. D. Student | 25 | 21% |
Student > Master | 16 | 14% |
Other | 9 | 8% |
Student > Bachelor | 6 | 5% |
Other | 19 | 16% |
Unknown | 15 | 13% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 53 | 45% |
Biochemistry, Genetics and Molecular Biology | 16 | 14% |
Computer Science | 12 | 10% |
Environmental Science | 6 | 5% |
Immunology and Microbiology | 4 | 3% |
Other | 10 | 9% |
Unknown | 16 | 14% |