Title |
A scalable and memory-efficient algorithm for de novo transcriptome assembly of non-model organisms
|
---|---|
Published in |
BMC Genomics, May 2017
|
DOI | 10.1186/s12864-017-3735-1 |
Pubmed ID | |
Authors |
Sing-Hoi Sze, Meaghan L. Pimsler, Jeffery K. Tomberlin, Corbin D. Jones, Aaron M. Tarone |
Abstract |
With increased availability of de novo assembly algorithms, it is feasible to study entire transcriptomes of non-model organisms. While algorithms are available that are specifically designed for performing transcriptome assembly from high-throughput sequencing data, they are very memory-intensive, limiting their applications to small data sets with few libraries. We develop a transcriptome assembly algorithm that recovers alternatively spliced isoforms and expression levels while utilizing as many RNA-Seq libraries as possible that contain hundreds of gigabases of data. New techniques are developed so that computations can be performed on a computing cluster with moderate amount of physical memory. Our strategy minimizes memory consumption while simultaneously obtaining comparable or improved accuracy over existing algorithms. It provides support for incremental updates of assemblies when new libraries become available. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 2 | 11% |
China | 1 | 5% |
Germany | 1 | 5% |
Netherlands | 1 | 5% |
Australia | 1 | 5% |
France | 1 | 5% |
Brazil | 1 | 5% |
India | 1 | 5% |
Spain | 1 | 5% |
Other | 0 | 0% |
Unknown | 9 | 47% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 11 | 58% |
Members of the public | 7 | 37% |
Science communicators (journalists, bloggers, editors) | 1 | 5% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 1 | 2% |
United States | 1 | 2% |
Unknown | 42 | 95% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 10 | 23% |
Researcher | 8 | 18% |
Student > Master | 7 | 16% |
Student > Bachelor | 5 | 11% |
Other | 3 | 7% |
Other | 6 | 14% |
Unknown | 5 | 11% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 15 | 34% |
Biochemistry, Genetics and Molecular Biology | 13 | 30% |
Computer Science | 5 | 11% |
Pharmacology, Toxicology and Pharmaceutical Science | 1 | 2% |
Economics, Econometrics and Finance | 1 | 2% |
Other | 3 | 7% |
Unknown | 6 | 14% |