Title |
REAPR: a universal tool for genome assembly evaluation
|
---|---|
Published in |
Genome Biology, May 2013
|
DOI | 10.1186/gb-2013-14-5-r47 |
Pubmed ID | |
Authors |
Martin Hunt, Taisei Kikuchi, Mandy Sanders, Chris Newbold, Matthew Berriman, Thomas D Otto |
Abstract |
Methods to reliably assess the accuracy of genome sequence data are lacking. Currently completeness is only described qualitatively and mis-assemblies are overlooked. Here we present REAPR, a tool that precisely identifies errors in genome assemblies without the need for a reference sequence. We have validated REAPR on complete genomes or de novo assemblies from bacteria, malaria and Caenorhabditis elegans, and demonstrate that 86% and 82% of the human and mouse reference genomes are error-free, respectively. When applied to an ongoing genome project, REAPR provides corrected assembly statistics allowing the quantitative comparison of multiple assemblies. REAPR is available at http://www.sanger.ac.uk/resources/software/reapr/. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 9 | 16% |
United Kingdom | 5 | 9% |
Spain | 5 | 9% |
France | 4 | 7% |
New Zealand | 3 | 5% |
India | 2 | 4% |
Germany | 2 | 4% |
Hong Kong | 2 | 4% |
Canada | 2 | 4% |
Other | 8 | 14% |
Unknown | 14 | 25% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 33 | 59% |
Members of the public | 20 | 36% |
Science communicators (journalists, bloggers, editors) | 2 | 4% |
Unknown | 1 | 2% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Germany | 11 | 1% |
United States | 11 | 1% |
United Kingdom | 7 | <1% |
France | 4 | <1% |
Australia | 3 | <1% |
Brazil | 3 | <1% |
Netherlands | 2 | <1% |
Hong Kong | 2 | <1% |
Kenya | 2 | <1% |
Other | 29 | 4% |
Unknown | 679 | 90% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 173 | 23% |
Researcher | 167 | 22% |
Student > Master | 123 | 16% |
Student > Bachelor | 61 | 8% |
Student > Doctoral Student | 38 | 5% |
Other | 120 | 16% |
Unknown | 71 | 9% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 414 | 55% |
Biochemistry, Genetics and Molecular Biology | 148 | 20% |
Computer Science | 56 | 7% |
Immunology and Microbiology | 13 | 2% |
Environmental Science | 13 | 2% |
Other | 26 | 3% |
Unknown | 83 | 11% |