Title |
DRUMS: Disk Repository with Update Management and Select option for high throughput sequencing data
|
---|---|
Published in |
BMC Bioinformatics, February 2014
|
DOI | 10.1186/1471-2105-15-38 |
Pubmed ID | |
Authors |
Martin Nettling, Nils Thieme, Andreas Both, Ivo Grosse |
Abstract |
New technologies for analyzing biological samples, like next generation sequencing, are producing a growing amount of data together with quality scores. Moreover, software tools (e.g., for mapping sequence reads), calculating transcription factor binding probabilities, estimating epigenetic modification enriched regions or determining single nucleotide polymorphism increase this amount of position-specific DNA-related data even further. Hence, requesting data becomes challenging and expensive and is often implemented using specialised hardware. In addition, picking specific data as fast as possible becomes increasingly important in many fields of science. The general problem of handling big data sets was addressed by developing specialized databases like HBase, HyperTable or Cassandra. However, these database solutions require also specialized or distributed hardware leading to expensive investments. To the best of our knowledge, there is no database capable of (i) storing billions of position-specific DNA-related records, (ii) performing fast and resource saving requests, and (iii) running on a single standard computer hardware. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 2 | 29% |
United States | 1 | 14% |
Japan | 1 | 14% |
Sweden | 1 | 14% |
Norway | 1 | 14% |
Unknown | 1 | 14% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 4 | 57% |
Scientists | 3 | 43% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Germany | 2 | 6% |
Sweden | 1 | 3% |
United Kingdom | 1 | 3% |
Iran, Islamic Republic of | 1 | 3% |
United States | 1 | 3% |
Unknown | 25 | 81% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 12 | 39% |
Student > Ph. D. Student | 5 | 16% |
Student > Doctoral Student | 3 | 10% |
Student > Master | 3 | 10% |
Lecturer | 2 | 6% |
Other | 5 | 16% |
Unknown | 1 | 3% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 11 | 35% |
Computer Science | 9 | 29% |
Biochemistry, Genetics and Molecular Biology | 3 | 10% |
Medicine and Dentistry | 2 | 6% |
Unspecified | 1 | 3% |
Other | 3 | 10% |
Unknown | 2 | 6% |