Title |
A comparison study of succinct data structures for use in GWAS
|
---|---|
Published in |
BMC Bioinformatics, December 2013
|
DOI | 10.1186/1471-2105-14-369 |
Pubmed ID | |
Authors |
Patrick P Putnam, Ge Zhang, Philip A Wilsey |
Abstract |
In recent years genetic data analysis has seen a rapid increase in the scale of data to be analyzed. Schadt et al (NRG 11:647-657, 2010) offered that with data sets approaching the petabyte scale, data related challenges such as formatting, management, and transfer are increasingly important topics which need to be addressed. The use of succinct data structures is one method of reducing physical size of a data set without the use of expensive compression techniques. In this work, we consider the use of 2- and 3-bit encoding schemes for genotype data. We compare the computational performance of allele or genotype counting algorithms utilizing genotype data encoded in both schemes. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Norway | 1 | 100% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 1 | 100% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 2 | 9% |
Russia | 1 | 4% |
Unknown | 20 | 87% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 7 | 30% |
Student > Bachelor | 4 | 17% |
Student > Ph. D. Student | 3 | 13% |
Professor > Associate Professor | 3 | 13% |
Student > Master | 2 | 9% |
Other | 4 | 17% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 11 | 48% |
Computer Science | 9 | 39% |
Biochemistry, Genetics and Molecular Biology | 2 | 9% |
Medicine and Dentistry | 1 | 4% |