↓ Skip to main content

Incorporating ENCODE information into association analysis of whole genome sequencing data

Overview of attention for article published in BMC Proceedings, October 2016
Altmetric Badge

Citations

dimensions_citation
9 Dimensions

Readers on

mendeley
7 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Incorporating ENCODE information into association analysis of whole genome sequencing data
Published in
BMC Proceedings, October 2016
DOI 10.1186/s12919-016-0040-y
Pubmed ID
Authors

Taebeom Kim, Peng Wei

Abstract

With the rapidly decreasing cost of the next-generation sequencing technology, a large number of whole genome sequences have been generated, enabling researchers to survey rare variants in the protein-coding and regulatory regions of the genome. However, it remains a daunting task to identify functional variants associated with complex diseases from whole genome sequencing (WGS) data because of the millions of candidate variants and yet moderate sample size. We propose to incorporate the Encyclopedia of DNA Elements (ENCODE) information in the association analysis of WGS data to boost the statistical power. We use the RegulomeDB and PolyPhen2 scores as external weights in existing rare variants association tests. We demonstrate the proposed framework using the WGS data and blood pressure phenotype from the San Antonio Family Studies provided by the Genetic Analysis Workshop 19. We identified a genome-wide significant locus in gene SNUPN on chromosome 15 that harbors a rare nonsynonymous variant, which was not detected by benchmark methods that did not incorporate biological information, including the T5 burden test and sequence kernel association test.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 7 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 7 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 3 43%
Student > Postgraduate 1 14%
Unknown 3 43%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 2 29%
Agricultural and Biological Sciences 1 14%
Medicine and Dentistry 1 14%
Unknown 3 43%