↓ Skip to main content

Filtering genetic variants and placing informative priors based on putative biological function

Overview of attention for article published in BMC Genomic Data, February 2016
Altmetric Badge

Citations

dimensions_citation
7 Dimensions

Readers on

mendeley
34 Mendeley
citeulike
1 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Filtering genetic variants and placing informative priors based on putative biological function
Published in
BMC Genomic Data, February 2016
DOI 10.1186/s12863-015-0313-x
Pubmed ID
Authors

Stefanie Friedrichs, Dörthe Malzahn, Elizabeth W. Pugh, Marcio Almeida, Xiao Qing Liu, Julia N. Bailey

Abstract

High-density genetic marker data, especially sequence data, imply an immense multiple testing burden. This can be ameliorated by filtering genetic variants, exploiting or accounting for correlations between variants, jointly testing variants, and by incorporating informative priors. Priors can be based on biological knowledge or predicted variant function, or even be used to integrate gene expression or other omics data. Based on Genetic Analysis Workshop (GAW) 19 data, this article discusses diversity and usefulness of functional variant scores provided, for example, by PolyPhen2, SIFT, or RegulomeDB annotations. Incorporating functional scores into variant filters or weights and adjusting the significance level for correlations between variants yielded significant associations with blood pressure traits in a large family study of Mexican Americans (GAW19 data set). Marker rs218966 in gene PHF14 and rs9836027 in MAP4 significantly associated with hypertension; additionally, rare variants in SNUPN significantly associated with systolic blood pressure. Variant weights strongly influenced the power of kernel methods and burden tests. Apart from variant weights in test statistics, prior weights may also be used when combining test statistics or to informatively weight p values while controlling false discovery rate (FDR). Indeed, power improved when gene expression data for FDR-controlled informative weighting of association test p values of genes was used. Finally, approaches exploiting variant correlations included identity-by-descent mapping and the optimal strategy for joint testing rare and common variants, which was observed to depend on linkage disequilibrium structure.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 34 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 34 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 7 21%
Researcher 7 21%
Student > Master 5 15%
Other 4 12%
Professor > Associate Professor 2 6%
Other 3 9%
Unknown 6 18%
Readers by discipline Count As %
Agricultural and Biological Sciences 9 26%
Biochemistry, Genetics and Molecular Biology 8 24%
Medicine and Dentistry 5 15%
Pharmacology, Toxicology and Pharmaceutical Science 1 3%
Mathematics 1 3%
Other 3 9%
Unknown 7 21%