↓ Skip to main content

Independent test assessment using the extreme value distribution theory

Overview of attention for article published in BMC Proceedings, October 2016
Altmetric Badge

Citations

dimensions_citation
2 Dimensions

Readers on

mendeley
8 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Independent test assessment using the extreme value distribution theory
Published in
BMC Proceedings, October 2016
DOI 10.1186/s12919-016-0038-5
Pubmed ID
Authors

Marcio Almeida, Lucy Blondell, Juan M. Peralta, Jack W. Kent, Goo Jun, Tanya M. Teslovich, Christian Fuchsberger, Andrew R. Wood, Alisa K. Manning, Timothy M. Frayling, Pablo E. Cingolani, Robert Sladek, Thomas D. Dyer, Goncalo Abecasis, Ravindranath Duggirala, John Blangero

Abstract

The new generation of whole genome sequencing platforms offers great possibilities and challenges for dissecting the genetic basis of complex traits. With a very high number of sequence variants, a naïve multiple hypothesis threshold correction hinders the identification of reliable associations by the overreduction of statistical power. In this report, we examine 2 alternative approaches to improve the statistical power of a whole genome association study to detect reliable genetic associations. The approaches were tested using the Genetic Analysis Workshop 19 (GAW19) whole genome sequencing data. The first tested method estimates the real number of effective independent tests actually being performed in whole genome association project by the use of an extreme value distribution and a set of phenotype simulations. Given the familiar nature of the GAW19 data and the finite number of pedigree founders in the sample, the number of correlations between genotypes is greater than in a set of unrelated samples. Using our procedure, we estimate that the effective number represents only 15 % of the total number of independent tests performed. However, even using this corrected significance threshold, no genome-wide significant association could be detected for systolic and diastolic blood pressure traits. The second approach implements a biological relevance-driven hypothesis tested by exploiting prior computational predictions on the effect of nonsynonymous genetic variants detected in a whole genome sequencing association study. This guided testing approach was able to identify 2 promising single-nucleotide polymorphisms (SNPs), 1 for each trait, targeting biologically relevant genes that could help shed light on the genesis of the human hypertension. The first gene, PFH14, associated with systolic blood pressure, interacts directly with genes involved in calcium-channel formation and the second gene, MAP4, encodes a microtubule-associated protein and had already been detected by previous genome-wide association study experiments conducted in an Asian population. Our results highlight the necessity of the development of alternative approached to improve the efficiency on the detection of reasonable candidate associations in whole genome sequencing studies.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 8 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 8 100%

Demographic breakdown

Readers by professional status Count As %
Professor 2 25%
Student > Master 2 25%
Researcher 1 13%
Professor > Associate Professor 1 13%
Unknown 2 25%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 2 25%
Agricultural and Biological Sciences 2 25%
Nursing and Health Professions 1 13%
Mathematics 1 13%
Unknown 2 25%