↓ Skip to main content

A new statistical framework for genetic pleiotropic analysis of high dimensional phenotype data

Overview of attention for article published in BMC Genomics, November 2016
Altmetric Badge

Readers on

mendeley
37 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
A new statistical framework for genetic pleiotropic analysis of high dimensional phenotype data
Published in
BMC Genomics, November 2016
DOI 10.1186/s12864-016-3169-1
Pubmed ID
Authors

Panpan Wang, Mohammad Rahman, Li Jin, Momiao Xiong

Abstract

The widely used genetic pleiotropic analyses of multiple phenotypes are often designed for examining the relationship between common variants and a few phenotypes. They are not suited for both high dimensional phenotypes and high dimensional genotype (next-generation sequencing) data. To overcome limitations of the traditional genetic pleiotropic analysis of multiple phenotypes, we develop sparse structural equation models (SEMs) as a general framework for a new paradigm of genetic analysis of multiple phenotypes. To incorporate both common and rare variants into the analysis, we extend the traditional multivariate SEMs to sparse functional SEMs. To deal with high dimensional phenotype and genotype data, we employ functional data analysis and the alternative direction methods of multiplier (ADMM) techniques to reduce data dimension and improve computational efficiency. Using large scale simulations we showed that the proposed methods have higher power to detect true causal genetic pleiotropic structure than other existing methods. Simulations also demonstrate that the gene-based pleiotropic analysis has higher power than the single variant-based pleiotropic analysis. The proposed method is applied to exome sequence data from the NHLBI's Exome Sequencing Project (ESP) with 11 phenotypes, which identifies a network with 137 genes connected to 11 phenotypes and 341 edges. Among them, 114 genes showed pleiotropic genetic effects and 45 genes were reported to be associated with phenotypes in the analysis or other cardiovascular disease (CVD) related phenotypes in the literature. Our proposed sparse functional SEMs can incorporate both common and rare variants into the analysis and the ADMM algorithm can efficiently solve the penalized SEMs. Using this model we can jointly infer genetic architecture and casual phenotype network structure, and decompose the genetic effect into direct, indirect and total effect. Using large scale simulations we showed that the proposed methods have higher power to detect true causal genetic pleiotropic structure than other existing methods.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 37 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 1 3%
Australia 1 3%
Unknown 35 95%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 13 35%
Researcher 8 22%
Student > Bachelor 3 8%
Student > Doctoral Student 2 5%
Student > Postgraduate 2 5%
Other 3 8%
Unknown 6 16%
Readers by discipline Count As %
Agricultural and Biological Sciences 15 41%
Biochemistry, Genetics and Molecular Biology 6 16%
Medicine and Dentistry 3 8%
Computer Science 3 8%
Mathematics 2 5%
Other 2 5%
Unknown 6 16%