Title | Whole genome sequencing of an ethnic Pathan (Pakhtun) from the north-west of Pakistan |
---|---|
Published in | BMC Genomics, March 2015 |
DOI | 10.1186/s12864-015-1290-1 |
Pubmed ID | |
Authors | Muhammad Ilyas, Jong-Soo Kim, Jesse Cooper, Young-Ah Shin, Hak-Min Kim, Yun Sung Cho, Seungwoo Hwang, Hyunho Kim, Jaewoo Moon, Oksung Chung, JeHoon Jun, Achal Rastogi, Sanghoon Song, Junsu Ko, Andrea Manica, Ziaur Rahman, Tayyab Husnain, Jong Bhak |
Abstract | Pakistan covers a key geographic area in human history, being both part of the Indus River region that acted as one of the cradles of civilization and as a link between Western Eurasia and Eastern Asia. This region is inhabited by a number of distinct ethnic groups, the largest being the Punjabi, Pathan (Pakhtuns), Sindhi, and Baloch. We analyzed the first ethnic male Pathan genome by sequencing it to 29.7-fold coverage using the Illumina HiSeq2000 platform. A total of 3.8 million single nucleotide variations (SNVs) and 0.5 million small indels were identified by comparing with the human reference genome. Among the SNVs, 129,441 were novel, and 10,315 nonsynonymous SNVs were found in 5,344 genes. SNVs were annotated for health consequences and high risk diseases, as well as possible influences on drug efficacy. We confirmed that the Pathan genome presented here is representative of this ethnic group by comparing it to a panel of Central Asians from the HGDP-CEPH panels typed for ~650 k SNPs. The mtDNA (H2) and Y haplogroup (L1) of this individual were also typical of his geographic region of origin. Finally, we reconstruct the demographic history by PSMC, which highlights a recent increase in effective population size compatible with admixture between European and Asian lineages expected in this geographic region. We present a whole-genome sequence and analyses of an ethnic Pathan from the north-west province of Pakistan. It is a useful resource to understand genetic variation and human migration across the whole Asian continent. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 3 | 33% |
Pakistan | 1 | 11% |
Mexico | 1 | 11% |
United Kingdom | 1 | 11% |
Unknown | 3 | 33% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 5 | 56% |
Scientists | 3 | 33% |
Practitioners (doctors, other healthcare professionals) | 1 | 11% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 1 | 2% |
South Africa | 1 | 2% |
Brazil | 1 | 2% |
Unknown | 47 | 94% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 12 | 24% |
Student > Ph. D. Student | 11 | 22% |
Student > Master | 7 | 14% |
Student > Doctoral Student | 4 | 8% |
Other | 3 | 6% |
Other | 8 | 16% |
Unknown | 5 | 10% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 21 | 42% |
Biochemistry, Genetics and Molecular Biology | 15 | 30% |
Computer Science | 2 | 4% |
Neuroscience | 2 | 4% |
Nursing and Health Professions | 1 | 2% |
Other | 3 | 6% |
Unknown | 6 | 12% |