Title |
A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens
|
---|---|
Published in |
Genome Medicine, November 2014
|
DOI | 10.1186/s13073-014-0101-7 |
Pubmed ID | |
Authors |
Maha R Farhat, B Jesse Shapiro, Samuel K Sheppard, Caroline Colijn, Megan Murray |
Abstract |
Whole genome sequencing is increasingly used to study phenotypic variation among infectious pathogens and to evaluate their relative transmissibility, virulence, and immunogenicity. To date, relatively little has been published on how and how many pathogen strains should be selected for studies associating phenotype and genotype. There are specific challenges when identifying genetic associations in bacteria which often comprise highly structured populations. Here we consider general methodological questions related to sampling and analysis focusing on clonal to moderately recombining pathogens. We propose that a matched sampling scheme constitutes an efficient study design, and provide a power calculator based on phylogenetic convergence. We demonstrate this approach by applying it to genomic datasets for two microbial pathogens: Mycobacterium tuberculosis and Campylobacter species. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 7 | 32% |
Spain | 2 | 9% |
United States | 2 | 9% |
Australia | 2 | 9% |
Cameroon | 1 | 5% |
India | 1 | 5% |
Canada | 1 | 5% |
Unknown | 6 | 27% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 14 | 64% |
Members of the public | 8 | 36% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 3 | 2% |
Brazil | 1 | <1% |
Netherlands | 1 | <1% |
Spain | 1 | <1% |
United Kingdom | 1 | <1% |
Unknown | 133 | 95% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 38 | 27% |
Student > Ph. D. Student | 21 | 15% |
Student > Master | 17 | 12% |
Student > Bachelor | 9 | 6% |
Professor | 7 | 5% |
Other | 22 | 16% |
Unknown | 26 | 19% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 41 | 29% |
Biochemistry, Genetics and Molecular Biology | 29 | 21% |
Medicine and Dentistry | 13 | 9% |
Immunology and Microbiology | 11 | 8% |
Computer Science | 4 | 3% |
Other | 11 | 8% |
Unknown | 31 | 22% |