↓ Skip to main content

Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy

Overview of attention for article published in BMC Genomic Data, October 2014
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
38 Dimensions

Readers on

mendeley
46 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy
Published in
BMC Genomic Data, October 2014
DOI 10.1186/s12863-014-0105-8
Pubmed ID
Authors

Aniek C Bouwman, Roel F Veerkamp

Abstract

BackgroundThe aim of this study was to determine the consequences of splitting sequencing effort over multiple breeds for imputation accuracy from a high-density SNP chip towards whole-genome sequence. Such information would assist for instance numerical smaller cattle breeds, but also pig and chicken breeders, who have to choose wisely how to spend their sequencing efforts over all the breeds or lines they evaluate. Sequence data from cattle breeds was used, because there are currently relatively many individuals from several breeds sequenced within the 1,000 Bull Genomes project. The advantage of whole-genome sequence data is that it carries the causal mutations, but the question is whether it is possible to impute the causal variants accurately. This study therefore focussed on imputation accuracy of variants with low minor allele frequency and breed specific variants.ResultsImputation accuracy was assessed for chromosome 1 and 29 as the correlation between observed and imputed genotypes. For chromosome 1, the average imputation accuracy was 0.70 with a reference population of 20 Holstein, and increased to 0.83 when the reference population was increased by including 3 other dairy breeds with 20 animals each. When the same amount of animals from the Holstein breed were added the accuracy improved to 0.88, while adding the 3 other breeds to the reference population of 80 Holstein improved the average imputation accuracy marginally to 0.89. For chromosome 29, the average imputation accuracy was lower. Some variants benefitted from the inclusion of other breeds in the reference population, initially determined by the MAF of the variant in each breed, but even Holstein specific variants did gain imputation accuracy from the multi-breed reference population.ConclusionsThis study shows that splitting sequencing effort over multiple breeds and combining the reference populations is a good strategy for imputation from high-density SNP panels towards whole-genome sequence when reference populations are small and sequencing effort is limiting. When sequencing effort is limiting and interest lays in multiple breeds or lines this provides imputation of each breed.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 46 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Netherlands 1 2%
United States 1 2%
France 1 2%
Unknown 43 93%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 10 22%
Researcher 10 22%
Student > Master 9 20%
Student > Doctoral Student 4 9%
Other 3 7%
Other 3 7%
Unknown 7 15%
Readers by discipline Count As %
Agricultural and Biological Sciences 30 65%
Biochemistry, Genetics and Molecular Biology 3 7%
Computer Science 2 4%
Mathematics 1 2%
Veterinary Science and Veterinary Medicine 1 2%
Other 2 4%
Unknown 7 15%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 04 October 2014.
All research outputs
#22,756,649
of 25,371,288 outputs
Outputs from BMC Genomic Data
#1,008
of 1,204 outputs
Outputs of similar age
#227,537
of 266,010 outputs
Outputs of similar age from BMC Genomic Data
#19
of 26 outputs
Altmetric has tracked 25,371,288 research outputs across all sources so far. This one is in the 1st percentile – i.e., 1% of other outputs scored the same or lower than it.
So far Altmetric has tracked 1,204 research outputs from this source. They receive a mean Attention Score of 4.3. This one is in the 1st percentile – i.e., 1% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 266,010 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 1st percentile – i.e., 1% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 26 others from the same source and published within six weeks on either side of this one. This one is in the 1st percentile – i.e., 1% of its contemporaries scored the same or lower than it.