Title |
Biased estimates of clonal evolution and subclonal heterogeneity can arise from PCR duplicates in deep sequencing experiments
|
---|---|
Published in |
Genome Biology, August 2014
|
DOI | 10.1186/s13059-014-0420-4 |
Pubmed ID | |
Authors |
Erin N Smith, Kristen Jepsen, Mahdieh Khosroheidari, Laura Z Rassenti, Matteo D’Antonio, Emanuela M Ghia, Dennis A Carson, Catriona HM Jamieson, Thomas J Kipps, Kelly A Frazer |
Abstract |
Accurate allele frequencies are important for measuring subclonal heterogeneity and clonal evolution. Deep-targeted sequencing data can contain PCR duplicates, inflating perceived read depth. Here we adapted the Illumina TruSeq Custom Amplicon kit to include single molecule tagging (SMT) and show that SMT-identified duplicates arise from PCR. We demonstrate that retention of PCR duplicate reads can imply clonal evolution when none exists, while their removal effectively controls the false positive rate. Additionally, PCR duplicates alter estimates of subclonal heterogeneity in tumor samples. Our method simplifies PCR duplicate identification and emphasizes their removal in studies of tumor heterogeneity and clonal evolution. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 3 | 43% |
United Kingdom | 2 | 29% |
Canada | 1 | 14% |
Unknown | 1 | 14% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 3 | 43% |
Members of the public | 3 | 43% |
Science communicators (journalists, bloggers, editors) | 1 | 14% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 4 | 3% |
United States | 4 | 3% |
Sweden | 1 | <1% |
Canada | 1 | <1% |
Unknown | 113 | 92% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 35 | 28% |
Researcher | 34 | 28% |
Student > Master | 8 | 7% |
Student > Doctoral Student | 6 | 5% |
Student > Bachelor | 6 | 5% |
Other | 20 | 16% |
Unknown | 14 | 11% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 53 | 43% |
Biochemistry, Genetics and Molecular Biology | 26 | 21% |
Medicine and Dentistry | 10 | 8% |
Computer Science | 9 | 7% |
Neuroscience | 2 | 2% |
Other | 7 | 6% |
Unknown | 16 | 13% |