Title |
An ensemble approach to accurately detect somatic mutations using SomaticSeq
|
---|---|
Published in |
Genome Biology, September 2015
|
DOI | 10.1186/s13059-015-0758-2 |
Pubmed ID | |
Authors |
Li Tai Fang, Pegah Tootoonchi Afshar, Aparna Chhibber, Marghoob Mohiyuddin, Yu Fan, John C. Mu, Greg Gibeling, Sharon Barr, Narges Bani Asadi, Mark B. Gerstein, Daniel C. Koboldt, Wenyi Wang, Wing H. Wong, Hugo Y.K. Lam |
Abstract |
SomaticSeq is an accurate somatic mutation detection pipeline implementing a stochastic boosting algorithm to produce highly accurate somatic mutation calls for both single nucleotide variants and small insertions and deletions. The workflow currently incorporates five state-of-the-art somatic mutation callers, and extracts over 70 individual genomic and sequencing features for each candidate site. A training set is provided to an adaptively boosted decision tree learner to create a classifier for predicting mutation statuses. We validate our results with both synthetic and real data. We report that SomaticSeq is able to achieve better overall accuracy than any individual tool incorporated. |
Twitter Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 16 | 52% |
United Kingdom | 4 | 13% |
India | 1 | 3% |
Australia | 1 | 3% |
Sao Tome and Principe | 1 | 3% |
Unknown | 8 | 26% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 18 | 58% |
Members of the public | 12 | 39% |
Science communicators (journalists, bloggers, editors) | 1 | 3% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Italy | 2 | 1% |
United States | 2 | 1% |
Korea, Republic of | 1 | <1% |
Netherlands | 1 | <1% |
United Kingdom | 1 | <1% |
Australia | 1 | <1% |
Taiwan | 1 | <1% |
New Zealand | 1 | <1% |
Unknown | 172 | 95% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 50 | 27% |
Student > Master | 27 | 15% |
Student > Ph. D. Student | 25 | 14% |
Student > Bachelor | 13 | 7% |
Other | 9 | 5% |
Other | 23 | 13% |
Unknown | 35 | 19% |
Readers by discipline | Count | As % |
---|---|---|
Biochemistry, Genetics and Molecular Biology | 45 | 25% |
Agricultural and Biological Sciences | 45 | 25% |
Computer Science | 24 | 13% |
Medicine and Dentistry | 13 | 7% |
Engineering | 8 | 4% |
Other | 9 | 5% |
Unknown | 38 | 21% |