↓ Skip to main content

Predicting the recurrence of noncoding regulatory mutations in cancer

Overview of attention for article published in BMC Bioinformatics, December 2016
Altmetric Badge

About this Attention Score

  • Good Attention Score compared to outputs of the same age (70th percentile)
  • Good Attention Score compared to outputs of the same age and source (71st percentile)

Mentioned by

twitter
7 X users

Citations

dimensions_citation
9 Dimensions

Readers on

mendeley
42 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Predicting the recurrence of noncoding regulatory mutations in cancer
Published in
BMC Bioinformatics, December 2016
DOI 10.1186/s12859-016-1385-y
Pubmed ID
Authors

Woojin Yang, Hyoeun Bang, Kiwon Jang, Min Kyung Sung, Jung Kyoon Choi

Abstract

One of the greatest challenges in cancer genomics is to distinguish driver mutations from passenger mutations. Whereas recurrence is a hallmark of driver mutations, it is difficult to observe recurring noncoding mutations owing to a limited amount of whole-genome sequenced samples. Hence, it is required to develop a method to predict potentially recurrent mutations. In this work, we developed a random forest classifier that predicts regulatory mutations that may recur based on the features of the mutations repeatedly appearing in a given cohort. With breast cancer as a model, we profiled 35 quantitative features describing genetic and epigenetic signals at the mutation site, transcription factors whose binding motif was disrupted by the mutation, and genes targeted by long-range chromatin interactions. A true set of mutations for machine learning was generated by interrogating publicly available pan-cancer genomes based on our statistical model of mutation recurrence. The performance of our random forest classifier was evaluated by cross validations. The variable importance of each feature in the classification of mutations was investigated. Our statistical recurrence model for the random forest classifier showed an area under the curve (AUC) of ~0.78 in predicting recurrent mutations. Chromatin accessibility at the mutation sites, the distance from the mutations to known cancer risk loci, and the role of the target genes in the regulatory or protein interaction network were among the most important variables. Our methods enable to characterize recurrent regulatory mutations using a limited number of whole-genome samples, and based on the characterization, to predict potential driver mutations whose recurrence is not found in the given samples but likely to be observed with additional samples.

X Demographics

X Demographics

The data shown below were collected from the profiles of 7 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 42 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 42 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 9 21%
Student > Master 9 21%
Student > Bachelor 5 12%
Professor > Associate Professor 5 12%
Researcher 3 7%
Other 7 17%
Unknown 4 10%
Readers by discipline Count As %
Computer Science 12 29%
Agricultural and Biological Sciences 7 17%
Biochemistry, Genetics and Molecular Biology 5 12%
Medicine and Dentistry 5 12%
Engineering 3 7%
Other 3 7%
Unknown 7 17%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 4. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 08 August 2017.
All research outputs
#6,447,992
of 22,908,162 outputs
Outputs from BMC Bioinformatics
#2,483
of 7,305 outputs
Outputs of similar age
#118,517
of 416,044 outputs
Outputs of similar age from BMC Bioinformatics
#34
of 126 outputs
Altmetric has tracked 22,908,162 research outputs across all sources so far. This one has received more attention than most of these and is in the 70th percentile.
So far Altmetric has tracked 7,305 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one has gotten more attention than average, scoring higher than 64% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 416,044 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 70% of its contemporaries.
We're also able to compare this research output to 126 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 71% of its contemporaries.