↓ Skip to main content

Machine learning classifier for identification of damaging missense mutations exclusive to human mitochondrial DNA-encoded polypeptides

Overview of attention for article published in BMC Bioinformatics, March 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Above-average Attention Score compared to outputs of the same age and source (56th percentile)

Mentioned by

twitter
3 X users

Citations

dimensions_citation
26 Dimensions

Readers on

mendeley
36 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Machine learning classifier for identification of damaging missense mutations exclusive to human mitochondrial DNA-encoded polypeptides
Published in
BMC Bioinformatics, March 2017
DOI 10.1186/s12859-017-1562-7
Pubmed ID
Authors

Antonio Martín-Navarro, Andrés Gaudioso-Simón, Jorge Álvarez-Jarreta, Julio Montoya, Elvira Mayordomo, Eduardo Ruiz-Pesini

Abstract

Several methods have been developed to predict the pathogenicity of missense mutations but none has been specifically designed for classification of variants in mtDNA-encoded polypeptides. Moreover, there is not available curated dataset of neutral and damaging mtDNA missense variants to test the accuracy of predictors. Because mtDNA sequencing of patients suffering mitochondrial diseases is revealing many missense mutations, it is needed to prioritize candidate substitutions for further confirmation. Predictors can be useful as screening tools but their performance must be improved. We have developed a SVM classifier (Mitoclass.1) specific for mtDNA missense variants. Training and validation of the model was executed with 2,835 mtDNA damaging and neutral amino acid substitutions, previously curated by a set of rigorous pathogenicity criteria with high specificity. Each instance is described by a set of three attributes based on evolutionary conservation in Eukaryota of wildtype and mutant amino acids as well as coevolution and a novel evolutionary analysis of specific substitutions belonging to the same domain of mitochondrial polypeptides. Our classifier has performed better than other web-available tested predictors. We checked performance of three broadly used predictors with the total mutations of our curated dataset. PolyPhen-2 showed the best results for a screening proposal with a good sensitivity. Nevertheless, the number of false positive predictions was too high. Our method has an improved sensitivity and better specificity in relation to PolyPhen-2. We also publish predictions for the complete set of 24,201 possible missense variants in the 13 human mtDNA-encoded polypeptides. Mitoclass.1 allows a better selection of candidate damaging missense variants from mtDNA. A careful search of discriminatory attributes and a training step based on a curated dataset of amino acid substitutions belonging exclusively to human mtDNA genes allows an improved performance. Mitoclass.1 accuracy could be improved in the future when more mtDNA missense substitutions will be available for updating the attributes and retraining the model.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 36 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Germany 1 3%
Unknown 35 97%

Demographic breakdown

Readers by professional status Count As %
Student > Master 5 14%
Student > Ph. D. Student 5 14%
Student > Bachelor 4 11%
Other 3 8%
Researcher 3 8%
Other 5 14%
Unknown 11 31%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 8 22%
Computer Science 5 14%
Agricultural and Biological Sciences 3 8%
Unspecified 2 6%
Medicine and Dentistry 2 6%
Other 6 17%
Unknown 10 28%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 3. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 11 March 2017.
All research outputs
#13,030,137
of 22,958,253 outputs
Outputs from BMC Bioinformatics
#3,806
of 7,306 outputs
Outputs of similar age
#150,465
of 307,995 outputs
Outputs of similar age from BMC Bioinformatics
#54
of 130 outputs
Altmetric has tracked 22,958,253 research outputs across all sources so far. This one is in the 42nd percentile – i.e., 42% of other outputs scored the same or lower than it.
So far Altmetric has tracked 7,306 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 45th percentile – i.e., 45% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 307,995 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 50% of its contemporaries.
We're also able to compare this research output to 130 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 56% of its contemporaries.