↓ Skip to main content

3D deep convolutional neural networks for amino acid environment similarity analysis

Overview of attention for article published in BMC Bioinformatics, June 2017
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • Good Attention Score compared to outputs of the same age (72nd percentile)
  • Good Attention Score compared to outputs of the same age and source (74th percentile)

Mentioned by

twitter
5 X users
patent
1 patent

Citations

dimensions_citation
113 Dimensions

Readers on

mendeley
287 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
3D deep convolutional neural networks for amino acid environment similarity analysis
Published in
BMC Bioinformatics, June 2017
DOI 10.1186/s12859-017-1702-0
Pubmed ID
Authors

Wen Torng, Russ B. Altman

Abstract

Central to protein biology is the understanding of how structural elements give rise to observed function. The surfeit of protein structural data enables development of computational methods to systematically derive rules governing structural-functional relationships. However, performance of these methods depends critically on the choice of protein structural representation. Most current methods rely on features that are manually selected based on knowledge about protein structures. These are often general-purpose but not optimized for the specific application of interest. In this paper, we present a general framework that applies 3D convolutional neural network (3DCNN) technology to structure-based protein analysis. The framework automatically extracts task-specific features from the raw atom distribution, driven by supervised labels. As a pilot study, we use our network to analyze local protein microenvironments surrounding the 20 amino acids, and predict the amino acids most compatible with environments within a protein structure. To further validate the power of our method, we construct two amino acid substitution matrices from the prediction statistics and use them to predict effects of mutations in T4 lysozyme structures. Our deep 3DCNN achieves a two-fold increase in prediction accuracy compared to models that employ conventional hand-engineered features and successfully recapitulates known information about similar and different microenvironments. Models built from our predictions and substitution matrices achieve an 85% accuracy predicting outcomes of the T4 lysozyme mutation variants. Our substitution matrices contain rich information relevant to mutation analysis compared to well-established substitution matrices. Finally, we present a visualization method to inspect the individual contributions of each atom to the classification decisions. End-to-end trained deep learning networks consistently outperform methods using hand-engineered features, suggesting that the 3DCNN framework is well suited for analysis of protein microenvironments and may be useful for other protein structural analyses.

X Demographics

X Demographics

The data shown below were collected from the profiles of 5 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 287 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Canada 1 <1%
Unknown 286 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 59 21%
Researcher 47 16%
Student > Master 29 10%
Student > Bachelor 26 9%
Student > Doctoral Student 11 4%
Other 37 13%
Unknown 78 27%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 67 23%
Computer Science 40 14%
Agricultural and Biological Sciences 24 8%
Chemistry 24 8%
Engineering 17 6%
Other 30 10%
Unknown 85 30%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 6. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 10 May 2023.
All research outputs
#5,917,449
of 24,226,848 outputs
Outputs from BMC Bioinformatics
#2,029
of 7,512 outputs
Outputs of similar age
#89,037
of 321,304 outputs
Outputs of similar age from BMC Bioinformatics
#31
of 119 outputs
Altmetric has tracked 24,226,848 research outputs across all sources so far. Compared to these this one has done well and is in the 75th percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 7,512 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.5. This one has gotten more attention than average, scoring higher than 72% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 321,304 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 72% of its contemporaries.
We're also able to compare this research output to 119 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 74% of its contemporaries.