Report for: Enzyme mechanism prediction: a template matching problem on InterPro signature subspaces

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	Enzyme mechanism prediction: a template matching problem on InterPro signature subspaces
Published in	BMC Research Notes, December 2015
DOI	10.1186/s13104-015-1730-7
Pubmed ID	26634450
Authors	Hamse Y. Mussa, Luna De Ferrari, John B. O. Mitchell
Abstract	We recently reported that one may be able to predict with high accuracy the chemical mechanism of an enzyme by employing a simple pattern recognition approach: a k Nearest Neighbour rule with k = 1 (k1NN) and 321 InterPro sequence signatures as enzyme features. The nearest-neighbour rule is known to be highly sensitive to errors in the training data, in particular when the available training dataset is small. This was the case in our previous study, in which our dataset comprised 248 enzymes annotated against 71 enzymatic mechanism labels from the MACiE database. In the current study, we have carefully re-analysed our dataset and prediction results to "explain" why a high variance k1NN rule exhibited such remarkable classification performance. We find that enzymes with different chemical mechanism labels in this dataset reside in barely overlapping subspaces in the feature space defined by the 321 features selected. These features contain the appropriate information needed to accurately classify the enzymatic mechanisms, rendering our classification problem a basic look-up exercise. This observation dovetails with the low misclassification rate we reported. Our results provide explanations for the "anomaly"-a basic nearest-neighbour algorithm exhibiting remarkable prediction performance for enzymatic mechanism despite the fact that the feature space was large and sparse. Our results also dovetail well with another finding we reported, namely that InterPro signatures are critical for accurate prediction of enzyme mechanism. We also suggest simple rules that might enable one to inductively predict whether a novel enzyme possesses any of our 71 predefined mechanisms.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
Unknown	1	100%

Demographic breakdown

Type	Count	As %
Members of the public	1	100%

Mendeley readers

The data shown below were compiled from readership statistics for 5 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Unknown	5	100%

Demographic breakdown

Readers by professional status	Count	As %
Student > Master	2	40%
Lecturer > Senior Lecturer	1	20%
Student > Postgraduate	1	20%
Unknown	1	20%

Readers by discipline	Count	As %
Computer Science	1	20%
Agricultural and Biological Sciences	1	20%
Chemistry	1	20%
Unknown	2	40%

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 05 December 2015.

All research outputs

#15,351,145

of 22,834,308 outputs

Outputs from BMC Research Notes

#2,315

of 4,265 outputs

Outputs of similar age

#227,317

of 387,656 outputs

Outputs of similar age from BMC Research Notes

#81

of 152 outputs

Altmetric has tracked 22,834,308 research outputs across all sources so far. This one is in the 22nd percentile – i.e., 22% of other outputs scored the same or lower than it.

So far Altmetric has tracked 4,265 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.5. This one is in the 33rd percentile – i.e., 33% of its peers scored the same or lower than it.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 387,656 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 32nd percentile – i.e., 32% of its contemporaries scored the same or lower than it.

We're also able to compare this research output to 152 others from the same source and published within six weeks on either side of this one. This one is in the 34th percentile – i.e., 34% of its contemporaries scored the same or lower than it.

Enzyme mechanism prediction: a template matching problem on InterPro signature subspaces

About this Attention Score

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context