↓ Skip to main content

Verifying the fully “Laplacianised” posterior Naïve Bayesian approach and more

Overview of attention for article published in Journal of Cheminformatics, June 2015
Altmetric Badge

About this Attention Score

  • Good Attention Score compared to outputs of the same age (72nd percentile)
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
5 X users

Citations

dimensions_citation
5 Dimensions

Readers on

mendeley
19 Mendeley
citeulike
3 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Verifying the fully “Laplacianised” posterior Naïve Bayesian approach and more
Published in
Journal of Cheminformatics, June 2015
DOI 10.1186/s13321-015-0075-5
Pubmed ID
Authors

Hamse Y Mussa, David Marcus, John B O Mitchell, Robert C Glen

Abstract

In a recent paper, Mussa, Mitchell and Glen (MMG) have mathematically demonstrated that the "Laplacian Corrected Modified Naïve Bayes" (LCMNB) algorithm can be viewed as a variant of the so-called Standard Naïve Bayes (SNB) scheme, whereby the role played by absence of compound features in classifying/assigning the compound to its appropriate class is ignored. MMG have also proffered guidelines regarding the conditions under which this omission may hold. Utilising three data sets, the present paper examines the validity of these guidelines in practice. The paper also extends MMG's work and introduces a new version of the SNB classifier: "Tapered Naïve Bayes" (TNB). TNB does not discard the role of absence of a feature out of hand, nor does it fully consider its role. Hence, TNB encapsulates both SNB and LCMNB. LCMNB, SNB and TNB performed differently on classifying 4,658, 5,031 and 1,149 ligands (all chosen from the ChEMBL Database) distributed over 31 enzymes, 23 membrane receptors, and one ion-channel, four transporters and one transcription factor as their target proteins. When the number of features utilised was equal to or smaller than the "optimal" number of features for a given data set, SNB classifiers systematically gave better classification results than those yielded by LCMNB classifiers. The opposite was true when the number of features employed was markedly larger than the "optimal" number of features for this data set. Nonetheless, these LCMNB performances were worse than the classification performance achieved by SNB when the "optimal" number of features for the data set was utilised. TNB classifiers systematically outperformed both SNB and LCMNB classifiers. The classification results obtained in this study concur with the mathematical based guidelines given in MMG's paper-that is, ignoring the role of absence of a feature out of hand does not necessarily improve classification performance of the SNB approach; if anything, it could make the performance of the SNB method worse. The results obtained also lend support to the rationale, on which the TNB algorithm rests: handled judiciously, taking into account absence of features can enhance (not impair) the discriminatory classification power of the SNB approach.

X Demographics

X Demographics

The data shown below were collected from the profiles of 5 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 19 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United Kingdom 1 5%
Unknown 18 95%

Demographic breakdown

Readers by professional status Count As %
Researcher 10 53%
Other 3 16%
Student > Master 2 11%
Professor 1 5%
Lecturer > Senior Lecturer 1 5%
Other 1 5%
Unknown 1 5%
Readers by discipline Count As %
Chemistry 6 32%
Biochemistry, Genetics and Molecular Biology 5 26%
Agricultural and Biological Sciences 3 16%
Computer Science 1 5%
Mathematics 1 5%
Other 0 0%
Unknown 3 16%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 5. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 16 June 2015.
All research outputs
#6,148,629
of 22,811,321 outputs
Outputs from Journal of Cheminformatics
#515
of 833 outputs
Outputs of similar age
#71,546
of 264,930 outputs
Outputs of similar age from Journal of Cheminformatics
#11
of 20 outputs
Altmetric has tracked 22,811,321 research outputs across all sources so far. This one has received more attention than most of these and is in the 72nd percentile.
So far Altmetric has tracked 833 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 10.9. This one is in the 37th percentile – i.e., 37% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 264,930 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 72% of its contemporaries.
We're also able to compare this research output to 20 others from the same source and published within six weeks on either side of this one. This one is in the 45th percentile – i.e., 45% of its contemporaries scored the same or lower than it.