Report for: Verifying the fully “Laplacianised” posterior Naïve Bayesian approach and more

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	Verifying the fully “Laplacianised” posterior Naïve Bayesian approach and more
Published in	Journal of Cheminformatics, June 2015
DOI	10.1186/s13321-015-0075-5
Pubmed ID	26075027
Authors	Hamse Y Mussa, David Marcus, John B O Mitchell, Robert C Glen
Abstract	In a recent paper, Mussa, Mitchell and Glen (MMG) have mathematically demonstrated that the "Laplacian Corrected Modified Naïve Bayes" (LCMNB) algorithm can be viewed as a variant of the so-called Standard Naïve Bayes (SNB) scheme, whereby the role played by absence of compound features in classifying/assigning the compound to its appropriate class is ignored. MMG have also proffered guidelines regarding the conditions under which this omission may hold. Utilising three data sets, the present paper examines the validity of these guidelines in practice. The paper also extends MMG's work and introduces a new version of the SNB classifier: "Tapered Naïve Bayes" (TNB). TNB does not discard the role of absence of a feature out of hand, nor does it fully consider its role. Hence, TNB encapsulates both SNB and LCMNB. LCMNB, SNB and TNB performed differently on classifying 4,658, 5,031 and 1,149 ligands (all chosen from the ChEMBL Database) distributed over 31 enzymes, 23 membrane receptors, and one ion-channel, four transporters and one transcription factor as their target proteins. When the number of features utilised was equal to or smaller than the "optimal" number of features for a given data set, SNB classifiers systematically gave better classification results than those yielded by LCMNB classifiers. The opposite was true when the number of features employed was markedly larger than the "optimal" number of features for this data set. Nonetheless, these LCMNB performances were worse than the classification performance achieved by SNB when the "optimal" number of features for the data set was utilised. TNB classifiers systematically outperformed both SNB and LCMNB classifiers. The classification results obtained in this study concur with the mathematical based guidelines given in MMG's paper-that is, ignoring the role of absence of a feature out of hand does not necessarily improve classification performance of the SNB approach; if anything, it could make the performance of the SNB method worse. The results obtained also lend support to the rationale, on which the TNB algorithm rests: handled judiciously, taking into account absence of features can enhance (not impair) the discriminatory classification power of the SNB approach.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 5 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
Germany	1	20%
United Kingdom	1	20%
Unknown	3	60%

Demographic breakdown

Type	Count	As %
Members of the public	4	80%
Scientists	1	20%

Mendeley readers

The data shown below were compiled from readership statistics for 19 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
United Kingdom	1	5%
Unknown	18	95%

Demographic breakdown

Readers by professional status	Count	As %
Researcher	10	53%
Other	3	16%
Student > Master	2	11%
Professor	1	5%
Lecturer > Senior Lecturer	1	5%
Other	1	5%
Unknown	1	5%

Readers by discipline	Count	As %
Chemistry	6	32%
Biochemistry, Genetics and Molecular Biology	5	26%
Agricultural and Biological Sciences	3	16%
Computer Science	1	5%
Mathematics	1	5%
Other	0	0%
Unknown	3	16%

Attention Score in Context

This research output has an Altmetric Attention Score of 5. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 16 June 2015.

All research outputs

#6,148,629

of 22,811,321 outputs

Outputs from Journal of Cheminformatics

#515

of 833 outputs

Outputs of similar age

#71,546

of 264,930 outputs

Outputs of similar age from Journal of Cheminformatics

#11

of 20 outputs

Altmetric has tracked 22,811,321 research outputs across all sources so far. This one has received more attention than most of these and is in the 72nd percentile.

So far Altmetric has tracked 833 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 10.9. This one is in the 37th percentile – i.e., 37% of its peers scored the same or lower than it.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 264,930 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 72% of its contemporaries.

We're also able to compare this research output to 20 others from the same source and published within six weeks on either side of this one. This one is in the 45th percentile – i.e., 45% of its contemporaries scored the same or lower than it.

Verifying the fully “Laplacianised” posterior Naïve Bayesian approach and more

About this Attention Score

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context