↓ Skip to main content

Incorporating biological prior knowledge for Bayesian learning via maximal knowledge-driven information priors

Overview of attention for article published in BMC Bioinformatics, December 2017
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • Good Attention Score compared to outputs of the same age (79th percentile)
  • High Attention Score compared to outputs of the same age and source (82nd percentile)

Mentioned by

policy
1 policy source
twitter
2 X users
wikipedia
1 Wikipedia page

Citations

dimensions_citation
33 Dimensions

Readers on

mendeley
45 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Incorporating biological prior knowledge for Bayesian learning via maximal knowledge-driven information priors
Published in
BMC Bioinformatics, December 2017
DOI 10.1186/s12859-017-1893-4
Pubmed ID
Authors

Shahin Boluki, Mohammad Shahrokh Esfahani, Xiaoning Qian, Edward R Dougherty

Abstract

Phenotypic classification is problematic because small samples are ubiquitous; and, for these, use of prior knowledge is critical. If knowledge concerning the feature-label distribution - for instance, genetic pathways - is available, then it can be used in learning. Optimal Bayesian classification provides optimal classification under model uncertainty. It differs from classical Bayesian methods in which a classification model is assumed and prior distributions are placed on model parameters. With optimal Bayesian classification, uncertainty is treated directly on the feature-label distribution, which assures full utilization of prior knowledge and is guaranteed to outperform classical methods. The salient problem confronting optimal Bayesian classification is prior construction. In this paper, we propose a new prior construction methodology based on a general framework of constraints in the form of conditional probability statements. We call this prior the maximal knowledge-driven information prior (MKDIP). The new constraint framework is more flexible than our previous methods as it naturally handles the potential inconsistency in archived regulatory relationships and conditioning can be augmented by other knowledge, such as population statistics. We also extend the application of prior construction to a multinomial mixture model when labels are unknown, which often occurs in practice. The performance of the proposed methods is examined on two important pathway families, the mammalian cell-cycle and a set of p53-related pathways, and also on a publicly available gene expression dataset of non-small cell lung cancer when combined with the existing prior knowledge on relevant signaling pathways. The new proposed general prior construction framework extends the prior construction methodology to a more flexible framework that results in better inference when proper prior knowledge exists. Moreover, the extension of optimal Bayesian classification to multinomial mixtures where data sets are both small and unlabeled, enables superior classifier design using small, unstructured data sets. We have demonstrated the effectiveness of our approach using pathway information and available knowledge of gene regulating functions; however, the underlying theory can be applied to a wide variety of knowledge types, and other applications when there are small samples.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 45 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 45 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 15 33%
Researcher 8 18%
Student > Bachelor 3 7%
Other 3 7%
Student > Postgraduate 2 4%
Other 5 11%
Unknown 9 20%
Readers by discipline Count As %
Computer Science 8 18%
Medicine and Dentistry 7 16%
Agricultural and Biological Sciences 7 16%
Engineering 4 9%
Biochemistry, Genetics and Molecular Biology 2 4%
Other 6 13%
Unknown 11 24%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 8. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 16 June 2022.
All research outputs
#3,992,746
of 22,675,759 outputs
Outputs from BMC Bioinformatics
#1,543
of 7,249 outputs
Outputs of similar age
#88,412
of 440,138 outputs
Outputs of similar age from BMC Bioinformatics
#25
of 143 outputs
Altmetric has tracked 22,675,759 research outputs across all sources so far. Compared to these this one has done well and is in the 82nd percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 7,249 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one has done well, scoring higher than 78% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 440,138 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 79% of its contemporaries.
We're also able to compare this research output to 143 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 82% of its contemporaries.