↓ Skip to main content

An efficient heuristic method for active feature acquisition and its application to protein-protein interaction prediction

Overview of attention for article published in BMC Proceedings, November 2012
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
10 Dimensions

Readers on

mendeley
13 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
An efficient heuristic method for active feature acquisition and its application to protein-protein interaction prediction
Published in
BMC Proceedings, November 2012
DOI 10.1186/1753-6561-6-s7-s2
Pubmed ID
Authors

Mohamed Thahir, Tarun Sharma, Madhavi K Ganapathiraju

Abstract

Machine learning approaches for classification learn the pattern of the feature space of different classes, or learn a boundary that separates the feature space into different classes. The features of the data instances are usually available, and it is only the class-labels of the instances that are unavailable. For example, to classify text documents into different topic categories, the words in the documents are features and they are readily available, whereas the topic is what is predicted. However, in some domains obtaining features may be resource-intensive because of which not all features may be available. An example is that of protein-protein interaction prediction, where not only are the labels ('interacting' or 'non-interacting') unavailable, but so are some of the features. It may be possible to obtain at least some of the missing features by carrying out a few experiments as permitted by the available resources. If only a few experiments can be carried out to acquire missing features, which proteins should be studied and which features of those proteins should be determined? From the perspective of machine learning for PPI prediction, it would be desirable that those features be acquired which when used in training the classifier, the accuracy of the classifier is improved the most. That is, the utility of the feature-acquisition is measured in terms of how much acquired features contribute to improving the accuracy of the classifier. Active feature acquisition (AFA) is a strategy to preselect such instance-feature combinations (i.e. protein and experiment combinations) for maximum utility. The goal of AFA is the creation of optimal training set that would result in the best classifier, and not in determining the best classification model itself.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 13 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 13 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 3 23%
Student > Doctoral Student 2 15%
Student > Master 2 15%
Professor 1 8%
Other 1 8%
Other 1 8%
Unknown 3 23%
Readers by discipline Count As %
Computer Science 6 46%
Agricultural and Biological Sciences 2 15%
Mathematics 1 8%
Unknown 4 31%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 10 December 2012.
All research outputs
#18,323,689
of 22,689,790 outputs
Outputs from BMC Proceedings
#265
of 374 outputs
Outputs of similar age
#136,335
of 179,097 outputs
Outputs of similar age from BMC Proceedings
#5
of 7 outputs
Altmetric has tracked 22,689,790 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.
So far Altmetric has tracked 374 research outputs from this source. They receive a mean Attention Score of 4.0. This one is in the 15th percentile – i.e., 15% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 179,097 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 11th percentile – i.e., 11% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 7 others from the same source and published within six weeks on either side of this one. This one has scored higher than 2 of them.