↓ Skip to main content

Feature analysis for classification of trace fluorescent labeled protein crystallization images

Overview of attention for article published in BioData Mining, April 2017
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
15 Dimensions

Readers on

mendeley
21 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Feature analysis for classification of trace fluorescent labeled protein crystallization images
Published in
BioData Mining, April 2017
DOI 10.1186/s13040-017-0133-9
Pubmed ID
Authors

Madhav Sigdel, Imren Dinc, Madhu S. Sigdel, Semih Dinc, Marc L. Pusey, Ramazan S. Aygun

Abstract

Large number of features are extracted from protein crystallization trial images to improve the accuracy of classifiers for predicting the presence of crystals or phases of the crystallization process. The excessive number of features and computationally intensive image processing methods to extract these features make utilization of automated classification tools on stand-alone computing systems inconvenient due to the required time to complete the classification tasks. Combinations of image feature sets, feature reduction and classification techniques for crystallization images benefiting from trace fluorescence labeling are investigated. Features are categorized into intensity, graph, histogram, texture, shape adaptive, and region features (using binarized images generated by Otsu's, green percentile, and morphological thresholding). The effects of normalization, feature reduction with principle components analysis (PCA), and feature selection using random forest classifier are also analyzed. The time required to extract feature categories is computed and an estimated time of extraction is provided for feature category combinations. We have conducted around 8624 experiments (different combinations of feature categories, binarization methods, feature reduction/selection, normalization, and crystal categories). The best experimental results are obtained using combinations of intensity features, region features using Otsu's thresholding, region features using green percentile G90 thresholding, region features using green percentile G99 thresholding, graph features, and histogram features. Using this feature set combination, 96% accuracy (without misclassifying crystals as non-crystals) was achieved for the first level of classification to determine presence of crystals. Since missing a crystal is not desired, our algorithm is adjusted to achieve a high sensitivity rate. In the second level classification, 74.2% accuracy for (5-class) crystal sub-category classification. Best classification rates were achieved using random forest classifier. The feature extraction and classification could be completed in about 2 s per image on a stand-alone computing system, which is suitable for real time analysis. These results enable research groups to select features according to their hardware setups for real-time analysis.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 21 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 21 100%

Demographic breakdown

Readers by professional status Count As %
Student > Master 3 14%
Lecturer 2 10%
Researcher 2 10%
Other 1 5%
Student > Doctoral Student 1 5%
Other 4 19%
Unknown 8 38%
Readers by discipline Count As %
Computer Science 6 29%
Pharmacology, Toxicology and Pharmaceutical Science 1 5%
Agricultural and Biological Sciences 1 5%
Biochemistry, Genetics and Molecular Biology 1 5%
Earth and Planetary Sciences 1 5%
Other 1 5%
Unknown 10 48%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 04 May 2017.
All research outputs
#20,418,183
of 22,968,808 outputs
Outputs from BioData Mining
#289
of 308 outputs
Outputs of similar age
#269,443
of 309,813 outputs
Outputs of similar age from BioData Mining
#8
of 8 outputs
Altmetric has tracked 22,968,808 research outputs across all sources so far. This one is in the 1st percentile – i.e., 1% of other outputs scored the same or lower than it.
So far Altmetric has tracked 308 research outputs from this source. They typically receive more attention than average, with a mean Attention Score of 7.7. This one is in the 1st percentile – i.e., 1% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 309,813 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 1st percentile – i.e., 1% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 8 others from the same source and published within six weeks on either side of this one.