↓ Skip to main content

Alignment behaviors of short peptides provide a roadmap for functional profiling of metagenomic data

Overview of attention for article published in BMC Genomics, December 2015
Altmetric Badge

About this Attention Score

  • Above-average Attention Score compared to outputs of the same age (53rd percentile)
  • Above-average Attention Score compared to outputs of the same age and source (55th percentile)

Mentioned by

twitter
5 X users

Citations

dimensions_citation
1 Dimensions

Readers on

mendeley
26 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Alignment behaviors of short peptides provide a roadmap for functional profiling of metagenomic data
Published in
BMC Genomics, December 2015
DOI 10.1186/s12864-015-2272-z
Pubmed ID
Authors

Rohita Sinha, Jennifer Clarke, Andrew K. Benson

Abstract

Functional assignments for short-read metagenomic data pose a significant computational challenge due to perceived unpredictability of alignment behavior and the inability to infer useful functional information from translated protein-fragments/peptides. To address this problem, we have examined the predictability of short peptide alignments by systematically studying alignment behavior of large sets of short peptides generated from well-characterized proteins as well as hypothetical proteins in the KEGG database. Using test sets of peptides modeling the length and phylogenetic distributions of short-read metagenomic data, we observed that peptides from well-characterized proteins had indistinguishable alignments to proteins from the same orthologous family and proteins from different families. Nonetheless, the patterns contained remarkable phylogenetic and structural signals, with alignments of even very short peptides naturally restricted to their orthologous family and/or proteins having similar structural folds. In stark contrast, peptides from "hypothetical proteins" had only sparse hit patterns with low frequencies and much lower identities. By weighting the structure-driven alignments and filtering peptides with behaviors similar to those derived from "hypothetical proteins", we demonstrate that the accuracy of abundance predictions of protein families is dramatically improved. Evolutionary processes have dispersed protein folds across multiple protein families, precluding accurate functional assignment to short peptides, whose alignment behavior is non-random and driven by structure. Algorithms that filter sparse peptides and weight hit patterns of peptides from "known space" dramatically improve quantification of functions from diverse mixtures of peptides and should substantially improve applications of metagenomic analyses requiring accurate quantitative measures of functional families.

X Demographics

X Demographics

The data shown below were collected from the profiles of 5 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 26 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 26 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 9 35%
Researcher 8 31%
Student > Master 3 12%
Student > Bachelor 1 4%
Professor > Associate Professor 1 4%
Other 0 0%
Unknown 4 15%
Readers by discipline Count As %
Agricultural and Biological Sciences 10 38%
Biochemistry, Genetics and Molecular Biology 8 31%
Immunology and Microbiology 2 8%
Environmental Science 1 4%
Engineering 1 4%
Other 0 0%
Unknown 4 15%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 3. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 24 September 2016.
All research outputs
#13,339,171
of 23,498,099 outputs
Outputs from BMC Genomics
#4,683
of 10,787 outputs
Outputs of similar age
#181,367
of 392,705 outputs
Outputs of similar age from BMC Genomics
#141
of 324 outputs
Altmetric has tracked 23,498,099 research outputs across all sources so far. This one is in the 42nd percentile – i.e., 42% of other outputs scored the same or lower than it.
So far Altmetric has tracked 10,787 research outputs from this source. They receive a mean Attention Score of 4.7. This one has gotten more attention than average, scoring higher than 55% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 392,705 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 53% of its contemporaries.
We're also able to compare this research output to 324 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 55% of its contemporaries.