↓ Skip to main content

RNA-sequence data normalization through in silico prediction of reference genes: the bacterial response to DNA damage as case study

Overview of attention for article published in BioData Mining, September 2017
Altmetric Badge

About this Attention Score

  • Good Attention Score compared to outputs of the same age (71st percentile)

Mentioned by

twitter
3 X users
patent
1 patent

Citations

dimensions_citation
16 Dimensions

Readers on

mendeley
40 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
RNA-sequence data normalization through in silico prediction of reference genes: the bacterial response to DNA damage as case study
Published in
BioData Mining, September 2017
DOI 10.1186/s13040-017-0150-8
Pubmed ID
Authors

Bork A. Berghoff, Torgny Karlsson, Thomas Källman, E. Gerhart H. Wagner, Manfred G. Grabherr

Abstract

Measuring how gene expression changes in the course of an experiment assesses how an organism responds on a molecular level. Sequencing of RNA molecules, and their subsequent quantification, aims to assess global gene expression changes on the RNA level (transcriptome). While advances in high-throughput RNA-sequencing (RNA-seq) technologies allow for inexpensive data generation, accurate post-processing and normalization across samples is required to eliminate any systematic noise introduced by the biochemical and/or technical processes. Existing methods thus either normalize on selected known reference genes that are invariant in expression across the experiment, assume that the majority of genes are invariant, or that the effects of up- and down-regulated genes cancel each other out during the normalization. Here, we present a novel method, moose(2) , which predicts invariant genes in silico through a dynamic programming (DP) scheme and applies a quadratic normalization based on this subset. The method allows for specifying a set of known or experimentally validated invariant genes, which guides the DP. We experimentally verified the predictions of this method in the bacterium Escherichia coli, and show how moose(2) is able to (i) estimate the expression value distances between RNA-seq samples, (ii) reduce the variation of expression values across all samples, and (iii) to subsequently reveal new functional groups of genes during the late stages of DNA damage. We further applied the method to three eukaryotic data sets, on which its performance compares favourably to other methods. The software is implemented in C++ and is publicly available from http://grabherr.github.io/moose2/. The proposed RNA-seq normalization method, moose(2) , is a valuable alternative to existing methods, with two major advantages: (i) in silico prediction of invariant genes provides a list of potential reference genes for downstream analyses, and (ii) non-linear artefacts in RNA-seq data are handled adequately to minimize variations between replicates.

X Demographics

X Demographics

The data shown below were collected from the profiles of 3 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 40 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 40 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 9 23%
Researcher 7 18%
Student > Bachelor 4 10%
Professor > Associate Professor 4 10%
Other 3 8%
Other 6 15%
Unknown 7 18%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 10 25%
Agricultural and Biological Sciences 10 25%
Computer Science 5 13%
Engineering 3 8%
Pharmacology, Toxicology and Pharmaceutical Science 1 3%
Other 2 5%
Unknown 9 23%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 5. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 29 June 2022.
All research outputs
#5,877,479
of 23,577,761 outputs
Outputs from BioData Mining
#117
of 315 outputs
Outputs of similar age
#90,179
of 316,724 outputs
Outputs of similar age from BioData Mining
#5
of 7 outputs
Altmetric has tracked 23,577,761 research outputs across all sources so far. This one has received more attention than most of these and is in the 74th percentile.
So far Altmetric has tracked 315 research outputs from this source. They typically receive more attention than average, with a mean Attention Score of 7.7. This one has gotten more attention than average, scoring higher than 62% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 316,724 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 71% of its contemporaries.
We're also able to compare this research output to 7 others from the same source and published within six weeks on either side of this one. This one has scored higher than 2 of them.