↓ Skip to main content

Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression

Overview of attention for article published in BMC Genomics, June 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
4 X users

Citations

dimensions_citation
51 Dimensions

Readers on

mendeley
42 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Improved methods and resources for paramecium genomics: transcription units, gene annotation and gene expression
Published in
BMC Genomics, June 2017
DOI 10.1186/s12864-017-3887-z
Pubmed ID
Authors

Olivier Arnaiz, Erwin Van Dijk, Mireille Bétermier, Maoussi Lhuillier-Akakpo, Augustin de Vanssay, Sandra Duharcourt, Erika Sallet, Jérôme Gouzy, Linda Sperling

Abstract

The 15 sibling species of the Paramecium aurelia cryptic species complex emerged after a whole genome duplication that occurred tens of millions of years ago. Given extensive knowledge of the genetics and epigenetics of Paramecium acquired over the last century, this species complex offers a uniquely powerful system to investigate the consequences of whole genome duplication in a unicellular eukaryote as well as the genetic and epigenetic mechanisms that drive speciation. High quality Paramecium gene models are important for research using this system. The major aim of the work reported here was to build an improved gene annotation pipeline for the Paramecium lineage. We generated oriented RNA-Seq transcriptome data across the sexual process of autogamy for the model species Paramecium tetraurelia. We determined, for the first time in a ciliate, candidate P. tetraurelia transcription start sites using an adapted Cap-Seq protocol. We developed TrUC, multi-threaded Perl software that in conjunction with TopHat mapping of RNA-Seq data to a reference genome, predicts transcription units for the annotation pipeline. We used EuGene software to combine annotation evidence. The high quality gene structural annotations obtained for P. tetraurelia were used as evidence to improve published annotations for 3 other Paramecium species. The RNA-Seq data were also used for differential gene expression analysis, providing a gene expression atlas that is more sensitive than the previously established microarray resource. We have developed a gene annotation pipeline tailored for the compact genomes and tiny introns of Paramecium species. A novel component of this pipeline, TrUC, predicts transcription units using Cap-Seq and oriented RNA-Seq data. TrUC could prove useful beyond Paramecium, especially in the case of high gene density. Accurate predictions of 3' and 5' UTR will be particularly valuable for studies of gene expression (e.g. nucleosome positioning, identification of cis regulatory motifs). The P. tetraurelia improved transcriptome resource, gene annotations for P. tetraurelia, P. biaurelia, P. sexaurelia and P. caudatum, and Paramecium-trained EuGene configuration are available through ParameciumDB ( http://paramecium.i2bc.paris-saclay.fr ). TrUC software is freely distributed under a GNU GPL v3 licence ( https://github.com/oarnaiz/TrUC ).

X Demographics

X Demographics

The data shown below were collected from the profiles of 4 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 42 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 42 100%

Demographic breakdown

Readers by professional status Count As %
Researcher 9 21%
Student > Ph. D. Student 5 12%
Student > Bachelor 5 12%
Student > Master 5 12%
Student > Doctoral Student 3 7%
Other 7 17%
Unknown 8 19%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 14 33%
Agricultural and Biological Sciences 11 26%
Computer Science 4 10%
Engineering 2 5%
Environmental Science 1 2%
Other 2 5%
Unknown 8 19%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 28 June 2017.
All research outputs
#13,900,658
of 23,577,761 outputs
Outputs from BMC Genomics
#5,124
of 10,800 outputs
Outputs of similar age
#162,368
of 316,709 outputs
Outputs of similar age from BMC Genomics
#97
of 216 outputs
Altmetric has tracked 23,577,761 research outputs across all sources so far. This one is in the 39th percentile – i.e., 39% of other outputs scored the same or lower than it.
So far Altmetric has tracked 10,800 research outputs from this source. They receive a mean Attention Score of 4.7. This one is in the 49th percentile – i.e., 49% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 316,709 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 46th percentile – i.e., 46% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 216 others from the same source and published within six weeks on either side of this one. This one is in the 49th percentile – i.e., 49% of its contemporaries scored the same or lower than it.