↓ Skip to main content

The impact of read length on quantification of differentially expressed genes and splice junction detection

Overview of attention for article published in Genome Biology, June 2015
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (95th percentile)
  • High Attention Score compared to outputs of the same age and source (83rd percentile)

Mentioned by

twitter
68 X users
facebook
1 Facebook page
wikipedia
3 Wikipedia pages
video
1 YouTube creator

Citations

dimensions_citation
101 Dimensions

Readers on

mendeley
383 Mendeley
citeulike
2 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
The impact of read length on quantification of differentially expressed genes and splice junction detection
Published in
Genome Biology, June 2015
DOI 10.1186/s13059-015-0697-y
Pubmed ID
Authors

Sagar Chhangawala, Gabe Rudy, Christopher E. Mason, Jeffrey A. Rosenfeld

Abstract

The initial next-generation sequencing technologies produced reads of 25 or 36 bp, and only from a single-end of the library sequence. Currently, it is possible to reliably produce 300 bp paired-end sequences for RNA expression analysis. While read lengths have consistently increased, people have assumed that longer reads are more informative and that paired-end reads produce better results than single-end reads. We used paired-end 101 bp reads and trimmed them to simulate different read lengths, and also separated the pairs to produce single-end reads. For each read-length and paired status, we evaluated differential expression levels between two standard samples and compared the results to those obtained by qPCR. We found that, with the exception of 25 bp reads, there is little difference for the detection of differential expression regardless of the read length. Once single-end reads are at a length of 50 bp, the results do not change substantially for any level up to, and including, 100 bp paired-end. However, splice junction detection significantly improves as the read length increases with 100 bp paired-end showing the best performance. We performed the same analysis on two ENCODE samples and found consistent results confirming that our conclusions have broad application. A researcher could save substantial resources by using 50 bp single-end reads for differential expression analysis instead of using longer reads. However, splicing detection is unquestionably improved by paired-end and longer reads. Therefore, an appropriate read length should be used based on the final goal of the study.

X Demographics

X Demographics

The data shown below were collected from the profiles of 68 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 383 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 5 1%
Germany 4 1%
United Kingdom 3 <1%
Norway 1 <1%
South Africa 1 <1%
Canada 1 <1%
Switzerland 1 <1%
Argentina 1 <1%
Luxembourg 1 <1%
Other 0 0%
Unknown 365 95%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 91 24%
Researcher 85 22%
Student > Master 53 14%
Student > Bachelor 42 11%
Student > Doctoral Student 15 4%
Other 43 11%
Unknown 54 14%
Readers by discipline Count As %
Agricultural and Biological Sciences 131 34%
Biochemistry, Genetics and Molecular Biology 111 29%
Computer Science 20 5%
Medicine and Dentistry 13 3%
Engineering 9 2%
Other 39 10%
Unknown 60 16%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 40. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 07 April 2023.
All research outputs
#1,044,424
of 25,779,988 outputs
Outputs from Genome Biology
#733
of 4,517 outputs
Outputs of similar age
#12,229
of 279,059 outputs
Outputs of similar age from Genome Biology
#11
of 67 outputs
Altmetric has tracked 25,779,988 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 95th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 4,517 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 27.5. This one has done well, scoring higher than 83% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 279,059 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 95% of its contemporaries.
We're also able to compare this research output to 67 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 83% of its contemporaries.