↓ Skip to main content

Gene expression prediction using low-rank matrix completion

Overview of attention for article published in BMC Bioinformatics, June 2016
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (88th percentile)
  • High Attention Score compared to outputs of the same age and source (93rd percentile)

Mentioned by

blogs
1 blog
twitter
9 X users

Citations

dimensions_citation
35 Dimensions

Readers on

mendeley
58 Mendeley
citeulike
1 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Gene expression prediction using low-rank matrix completion
Published in
BMC Bioinformatics, June 2016
DOI 10.1186/s12859-016-1106-6
Pubmed ID
Authors

Arnav Kapur, Kshitij Marwah, Gil Alterovitz

Abstract

An exponential growth of high-throughput biological information and data has occurred in the past decade, supported by technologies, such as microarrays and RNA-Seq. Most data generated using such methods are used to encode large amounts of rich information, and determine diagnostic and prognostic biomarkers. Although data storage costs have reduced, process of capturing data using aforementioned technologies is still expensive. Moreover, the time required for the assay, from sample preparation to raw value measurement is excessive (in the order of days). There is an opportunity to reduce both the cost and time for generating such expression datasets. We propose a framework in which complete gene expression values can be reliably predicted in-silico from partial measurements. This is achieved by modelling expression data as a low-rank matrix and then applying recently discovered techniques of matrix completion by using nonlinear convex optimisation. We evaluated prediction of gene expression data based on 133 studies, sourced from a combined total of 10,921 samples. It is shown that such datasets can be constructed with a low relative error even at high missing value rates (>50 %), and that such predicted datasets can be reliably used as surrogates for further analysis. This method has potentially far-reaching applications including how bio-medical data is sourced and generated, and transcriptomic prediction by optimisation. We show that gene expression data can be computationally constructed, thereby potentially reducing the costs of gene expression profiling. In conclusion, this method shows great promise of opening new avenues in research on low-rank matrix completion in biological sciences.

X Demographics

X Demographics

The data shown below were collected from the profiles of 9 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 58 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 58 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 17 29%
Researcher 14 24%
Student > Master 8 14%
Student > Bachelor 6 10%
Professor 2 3%
Other 5 9%
Unknown 6 10%
Readers by discipline Count As %
Computer Science 14 24%
Agricultural and Biological Sciences 10 17%
Biochemistry, Genetics and Molecular Biology 9 16%
Medicine and Dentistry 4 7%
Mathematics 3 5%
Other 13 22%
Unknown 5 9%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 14. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 20 July 2016.
All research outputs
#2,143,614
of 22,877,793 outputs
Outputs from BMC Bioinformatics
#579
of 7,298 outputs
Outputs of similar age
#41,152
of 352,647 outputs
Outputs of similar age from BMC Bioinformatics
#6
of 96 outputs
Altmetric has tracked 22,877,793 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 90th percentile: it's in the top 10% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 7,298 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one has done particularly well, scoring higher than 92% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 352,647 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 88% of its contemporaries.
We're also able to compare this research output to 96 others from the same source and published within six weeks on either side of this one. This one has done particularly well, scoring higher than 93% of its contemporaries.