↓ Skip to main content

Classification based on extensions of LS-PLS using logistic regression: application to clinical and multiple genomic data

Overview of attention for article published in BMC Bioinformatics, September 2018
Altmetric Badge

Citations

dimensions_citation
8 Dimensions

Readers on

mendeley
20 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Classification based on extensions of LS-PLS using logistic regression: application to clinical and multiple genomic data
Published in
BMC Bioinformatics, September 2018
DOI 10.1186/s12859-018-2311-2
Pubmed ID
Authors

Caroline Bazzoli, Sophie Lambert-Lacroix

Abstract

To address high-dimensional genomic data, most of the proposed prediction methods make use of genomic data alone without considering clinical data, which are often available and known to have predictive value. Recent studies suggest that combining clinical and genomic information may improve predictions. We consider here methods for classification purposes that simultaneously use both types of variables but apply dimensionality reduction only to the high-dimensional genomic ones. Using partial least squares (PLS), we propose some one-step approaches based on three extensions of the least squares (LS)-PLS method for logistic regression. A comparison of their prediction performances via a simulation and on real data sets from cancer studies is conducted. In general, those methods using only clinical data or only genomic data perform poorly. The advantage of using LS-PLS methods for classification and their performances are shown and then used to analyze clinical and genomic data. The corresponding prediction results are encouraging and stable regardless of the data set and/or number of selected features. These extensions have been implemented in the R package lsplsGlm to enhance their use.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 20 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 20 100%

Demographic breakdown

Readers by professional status Count As %
Other 4 20%
Student > Ph. D. Student 3 15%
Researcher 3 15%
Student > Bachelor 2 10%
Professor 2 10%
Other 4 20%
Unknown 2 10%
Readers by discipline Count As %
Biochemistry, Genetics and Molecular Biology 3 15%
Mathematics 3 15%
Pharmacology, Toxicology and Pharmaceutical Science 2 10%
Engineering 2 10%
Computer Science 1 5%
Other 6 30%
Unknown 3 15%