↓ Skip to main content

A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures

Overview of attention for article published in BMC Bioinformatics, September 2015
Altmetric Badge

Readers on

mendeley
7 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures
Published in
BMC Bioinformatics, September 2015
DOI 10.1186/s12859-015-0716-8
Pubmed ID
Authors

Veronika Weyer, Harald Binder

Abstract

High-dimensional molecular measurements, e.g. gene expression data, can be linked to clinical time-to-event endpoints by Cox regression models and regularized estimation approaches, such as componentwise boosting, and can incorporate a large number of covariates as well as provide variable selection. If there is heterogeneity due to known patient subgroups, a stratified Cox model allows for separate baseline hazards in each subgroup. Variable selection will still depend on the relative stratum sizes in the data, which might be a convenience sample and not representative for future applications. Such effects need to be systematically investigated and could even help to more reliably identify components of risk prediction signatures. Correspondingly, we propose a weighted regression approach based on componentwise likelihood-based boosting which is implemented in the R package CoxBoost ( https://github.com/binderh/CoxBoost ). This approach focuses on building a risk prediction signature for a specific stratum by down-weighting the observations from the other strata using a range of weights. Stability of selection for specific covariates as a function of the weights is investigated by resampling inclusion frequencies, and two types of corresponding visualizations are suggested. This is illustrated for two applications with methylation and gene expression measurements from cancer patients. The proposed approach is meant to point out components of risk prediction signatures that are specific to the stratum of interest and components that are also important to other strata. Performance is mostly improved by incorporating down-weighted information from the other strata. This suggests more general usefulness for risk prediction signature development in data with heterogeneity due to known subgroups.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 7 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 7 100%

Demographic breakdown

Readers by professional status Count As %
Researcher 2 29%
Student > Bachelor 1 14%
Student > Master 1 14%
Student > Postgraduate 1 14%
Student > Doctoral Student 1 14%
Other 0 0%
Unknown 1 14%
Readers by discipline Count As %
Medicine and Dentistry 2 29%
Computer Science 1 14%
Mathematics 1 14%
Physics and Astronomy 1 14%
Agricultural and Biological Sciences 1 14%
Other 0 0%
Unknown 1 14%