Title |
Multi-model inference using mixed effects from a linear regression based genetic algorithm
|
---|---|
Published in |
BMC Bioinformatics, March 2014
|
DOI | 10.1186/1471-2105-15-88 |
Pubmed ID | |
Authors |
Koen Van der Borght, Geert Verbeke, Herman van Vlijmen |
Abstract |
Different high-dimensional regression methodologies exist for the selection of variables to predict a continuous variable. To improve the variable selection in case clustered observations are present in the training data, an extension towards mixed-effects modeling (MM) is requested, but may not always be straightforward to implement.In this article, we developed such a MM extension (GA-MM-MMI) for the automated variable selection by a linear regression based genetic algorithm (GA) using multi-model inference (MMI). We exemplify our approach by training a linear regression model for prediction of resistance to the integrase inhibitor Raltegravir (RAL) on a genotype-phenotype database, with many integrase mutations as candidate covariates. The genotype-phenotype pairs in this database were derived from a limited number of subjects, with presence of multiple data points from the same subject, and with an intra-class correlation of 0.92. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Norway | 1 | 50% |
Unknown | 1 | 50% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 1 | 50% |
Members of the public | 1 | 50% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Germany | 2 | 5% |
United States | 2 | 5% |
New Zealand | 1 | 2% |
Unknown | 38 | 88% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 13 | 30% |
Student > Ph. D. Student | 9 | 21% |
Student > Postgraduate | 4 | 9% |
Student > Master | 3 | 7% |
Professor | 2 | 5% |
Other | 5 | 12% |
Unknown | 7 | 16% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 7 | 16% |
Biochemistry, Genetics and Molecular Biology | 5 | 12% |
Computer Science | 4 | 9% |
Nursing and Health Professions | 3 | 7% |
Engineering | 3 | 7% |
Other | 10 | 23% |
Unknown | 11 | 26% |