Title |
A review of machine learning methods to predict the solubility of overexpressed recombinant proteins in Escherichia coli
|
---|---|
Published in |
BMC Bioinformatics, May 2014
|
DOI | 10.1186/1471-2105-15-134 |
Pubmed ID | |
Authors |
Narjeskhatoon Habibi, Siti Z Mohd Hashim, Alireza Norouzi, Mohammed Razip Samian |
Abstract |
Over the last 20 years in biotechnology, the production of recombinant proteins has been a crucial bioprocess in both biopharmaceutical and research arena in terms of human health, scientific impact and economic volume. Although logical strategies of genetic engineering have been established, protein overexpression is still an art. In particular, heterologous expression is often hindered by low level of production and frequent fail due to opaque reasons. The problem is accentuated because there is no generic solution available to enhance heterologous overexpression. For a given protein, the extent of its solubility can indicate the quality of its function. Over 30% of synthesized proteins are not soluble. In certain experimental circumstances, including temperature, expression host, etc., protein solubility is a feature eventually defined by its sequence. Until now, numerous methods based on machine learning are proposed to predict the solubility of protein merely from its amino acid sequence. In spite of the 20 years of research on the matter, no comprehensive review is available on the published methods. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Norway | 1 | 50% |
Unknown | 1 | 50% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 1 | 50% |
Scientists | 1 | 50% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Korea, Republic of | 1 | <1% |
Czechia | 1 | <1% |
Switzerland | 1 | <1% |
Unknown | 149 | 98% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 38 | 25% |
Researcher | 30 | 20% |
Student > Master | 11 | 7% |
Student > Bachelor | 10 | 7% |
Student > Doctoral Student | 7 | 5% |
Other | 17 | 11% |
Unknown | 39 | 26% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 37 | 24% |
Biochemistry, Genetics and Molecular Biology | 34 | 22% |
Computer Science | 10 | 7% |
Chemistry | 8 | 5% |
Chemical Engineering | 6 | 4% |
Other | 16 | 11% |
Unknown | 41 | 27% |