Title |
Prediction of Drosophila melanogaster gene function using Support Vector Machines
|
---|---|
Published in |
BioData Mining, April 2013
|
DOI | 10.1186/1756-0381-6-8 |
Pubmed ID | |
Authors |
Nicholas Mitsakakis, Zak Razak, Michael Escobar, J Timothy Westwood |
Abstract |
While the genomes of hundreds of organisms have been sequenced and good approaches exist for finding protein encoding genes, an important remaining challenge is predicting the functions of the large fraction of genes for which there is no annotation. Large gene expression datasets from microarray experiments already exist and many of these can be used to help assign potential functions to these genes. We have applied Support Vector Machines (SVM), a sigmoid fitting function and a stratified cross-validation approach to analyze a large microarray experiment dataset from Drosophila melanogaster in order to predict possible functions for previously un-annotated genes. A total of approximately 5043 different genes, or about one-third of the predicted genes in the D. melanogaster genome, are represented in the dataset and 1854 (or 37%) of these genes are un-annotated. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
China | 1 | 100% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 1 | 100% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 1 | 3% |
India | 1 | 3% |
Slovenia | 1 | 3% |
Unknown | 27 | 90% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Master | 6 | 20% |
Researcher | 5 | 17% |
Student > Ph. D. Student | 5 | 17% |
Lecturer > Senior Lecturer | 3 | 10% |
Student > Bachelor | 2 | 7% |
Other | 7 | 23% |
Unknown | 2 | 7% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 14 | 47% |
Biochemistry, Genetics and Molecular Biology | 4 | 13% |
Computer Science | 3 | 10% |
Engineering | 3 | 10% |
Medicine and Dentistry | 2 | 7% |
Other | 2 | 7% |
Unknown | 2 | 7% |