Title |
Non-specific filtering of beta-distributed data
|
---|---|
Published in |
BMC Bioinformatics, June 2014
|
DOI | 10.1186/1471-2105-15-199 |
Pubmed ID | |
Authors |
Xinhui Wang, Peter W Laird, Toshinori Hinoue, Susan Groshen, Kimberly D Siegmund |
Abstract |
Non-specific feature selection is a dimension reduction procedure performed prior to cluster analysis of high dimensional molecular data. Not all measured features are expected to show biological variation, so only the most varying are selected for analysis. In DNA methylation studies, DNA methylation is measured as a proportion, bounded between 0 and 1, with variance a function of the mean. Filtering on standard deviation biases the selection of probes to those with mean values near 0.5. We explore the effect this has on clustering, and develop alternate filter methods that utilize a variance stabilizing transformation for Beta distributed data and do not share this bias. |
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
Unknown | 1 | 100% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 1 | 100% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 1 | 3% |
United States | 1 | 3% |
Sweden | 1 | 3% |
Unknown | 29 | 91% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 11 | 34% |
Student > Ph. D. Student | 9 | 28% |
Student > Master | 3 | 9% |
Student > Doctoral Student | 2 | 6% |
Professor > Associate Professor | 2 | 6% |
Other | 3 | 9% |
Unknown | 2 | 6% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 14 | 44% |
Computer Science | 4 | 13% |
Medicine and Dentistry | 3 | 9% |
Nursing and Health Professions | 2 | 6% |
Biochemistry, Genetics and Molecular Biology | 2 | 6% |
Other | 5 | 16% |
Unknown | 2 | 6% |