Title |
'Genome order index' should not be used for defining compositional constraints in nucleotide sequences - a case study of the Z-curve
|
---|---|
Published in |
Biology Direct, February 2010
|
DOI | 10.1186/1745-6150-5-10 |
Pubmed ID | |
Authors |
Eran Elhaik, Dan Graur, Krešimir Josić |
Abstract |
The Z-curve is a three dimensional representation of DNA sequences proposed over a decade ago and has been extensively applied to sequence segmentation, horizontal gene transfer detection, and sequence analysis. Based on the Z-curve, a "genome order index," was proposed, which is defined as S = a2+ c2+t2+g2, where a, c, t, and g are the nucleotide frequencies of A, C, T, and G, respectively. This index was found to be smaller than 1/3 for almost all tested genomes, which was taken as support for the existence of a constraint on genome composition. A geometric explanation for this constraint has been suggested. Each genome was represented by a point P whose distance from the four faces of a regular tetrahedron was given by the frequencies a, c, t, and g. They claimed that an inscribed sphere of radius r = 1/ square root 3 contains almost all points corresponding to various genomes, implying that S <r2. The distribution of the points P obtained by S was studied using the Z-curve. |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 2 | 15% |
Spain | 1 | 8% |
France | 1 | 8% |
Unknown | 9 | 69% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 4 | 31% |
Professor > Associate Professor | 3 | 23% |
Student > Ph. D. Student | 2 | 15% |
Professor | 1 | 8% |
Librarian | 1 | 8% |
Other | 2 | 15% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 8 | 62% |
Biochemistry, Genetics and Molecular Biology | 2 | 15% |
Mathematics | 1 | 8% |
Social Sciences | 1 | 8% |
Medicine and Dentistry | 1 | 8% |
Other | 0 | 0% |