Title | De novo construction of a “Gene-space” for diploid plant genome rich in repetitive sequences by an iterative Process of Extraction and Assembly of NGS reads (iPEA protocol) with limited computing resources |
---|---|
Published in | BMC Research Notes, February 2016 |
DOI | 10.1186/s13104-016-1903-z |
PubMed ID | |
Authors | Christelle Aluome, Grégoire Aubert, Susete Alves Carvalho, Marie-Christine Le Paslier, Judith Burstin, Dominique Brunel |
Abstract | The continuing increase in size and quality of the "short reads" raw data is a significant help for the quality of the assembly obtained through various bioinformatics tools. However, building a reference genome sequence for most plant species remains a significant challenge due to the large number of repeated sequences, which are problematic for a high-quality whole-genome de novo assembly. Furthermore, for most SNP identification approaches in plant genetics and breeding, only the "Gene-space" regions, including the promoter, exon and intron sequences, are considered. We developed the iPEA protocol to produce a de novo Gene-space assembly by reconstructing, in an iterative way, the non-coding sequence flanking the Unigene cDNA sequence through addition of next-generation DNA-seq data. The approach was elaborated with the large diploid genome of pea (Pisum sativum L.), rich in repetitive sequences. The final Gene-space assembly included 35,400 contigs (97 Mb), covering 88% of the 40,227 contigs (53.1 Mb) of the PsCam_low-copy Unigene set. Its accuracy was validated by the results of the built GenoPea 13.2 K SNP Array. The iPEA protocol allows the reconstruction of a Gene-space based on RNA-seq and DNA-seq data with limited computing resources. |
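
The iterative idea in the abstract (start from a cDNA seed, pull in the DNA reads that match it, assemble, and repeat so the contig grows into the flanking sequence) can be illustrated with a toy sketch. This is not the authors' pipeline: the function names (`kmers`, `extract_reads`, `extend_once`, `iterative_extend`), the k-mer matching, the greedy overlap extension, and the 17-base example sequence are all assumptions made for illustration. A real run would use dedicated read-mapping and assembly tools on actual NGS data.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def extract_reads(contig, reads, k):
    """Extraction step: keep reads that share at least one k-mer with the contig."""
    index = kmers(contig, k)
    return [r for r in reads if kmers(r, k) & index]


def extend_once(contig, reads, min_overlap):
    """Assembly step (toy): add the longest overhang found at each end.

    Assumes min_overlap >= 1. For each read, only its longest overlap
    with a contig end is considered.
    """
    best_left, best_right = "", ""
    for r in reads:
        longest = min(len(r) - 1, len(contig))
        for ov in range(longest, min_overlap - 1, -1):
            if contig.endswith(r[:ov]):       # read overlaps the right end
                if len(r) - ov > len(best_right):
                    best_right = r[ov:]
                break
        for ov in range(longest, min_overlap - 1, -1):
            if contig.startswith(r[-ov:]):    # read overlaps the left end
                if len(r) - ov > len(best_left):
                    best_left = r[:-ov]
                break
    return best_left + contig + best_right


def iterative_extend(seed, reads, k=5, min_overlap=5, max_rounds=10):
    """Alternate extraction and assembly until the contig stops growing."""
    contig = seed
    for _ in range(max_rounds):
        recruited = extract_reads(contig, reads, k)
        extended = extend_once(contig, recruited, min_overlap)
        if extended == contig:
            break
        contig = extended
    return contig


# Hypothetical toy data: a 17-base "genome" in which every 2-mer is unique,
# so any overlap of 3 or more bases is unambiguous.
genome = "AACAGATCCGCTGGTTA"
seed = "TCCGC"                      # stands in for a cDNA-derived seed
reads = [genome[i:i + 6] for i in range(len(genome) - 5)]
reads.append("GGGGGG")              # unrelated read, never recruited

result = iterative_extend(seed, reads, k=3, min_overlap=3)
print(result == genome)  # True
```

Only reads that match the current contig are touched in each round, which is the property that keeps memory needs low compared with assembling every read at once. The toy omits everything that makes real data hard, such as sequencing errors, repeats, and reverse-complement reads.
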
Twitter Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United Kingdom | 2 | 67% |
Germany | 1 | 33% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 2 | 67% |
Scientists | 1 | 33% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Netherlands | 1 | 7% |
United States | 1 | 7% |
France | 1 | 7% |
Unknown | 12 | 80% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 5 | 33% |
Student > Master | 2 | 13% |
Professor | 2 | 13% |
Unspecified | 1 | 7% |
Other | 1 | 7% |
Other | 2 | 13% |
Unknown | 2 | 13% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 8 | 53% |
Biochemistry, Genetics and Molecular Biology | 2 | 13% |
Computer Science | 2 | 13% |
Unspecified | 1 | 7% |
Unknown | 2 | 13% |