Title |
Linking genes to literature: text mining, information extraction, and retrieval applications for biology
|
---|---|
Published in |
Genome Biology, September 2008
|
DOI | 10.1186/gb-2008-9-s2-s8 |
Pubmed ID | |
Authors |
Martin Krallinger, Alfonso Valencia, Lynette Hirschman |
Abstract |
Efficient access to information contained in online scientific literature collections is essential for life science research, playing a crucial role from the initial stage of experiment planning to the final interpretation and communication of the results. The biological literature also constitutes the main information source for manual literature curation used by expert-curated databases. Following the increasing popularity of web-based applications for analyzing biological data, new text-mining and information extraction strategies are being implemented. These systems exploit existing regularities in natural language to extract biologically relevant information from electronic texts automatically. The aim of the BioCreative challenge is to promote the development of such tools and to provide insight into their performance. This review presents a general introduction to the main characteristics and applications of currently available text-mining systems for life sciences in terms of the following: the type of biological information demands being addressed; the level of information granularity of both user queries and results; and the features and methods commonly exploited by these applications. The current trend in biomedical text mining points toward an increasing diversification in terms of application types and techniques, together with integration of domain-specific resources such as ontologies. Additional descriptions of some of the systems discussed here are available on the internet http://zope.bioinfo.cnio.es/bionlp_tools/. |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 28 | 6% |
United Kingdom | 9 | 2% |
Germany | 6 | 1% |
Mexico | 5 | 1% |
Spain | 5 | 1% |
Belgium | 4 | <1% |
France | 3 | <1% |
Australia | 3 | <1% |
Switzerland | 3 | <1% |
Other | 23 | 5% |
Unknown | 376 | 81% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 98 | 21% |
Student > Ph. D. Student | 90 | 19% |
Student > Master | 77 | 17% |
Professor | 34 | 7% |
Professor > Associate Professor | 32 | 7% |
Other | 105 | 23% |
Unknown | 29 | 6% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 133 | 29% |
Computer Science | 109 | 23% |
Medicine and Dentistry | 34 | 7% |
Biochemistry, Genetics and Molecular Biology | 31 | 7% |
Social Sciences | 30 | 6% |
Other | 86 | 18% |
Unknown | 42 | 9% |