↓ Skip to main content

KnowLife: a versatile approach for constructing a large knowledge graph for biomedical sciences

Overview of attention for article published in BMC Bioinformatics, May 2015
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • Good Attention Score compared to outputs of the same age (79th percentile)
  • Good Attention Score compared to outputs of the same age and source (77th percentile)

Mentioned by

twitter
9 X users
patent
1 patent

Citations

dimensions_citation
122 Dimensions

Readers on

mendeley
182 Mendeley
citeulike
1 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
KnowLife: a versatile approach for constructing a large knowledge graph for biomedical sciences
Published in
BMC Bioinformatics, May 2015
DOI 10.1186/s12859-015-0549-5
Pubmed ID
Authors

Patrick Ernst, Amy Siu, Gerhard Weikum

Abstract

Biomedical knowledge bases (KB's) have become important assets in life sciences. Prior work on KB construction has three major limitations. First, most biomedical KBs are manually built and curated, and cannot keep up with the rate at which new findings are published. Second, for automatic information extraction (IE), the text genre of choice has been scientific publications, neglecting sources like health portals and online communities. Third, most prior work on IE has focused on the molecular level or chemogenomics only, like protein-protein interactions or gene-drug relationships, or solely address highly specific topics such as drug effects. We address these three limitations by a versatile and scalable approach to automatic KB construction. Using a small number of seed facts for distant supervision of pattern-based extraction, we harvest a huge number of facts in an automated manner without requiring any explicit training. We extend previous techniques for pattern-based IE with confidence statistics, and we combine this recall-oriented stage with logical reasoning for consistency constraint checking to achieve high precision. To our knowledge, this is the first method that uses consistency checking for biomedical relations. Our approach can be easily extended to incorporate additional relations and constraints. We ran extensive experiments not only for scientific publications, but also for encyclopedic health portals and online communities, creating different KB's based on different configurations. We assess the size and quality of each KB, in terms of number of facts and precision. The best configured KB, KnowLife, contains more than 500,000 facts at a precision of 93% for 13 relations covering genes, organs, diseases, symptoms, treatments, as well as environmental and lifestyle risk factors. KnowLife is a large knowledge base for health and life sciences, automatically constructed from different Web sources. As a unique feature, KnowLife is harvested from different text genres such as scientific publications, health portals, and online communities. Thus, it has the potential to serve as one-stop portal for a wide range of relations and use cases. To showcase the breadth and usefulness, we make the KnowLife KB accessible through the health portal ( http://knowlife.mpi-inf.mpg.de ).

X Demographics

X Demographics

The data shown below were collected from the profiles of 9 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 182 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Spain 2 1%
Japan 1 <1%
Germany 1 <1%
Brazil 1 <1%
Unknown 177 97%

Demographic breakdown

Readers by professional status Count As %
Researcher 38 21%
Student > Ph. D. Student 31 17%
Student > Master 18 10%
Student > Bachelor 11 6%
Student > Doctoral Student 7 4%
Other 25 14%
Unknown 52 29%
Readers by discipline Count As %
Computer Science 65 36%
Agricultural and Biological Sciences 16 9%
Medicine and Dentistry 9 5%
Biochemistry, Genetics and Molecular Biology 7 4%
Business, Management and Accounting 4 2%
Other 18 10%
Unknown 63 35%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 8. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 08 June 2023.
All research outputs
#4,663,504
of 24,938,276 outputs
Outputs from BMC Bioinformatics
#1,675
of 7,613 outputs
Outputs of similar age
#54,060
of 269,713 outputs
Outputs of similar age from BMC Bioinformatics
#26
of 117 outputs
Altmetric has tracked 24,938,276 research outputs across all sources so far. Compared to these this one has done well and is in the 81st percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 7,613 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.5. This one has done well, scoring higher than 77% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 269,713 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 79% of its contemporaries.
We're also able to compare this research output to 117 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 77% of its contemporaries.