↓ Skip to main content

A framework for ontology-based question answering with application to parasite immunology

Overview of attention for article published in Journal of Biomedical Semantics, July 2015
Altmetric Badge

About this Attention Score

  • Good Attention Score compared to outputs of the same age (69th percentile)

Mentioned by

twitter
5 X users

Citations

dimensions_citation
22 Dimensions

Readers on

mendeley
70 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
A framework for ontology-based question answering with application to parasite immunology
Published in
Journal of Biomedical Semantics, July 2015
DOI 10.1186/s13326-015-0029-x
Pubmed ID
Authors

Amir H. Asiaee, Todd Minning, Prashant Doshi, Rick L. Tarleton

Abstract

Large quantities of biomedical data are being produced at a rapid pace for a variety of organisms. With ontologies proliferating, data is increasingly being stored using the RDF data model and queried using RDF based querying languages. While existing systems facilitate the querying in various ways, the scientist must map the question in his or her mind to the interface used by the systems. The field of natural language processing has long investigated the challenges of designing natural language based retrieval systems. Recent efforts seek to bring the ability to pose natural language questions to RDF data querying systems while leveraging the associated ontologies. These analyze the input question and extract triples (subject, relationship, object), if possible, mapping them to RDF triples in the data. However, in the biomedical context, relationships between entities are not always explicit in the question and these are often complex involving many intermediate concepts. We present a new framework, OntoNLQA, for querying RDF data annotated using ontologies which allows posing questions in natural language. OntoNLQA offers five steps in order to answer natural language questions. In comparison to previous systems, OntoNLQA differs in how some of the methods are realized. In particular, it introduces a novel approach for discovering the sophisticated semantic associations that may exist between the key terms of a natural language question, in order to build an intuitive query and retrieve precise answers. We apply this framework to the context of parasite immunology data, leading to a system called AskCuebee that allows parasitologists to pose genomic, proteomic and pathway questions in natural language related to the parasite, Trypanosoma cruzi. We separately evaluate the accuracy of each component of OntoNLQA as implemented in AskCuebee and the accuracy of the whole system. AskCuebee answers 68 % of the questions in a corpus of 125 questions, and 60 % of the questions in a new previously unseen corpus. If we allow simple corrections by the scientists, this proportion increases to 92 %. We introduce a novel framework for question answering and apply it to parasite immunology data. Evaluations of translating the questions to RDF triple queries by combining machine learning, lexical similarity matching with ontology classes, properties and instances for specificity, and discovering associations between them demonstrate that the approach performs well and improves on previous systems. Subsequently, OntoNLQA offers a viable framework for building question answering systems in other biomedical domains.

X Demographics

X Demographics

The data shown below were collected from the profiles of 5 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 70 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Brazil 1 1%
Unknown 69 99%

Demographic breakdown

Readers by professional status Count As %
Student > Master 12 17%
Researcher 11 16%
Student > Ph. D. Student 11 16%
Student > Bachelor 6 9%
Student > Doctoral Student 4 6%
Other 12 17%
Unknown 14 20%
Readers by discipline Count As %
Computer Science 35 50%
Agricultural and Biological Sciences 5 7%
Engineering 4 6%
Immunology and Microbiology 2 3%
Social Sciences 2 3%
Other 4 6%
Unknown 18 26%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 4. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 06 October 2017.
All research outputs
#6,898,570
of 22,817,213 outputs
Outputs from Journal of Biomedical Semantics
#130
of 364 outputs
Outputs of similar age
#72,293
of 234,778 outputs
Outputs of similar age from Journal of Biomedical Semantics
#1
of 4 outputs
Altmetric has tracked 22,817,213 research outputs across all sources so far. This one has received more attention than most of these and is in the 69th percentile.
So far Altmetric has tracked 364 research outputs from this source. They receive a mean Attention Score of 4.6. This one has gotten more attention than average, scoring higher than 64% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 234,778 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 69% of its contemporaries.
We're also able to compare this research output to 4 others from the same source and published within six weeks on either side of this one. This one has scored higher than all of them