↓ Skip to main content

Building a model for disease classification integration in oncology, an approach based on the national cancer institute thesaurus

Overview of attention for article published in Journal of Biomedical Semantics, February 2017
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age

Mentioned by

twitter
1 X user

Citations

dimensions_citation
13 Dimensions

Readers on

mendeley
19 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Building a model for disease classification integration in oncology, an approach based on the national cancer institute thesaurus
Published in
Journal of Biomedical Semantics, February 2017
DOI 10.1186/s13326-017-0114-4
Pubmed ID
Authors

Vianney Jouhet, Fleur Mougin, Bérénice Bréchat, Frantz Thiessard

Abstract

Identifying incident cancer cases within a population remains essential for scientific research in oncology. Data produced within electronic health records can be useful for this purpose. Due to the multiplicity of providers, heterogeneous terminologies such as ICD-10 and ICD-O-3 are used for oncology diagnosis recording purpose. To enable disease identification based on these diagnoses, there is a need for integrating disease classifications in oncology. Our aim was to build a model integrating concepts involved in two disease classifications, namely ICD-10 (diagnosis) and ICD-O-3 (topography and morphology), despite their structural heterogeneity. Based on the NCIt, a "derivative" model for linking diagnosis and topography-morphology combinations was defined and built. ICD-O-3 and ICD-10 codes were then used to instantiate classes of the "derivative" model. Links between terminologies obtained through the model were then compared to mappings provided by the Surveillance, Epidemiology, and End Results (SEER) program. The model integrated 42% of neoplasm ICD-10 codes (excluding metastasis), 98% of ICD-O-3 morphology codes (excluding metastasis) and 68% of ICD-O-3 topography codes. For every codes instantiating at least a class in the "derivative" model, comparison with SEER mappings reveals that all mappings were actually available in the model as a link between the corresponding codes. We have proposed a method to automatically build a model for integrating ICD-10 and ICD-O-3 based on the NCIt. The resulting "derivative" model is a machine understandable resource that enables an integrated view of these heterogeneous terminologies. The NCIt structure and the available relationships can help to bridge disease classifications taking into account their structural and granular heterogeneities. However, (i) inconsistencies exist within the NCIt leading to misclassifications in the "derivative" model, (ii) the "derivative" model only integrates a part of ICD-10 and ICD-O-3. The NCIt is not sufficient for integration purpose and further work based on other termino-ontological resources is needed in order to enrich the model and avoid identified inconsistencies.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 19 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 19 100%

Demographic breakdown

Readers by professional status Count As %
Researcher 6 32%
Other 2 11%
Student > Ph. D. Student 2 11%
Student > Master 2 11%
Professor 1 5%
Other 3 16%
Unknown 3 16%
Readers by discipline Count As %
Medicine and Dentistry 5 26%
Computer Science 3 16%
Agricultural and Biological Sciences 2 11%
Biochemistry, Genetics and Molecular Biology 2 11%
Pharmacology, Toxicology and Pharmaceutical Science 1 5%
Other 2 11%
Unknown 4 21%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 10 February 2017.
All research outputs
#15,443,875
of 22,953,506 outputs
Outputs from Journal of Biomedical Semantics
#238
of 364 outputs
Outputs of similar age
#256,183
of 420,233 outputs
Outputs of similar age from Journal of Biomedical Semantics
#9
of 13 outputs
Altmetric has tracked 22,953,506 research outputs across all sources so far. This one is in the 22nd percentile – i.e., 22% of other outputs scored the same or lower than it.
So far Altmetric has tracked 364 research outputs from this source. They receive a mean Attention Score of 4.6. This one is in the 21st percentile – i.e., 21% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 420,233 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 30th percentile – i.e., 30% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 13 others from the same source and published within six weeks on either side of this one. This one is in the 23rd percentile – i.e., 23% of its contemporaries scored the same or lower than it.