Report for: PubMed-supported clinical term weighting approach for improving inter-patient similarity measure in diagnosis prediction

Title	PubMed-supported clinical term weighting approach for improving inter-patient similarity measure in diagnosis prediction
Published in	BMC Medical Informatics and Decision Making, June 2015
DOI	10.1186/s12911-015-0166-2
Pubmed ID	26032596
Authors	Lawrence WC Chan, Ying Liu, Tao Chan, Helen KW Law, SC Cesar Wong, Andy PH Yeung, KF Lo, SW Yeung, KY Kwok, William YL Chan, Thomas YH Lau, Chi-Ren Shyu
Abstract	Similarity-based retrieval of Electronic Health Records (EHRs) from large clinical information systems gives physicians evidence to support making diagnoses or referring suspected cases for examination. Clinical terms in EHRs represent high-level conceptual information, and a similarity measure based on these terms reflects the chance of inter-patient disease co-occurrence. The assumption that clinical terms are equally relevant to a disease is unrealistic and reduces prediction accuracy. Here we propose a term weighting approach supported by the PubMed search engine to address this issue. We collected and studied 112 abdominal computed tomography imaging examination reports from four hospitals in Hong Kong. Clinical terms, which are the image findings related to hepatocellular carcinoma (HCC), were extracted from the reports. Through two systematic PubMed search methods, the generic and specific term weightings were established by estimating the conditional probabilities of clinical terms given HCC. Each report was characterized by an ontological feature vector, giving 6216 vector pairs in total. We optimized the modified direction cosine (mDC) with respect to a regularization constant embedded into the feature vector. Equal, generic and specific term weighting approaches were applied to measure the similarity of each pair, and their performances in predicting inter-patient co-occurrence of HCC diagnoses were compared using Receiver Operating Characteristic (ROC) analysis. The areas under the ROC curves (AUROCs) of similarity scores based on equal, generic and specific term weighting approaches were 0.735, 0.728 and 0.743 respectively (p < 0.01). In comparison with equal term weighting, the performance was significantly improved by specific term weighting (p < 0.01) but not by generic term weighting.
The clinical terms "Dysplastic nodule", "nodule of liver" and "equal density (isodense) lesion" were found to be the top three image findings associated with HCC in PubMed. Our findings suggest that the optimized similarity measure with specific term weighting applied to EHRs can significantly improve the accuracy of predicting the inter-patient co-occurrence of diagnosis when compared with equal and generic term weighting approaches.
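
As a rough illustration of the pipeline the abstract describes, the sketch below scores pairs of binary clinical-term vectors with a term-weighted cosine and evaluates the scores with an AUROC. It is not the authors' method: the abstract does not define the modified direction cosine (mDC) or its regularization constant, so a plain weighted cosine stands in for it. The function names `weighted_cosine` and `auroc`, the weights, and the toy vectors are all hypothetical, as is the choice to label a pair 1 when both reports carry the diagnosis.

```python
import math

def weighted_cosine(u, v, w):
    """Cosine of u and v under a weighted inner product (weights w >= 0).

    Assumption: a stand-in for the paper's mDC, which is not specified here.
    """
    dot = sum(wi * ui * vi for wi, ui, vi in zip(w, u, v))
    norm_u = math.sqrt(sum(wi * ui * ui for wi, ui in zip(w, u)))
    norm_v = math.sqrt(sum(wi * vi * vi for wi, vi in zip(w, v)))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # a report with no weighted terms is similar to nothing
    return dot / (norm_u * norm_v)

def auroc(scores, labels):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# 112 reports give C(112, 2) unordered pairs, matching the abstract's 6216.
n_pairs = math.comb(112, 2)

# Hypothetical binary term vectors (term present = 1) and made-up weights.
report_a = [1, 1, 0]
report_b = [1, 0, 1]
equal_w = [1.0, 1.0, 1.0]
specific_w = [0.5, 0.3, 0.2]  # illustrative only, not values from the paper

sim_equal = weighted_cosine(report_a, report_b, equal_w)
sim_specific = weighted_cosine(report_a, report_b, specific_w)
print(n_pairs, round(sim_equal, 3), round(sim_specific, 3))
```

With equal weights the toy pair shares one of two terms per report; raising the weight of the shared term increases the similarity, which is the effect a term weighting scheme is meant to exploit.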

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output.

Geographical breakdown

Country	Count	As %
United States	1	50%
India	1	50%

Demographic breakdown

Type	Count	As %
Members of the public	1	50%
Practitioners (doctors, other healthcare professionals)	1	50%

Mendeley readers

The data shown below were compiled from readership statistics for 47 Mendeley readers of this research output.

Geographical breakdown

Country	Count	As %
Brazil	1	2%
Unknown	46	98%

Demographic breakdown

Readers by professional status	Count	As %
Student > Ph. D. Student	8	17%
Researcher	8	17%
Other	4	9%
Professor > Associate Professor	4	9%
Professor	3	6%
Other	10	21%
Unknown	10	21%

Readers by discipline	Count	As %
Computer Science	15	32%
Medicine and Dentistry	7	15%
Social Sciences	3	6%
Agricultural and Biological Sciences	2	4%
Engineering	2	4%
Other	5	11%
Unknown	13	28%

Attention Score in Context

This research output has an Altmetric Attention Score of 5. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 17 May 2022.

Compared with	Rank	Out of
All research outputs	#6,147,146	22,808,725 outputs
Outputs from BMC Medical Informatics and Decision Making	#559	1,988 outputs
Outputs of similar age	#72,571	267,792 outputs
Outputs of similar age from BMC Medical Informatics and Decision Making	#10	39 outputs

Altmetric has tracked 22,808,725 research outputs across all sources so far. This one has received more attention than most of these and is in the 72nd percentile.

So far Altmetric has tracked 1,988 research outputs from this source. They receive a mean Attention Score of 4.9. This one has gotten more attention than average, scoring higher than 71% of its peers.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 267,792 tracked outputs that were published within six weeks on either side of this one in any source. This one has gotten more attention than average, scoring higher than 72% of its contemporaries.

We're also able to compare this research output to 39 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 74% of its contemporaries.
