↓ Skip to main content

Fuzzy association rule mining and classification for the prediction of malaria in South Korea

Overview of attention for article published in BMC Medical Informatics and Decision Making, June 2015
Altmetric Badge

About this Attention Score

  • Average Attention Score compared to outputs of the same age
  • Average Attention Score compared to outputs of the same age and source

Mentioned by

twitter
2 X users

Readers on

mendeley
115 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Fuzzy association rule mining and classification for the prediction of malaria in South Korea
Published in
BMC Medical Informatics and Decision Making, June 2015
DOI 10.1186/s12911-015-0170-6
Pubmed ID
Authors

Anna L. Buczak, Benjamin Baugher, Erhan Guven, Liane C. Ramac-Thomas, Yevgeniy Elbert, Steven M. Babin, Sheri H. Lewis

Abstract

Malaria is the world's most prevalent vector-borne disease. Accurate prediction of malaria outbreaks may lead to public health interventions that mitigate disease morbidity and mortality. We describe an application of a method for creating prediction models utilizing Fuzzy Association Rule Mining to extract relationships between epidemiological, meteorological, climatic, and socio-economic data from Korea. These relationships are in the form of rules, from which the best set of rules is automatically chosen and forms a classifier. Two classifiers have been built and their results fused to become a malaria prediction model. Future malaria cases are predicted as LOW, MEDIUM or HIGH, where these classes are defined as a total of 0-2, 3-16, and above 17 cases, respectively, for a region in South Korea during a two-week period. Based on user recommendations, HIGH is considered an outbreak. Model accuracy is described by Positive Predictive Value (PPV), Sensitivity, and F-score for each class, computed on test data not previously used to develop the model. For predictions made 7-8 weeks in advance, model PPV and Sensitivity are 0.842 and 0.681, respectively, for the HIGH classes. The F0.5 and F3 scores (which combine PPV and Sensitivity) are 0.804 and 0.694, respectively, for the HIGH classes. The overall FARM results (as measured by F-scores) are significantly better than those obtained by Decision Tree, Random Forest, Support Vector Machine, and Holt-Winters methods for the HIGH class. For the MEDIUM class, Random Forest and FARM obtain comparable results, with FARM being better at F0.5, and Random Forest obtaining a higher F3. A previously described method for creating disease prediction models has been modified and extended to build models for predicting malaria. In addition, some new input variables were used, including indicators of intervention measures. The South Korea malaria prediction models predict LOW, MEDIUM or HIGH cases 7-8 weeks in the future. This paper demonstrates that our data driven approach can be used for the prediction of different diseases.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 115 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Indonesia 1 <1%
United States 1 <1%
India 1 <1%
Canada 1 <1%
Unknown 111 97%

Demographic breakdown

Readers by professional status Count As %
Student > Master 25 22%
Student > Ph. D. Student 11 10%
Researcher 9 8%
Student > Bachelor 8 7%
Professor > Associate Professor 6 5%
Other 26 23%
Unknown 30 26%
Readers by discipline Count As %
Computer Science 30 26%
Medicine and Dentistry 13 11%
Engineering 13 11%
Nursing and Health Professions 4 3%
Biochemistry, Genetics and Molecular Biology 4 3%
Other 17 15%
Unknown 34 30%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 2. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 21 June 2015.
All research outputs
#14,229,946
of 22,813,792 outputs
Outputs from BMC Medical Informatics and Decision Making
#1,101
of 1,988 outputs
Outputs of similar age
#135,893
of 264,477 outputs
Outputs of similar age from BMC Medical Informatics and Decision Making
#21
of 40 outputs
Altmetric has tracked 22,813,792 research outputs across all sources so far. This one is in the 35th percentile – i.e., 35% of other outputs scored the same or lower than it.
So far Altmetric has tracked 1,988 research outputs from this source. They receive a mean Attention Score of 4.9. This one is in the 38th percentile – i.e., 38% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 264,477 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 45th percentile – i.e., 45% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 40 others from the same source and published within six weeks on either side of this one. This one is in the 37th percentile – i.e., 37% of its contemporaries scored the same or lower than it.