↓ Skip to main content

Dynamic association rules for gene expression data analysis

Overview of attention for article published in BMC Genomics, October 2015
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (80th percentile)
  • High Attention Score compared to outputs of the same age and source (87th percentile)

Mentioned by

blogs
1 blog
twitter
2 X users

Citations

dimensions_citation
21 Dimensions

Readers on

mendeley
58 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Dynamic association rules for gene expression data analysis
Published in
BMC Genomics, October 2015
DOI 10.1186/s12864-015-1970-x
Pubmed ID
Authors

Shu-Chuan Chen, Tsung-Hsien Tsai, Cheng-Han Chung, Wen-Hsiung Li

Abstract

The purpose of gene expression analysis is to look for the association between regulation of gene expression levels and phenotypic variations. This association based on gene expression profile has been used to determine whether the induction/repression of genes correspond to phenotypic variations including cell regulations, clinical diagnoses and drug development. Statistical analyses on microarray data have been developed to resolve gene selection issue. However, these methods do not inform us of causality between genes and phenotypes. In this paper, we propose the dynamic association rule algorithm (DAR algorithm) which helps ones to efficiently select a subset of significant genes for subsequent analysis. The DAR algorithm is based on association rules from market basket analysis in marketing. We first propose a statistical way, based on constructing a one-sided confidence interval and hypothesis testing, to determine if an association rule is meaningful. Based on the proposed statistical method, we then developed the DAR algorithm for gene expression data analysis. The method was applied to analyze four microarray datasets and one Next Generation Sequencing (NGS) dataset: the Mice Apo A1 dataset, the whole genome expression dataset of mouse embryonic stem cells, expression profiling of the bone marrow of Leukemia patients, Microarray Quality Control (MAQC) data set and the RNA-seq dataset of a mouse genomic imprinting study. A comparison of the proposed method with the t-test on the expression profiling of the bone marrow of Leukemia patients was conducted. We developed a statistical way, based on the concept of confidence interval, to determine the minimum support and minimum confidence for mining association relationships among items. With the minimum support and minimum confidence, one can find significant rules in one single step. The DAR algorithm was then developed for gene expression data analysis. Four gene expression datasets showed that the proposed DAR algorithm not only was able to identify a set of differentially expressed genes that largely agreed with that of other methods, but also provided an efficient and accurate way to find influential genes of a disease. In the paper, the well-established association rule mining technique from marketing has been successfully modified to determine the minimum support and minimum confidence based on the concept of confidence interval and hypothesis testing. It can be applied to gene expression data to mine significant association rules between gene regulation and phenotype. The proposed DAR algorithm provides an efficient way to find influential genes that underlie the phenotypic variance.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 58 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United Kingdom 1 2%
Spain 1 2%
United States 1 2%
Taiwan 1 2%
Unknown 54 93%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 13 22%
Student > Master 8 14%
Researcher 7 12%
Student > Bachelor 4 7%
Professor > Associate Professor 3 5%
Other 8 14%
Unknown 15 26%
Readers by discipline Count As %
Agricultural and Biological Sciences 15 26%
Biochemistry, Genetics and Molecular Biology 12 21%
Computer Science 7 12%
Sports and Recreations 2 3%
Nursing and Health Professions 1 2%
Other 4 7%
Unknown 17 29%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 8. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 19 October 2015.
All research outputs
#3,987,380
of 22,830,751 outputs
Outputs from BMC Genomics
#1,625
of 10,655 outputs
Outputs of similar age
#53,974
of 279,403 outputs
Outputs of similar age from BMC Genomics
#45
of 373 outputs
Altmetric has tracked 22,830,751 research outputs across all sources so far. Compared to these this one has done well and is in the 82nd percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 10,655 research outputs from this source. They receive a mean Attention Score of 4.7. This one has done well, scoring higher than 84% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 279,403 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 80% of its contemporaries.
We're also able to compare this research output to 373 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 87% of its contemporaries.