Japanese Journal of Biometrics
Online ISSN : 2185-6494
Print ISSN : 0918-4430
ISSN-L : 0918-4430
Preliminary Report
Comparison between Pearson Correlation Coefficient and Mutual Information as a Similarity Measure of Gene Expression Profiles
Daisuke HoryuTakeshi Hayashi
Author information
JOURNAL FREE ACCESS

2013 Volume 33 Issue 2 Pages 125-143

Details
Abstract
Definition of similarity is required for clustering co-expressed genes or estimating gene regulatory network from gene expression data. Pearson correlation coefficient and mutual information are the popular measures to evaluate similarity between gene expression profiles. To investigate which measure is appropriate for evaluating similarity between gene expression profiles, we have compared these two measures using Gene ontology annotation similarity. Genes that have similar Gene ontology annotations can be interpreted that they have commonality in biological processes or molecular functions. The results showed that the better similarity measure is different depending on the purpose of the analysis or from which organism the data derived. In the case of evaluating similarities among more than three genes, mutual information was a better similarity measure for the data derived from multicellular organisms, though Pearson correlation coefficient was a better similarity measure for the data derived from unicellular organisms. In the case of finding genes whose transcripts have similar functions or genes that participate to similar processes, Pearson correlation coefficient was always a better measure.
Content from these authors
© 2013 The Biometric Society of Japan
Previous article Next article
feedback
Top