Solved – Maximal Information Coefficient (MIC) in practical bioinformatics applications

bioinformaticscorrelationmicroarray

I am interested in Maximal Information Coefficient (MIC) as an alternative to Pearson correlation when looking at gene co-expression from microarray data. I've read some very good posts on this website on MIC. However‚ they have mostly dealt with the theory. The Science paper is now quite a few years old. So I wondered what the practical experience with MIC is for microarray data. Is it fast enough for big datasets with many features such as microarrays? What about multiple testing? Doesn't the method detect too many co-expressed genes? Does it work equally well for tissue-specific versus housekeeping genes? (One of the strongest features of PCC when looking at multiple tissues is that it works mostly for tissue-specific genes.)

I would be grateful for practical hints and/or pointers to published papers where MIC was used for microarray data.

Best Answer

You can read the MICtools paper by Albanese, et al.. MICtools is practical, general purpose, open-source software for maximal information coefficient analysis. I think you can run it on your microarray data without problems. For multiple testing correction, MICtools makes available the strategies implemented in the Python Statsmodels package and a Python implementation of the Storey q-value method.

Albanese, D., Riccadonna, S., Donati, C., & Franceschi, P. (2018). A practical tool for maximal information coefficient analysis. GigaScience, 7(4), 1–8.

Related Question