Solved – Explaining the difference between Pearson correlation and distance correlation

correlationdistancedistance-covariancepearson-r

This question and its answer might highlight my naivete regarding Brownian/distance correlation.

I'm using the difference between a matrix of distance correlations, as calculated by energy::dcor(), and absolute value Pearson correlations, as calculated by cor(), to highlight potential nonlinear dependencies introduced with a certain estimation technique.

In my resulting difference matrix, I have a handful of negative values indicating that the Pearson correlation is larger in magnitude than the distance correlation (range from -.07 to -.01).

First, is my approach adequate? If so, how do I explain why the Pearson correlations might be larger in magnitude than distance correlation?

Best Answer

Distance correlation or distance covariance is a measure of dependence between two paired random vectors of arbitrary, not necessarily equal dimension.

The population distance correlation coefficient is zero if and only if the random vectors are independent. Thus, distance correlation measures both linear and nonlinear association between two random variables or random vectors.

This is in contrast to Pearson's correlation, which can only detect linear association between two random variables.