Machine Learning – How to Detect Discernible Structure in Scatter Plots

correlation, machine learning, scatterplot

I have a large number of relationships to check, and they may or may not be linear (as in the bottom row of the image below). Is there a method that can reject the hypothesis that a scatter plot shows only white noise? I want to check for the presence of any relationship without knowing what form it may take. Which method or algorithm for measuring deviation from white noise captures the widest range of deviation types?

[Image omitted. Source: https://en.wikipedia.org/wiki/Correlations]

Best Answer

The maximal information coefficient (MIC) is one method that has been used for this. "In statistics, the maximal information coefficient (MIC) is a measure of the strength of the linear or non-linear association between two variables X and Y."

D. Reshef et al., "Detecting Novel Associations in Large Data Sets", Science (2011).

http://en.wikipedia.org/wiki/Maximal_information_coefficient
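To make the idea concrete, here is a rough sketch in plain Python. It is not the MIC algorithm from the paper, which searches over grid partitions far more carefully. It is a simplified stand-in that takes the largest normalised mutual information over a few equal-frequency grids and then uses a permutation test to ask whether that score could plausibly come from unrelated data. All function names, the `max_bins` limit, and the number of permutations are illustrative assumptions.

```python
import math
import random
from collections import Counter


def equal_frequency_bins(values, n_bins):
    """Label each value with a rank-based bin (ties are split arbitrarily)."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    labels = [0] * n
    for rank, idx in enumerate(order):
        labels[idx] = rank * n_bins // n
    return labels


def mutual_information(bx, by):
    """Mutual information in nats between two discrete label sequences."""
    n = len(bx)
    joint = Counter(zip(bx, by))
    px, py = Counter(bx), Counter(by)
    return sum((c / n) * math.log(c * n / (px[i] * py[j]))
               for (i, j), c in joint.items())


def grid_dependence_score(x, y, max_bins=6):
    """Largest MI over small grids, each divided by log(min(bins_x, bins_y)).

    Simplified stand-in for MIC (assumption): only equal-frequency grids
    are tried, with no optimisation of the cut points.
    """
    x_bins = {k: equal_frequency_bins(x, k) for k in range(2, max_bins + 1)}
    y_bins = {k: equal_frequency_bins(y, k) for k in range(2, max_bins + 1)}
    best = 0.0
    for kx, bx in x_bins.items():
        for ky, by in y_bins.items():
            score = mutual_information(bx, by) / math.log(min(kx, ky))
            best = max(best, score)
    return best


def permutation_p_value(x, y, n_permutations=200, seed=0):
    """Return (observed score, p-value) under the null of no relationship.

    Shuffling y breaks any pairing with x, so the shuffled scores show
    what pure noise looks like for this sample size.
    """
    rng = random.Random(seed)
    observed = grid_dependence_score(x, y)
    shuffled = list(y)
    at_least_as_large = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if grid_dependence_score(x, shuffled) >= observed:
            at_least_as_large += 1
    return observed, (at_least_as_large + 1) / (n_permutations + 1)


def noisy_circle(n, seed=0, noise=0.05):
    """Hypothetical test data: points on a circle, a non-linear relationship."""
    rng = random.Random(seed)
    angles = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]
    x = [math.cos(t) + rng.gauss(0.0, noise) for t in angles]
    y = [math.sin(t) + rng.gauss(0.0, noise) for t in angles]
    return x, y


if __name__ == "__main__":
    x, y = noisy_circle(300, seed=1)
    score, p = permutation_p_value(x, y)
    print(f"score={score:.3f}, permutation p-value={p:.4f}")
```

A small p-value suggests the plot is unlikely to be white noise, whatever shape the relationship has. When checking many pairs this way, the p-values would still need a multiple-testing correction.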
