Regression – When to Look at the P Value of the Slope vs. P Value of a Correlation Test

linearp-valueregression

This question was edited to clairify my question, for the old question see the edit log
I found this about regression and correlation:

Regression is different from correlation because it try to put
variables into equation and thus explain causal relationship between
them, for example the most simple linear equation is written : Y=aX+b

Based on the infromation mentioned here:

However such results do not allow any causal explanation of the effect
of x on y, indeed x could act on y in various way that are not always
direct, all we can say from the correlation is that these two
variables are linked somehow, to really explain and measure causal
effect of x on y we need to use regression method, which will come
next.

If we plot data and it shows a clear linear trend we can test if this linear trend is significant using a correlation test (I assume), if this is the case we can apply a linear model to this and then inspect the p value of the slope to determine if the slope is probably the same as we could expect in our popultion.

I'm not sure if the above assumption is correct so I'm wondering what the p value of the correlation test tells us and what the P value of the slope tells us?

Best Answer

There are different questions in this question. Neither correlation nor linear regression can prove causal relationship. But in your mind and in the model, the correlation is not directed but regression is. There is no difference in correlation, whether you think one value is the reason for the other whereas the formulation of a linear regression modell usually implies a direction. At least with ordinary least squares, it is not the same, whether you write $Y = aX+b$ or $X = cY+d$. However $cor(X,Y) = cor(Y,X)$.

Correlation and linear regression are familar, but the link is the $R^2$ value which results from linear regression and is indeed the square of the correlation coefficient $r$. You have not mentioned $R^2$ in your post so maybe this will help to get a better understanding.

The p-value mainly tells you, whether you sampled a large enough sample to conclude, which sign the correlation coefficient a and the regression coefficient r have.