Solved – distribution of errors in simple linear regression

error, regression

I just started learning about simple linear regression, and I have a question about one of its assumptions.

One of the assumptions is that the errors are normally distributed.
Does this mean that if I compute $y-\hat{y}$ for every point, those values should have a mound-shaped distribution?

Best Answer

The assumption* relates to the errors rather than the residuals, but if the assumption is satisfied, you would expect the residuals to look close to normal.
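To make the errors-versus-residuals distinction concrete, here is a minimal simulation sketch. The true model (intercept 2, slope 0.5, unit error variance), the sample size, and all variable names are assumptions chosen purely for illustration. In real data the errors are unobservable; they are only available here because the data are simulated.

```python
import random
import statistics

random.seed(0)

# Hypothetical true model (an assumption for this sketch):
# y = 2 + 0.5 * x + error, with error ~ Normal(0, 1)
n = 200
x = [random.uniform(0, 10) for _ in range(n)]
errors = [random.gauss(0, 1) for _ in range(n)]
y = [2.0 + 0.5 * xi + ei for xi, ei in zip(x, errors)]

# Ordinary least squares estimates for a single predictor
x_bar = statistics.fmean(x)
y_bar = statistics.fmean(y)
sxx = sum((xi - x_bar) ** 2 for xi in x)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
slope = sxy / sxx
intercept = y_bar - slope * x_bar

# Residuals use the *fitted* line; errors use the *true* line
residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]

max_gap = max(abs(r - e) for r, e in zip(residuals, errors))
print(f"fitted intercept = {intercept:.3f}, slope = {slope:.3f}")
print(f"sum of residuals = {sum(residuals):.2e}")
print(f"largest |residual - error| = {max_gap:.3f}")
```

The residuals sum to zero (up to rounding) because the fit includes an intercept, which is a constraint the errors do not obey. They still track the errors closely, which is why approximately normal errors lead to approximately normal-looking residuals.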

While widely used by people who use a few particular pieces of software, histograms are a very blunt diagnostic tool for assessing normality. I tend to use Q-Q plots for that purpose, while keeping in mind that no model is perfect (it's more about how much impact the non-normality might have).
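As a rough sketch of what a normal Q-Q plot computes, the snippet below pairs the ordered values with standard normal quantiles. The helper names `normal_qq_points` and `qq_correlation` are hypothetical, the plotting positions $(i - 0.5)/n$ are one common convention among several, and the two samples are simulated stand-ins for residuals rather than real data. The correlation is only a crude summary of straightness; in practice you would plot the pairs and look at them.

```python
import random
from statistics import NormalDist, fmean


def normal_qq_points(values):
    """Return (theoretical quantile, ordered sample value) pairs."""
    ordered = sorted(values)
    n = len(ordered)
    std_normal = NormalDist()  # standard normal: mean 0, sd 1
    # Plotting positions (i - 0.5) / n: an assumed, commonly used convention
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(theoretical, ordered))


def qq_correlation(values):
    """Pearson correlation of the Q-Q points; near 1 means nearly straight."""
    points = normal_qq_points(values)
    t = [p[0] for p in points]
    s = [p[1] for p in points]
    t_bar, s_bar = fmean(t), fmean(s)
    cov = sum((a - t_bar) * (b - s_bar) for a, b in zip(t, s))
    var_t = sum((a - t_bar) ** 2 for a in t)
    var_s = sum((b - s_bar) ** 2 for b in s)
    return cov / (var_t * var_s) ** 0.5


random.seed(1)
# Simulated stand-ins for residuals: one normal, one right-skewed
normal_like = [random.gauss(0, 1) for _ in range(300)]
skewed = [random.expovariate(1.0) - 1.0 for _ in range(300)]

r_normal = qq_correlation(normal_like)
r_skewed = qq_correlation(skewed)
print(f"Q-Q correlation, normal-like sample: {r_normal:.4f}")
print(f"Q-Q correlation, skewed sample:      {r_skewed:.4f}")
```

Passing the pairs from `normal_qq_points` to any plotting library gives the Q-Q plot itself: points close to a straight line suggest approximate normality, while systematic curvature (as in the skewed sample) suggests a departure from it.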

* see my comment in relation to when you use that assumption.
