Solved – Model fit is High but Ramsey RESET Test suggests omitted variables. What to do

biasmodel selection

I'm trying to figure out what next steps to take. I created a model and ran OLS on a very large sample of data (over 400000 observations) and got an R-squared value of 0.80. So the model fit seems really good. However, I ran a Ramsey RESET test and its test statistic strongly suggested that there were omitted variables. I'm not sure what do do next. Do I just throw away the model, saying that the estimates are biased. Do I keep adding terms until the RESET test no longer suggests omitted variables?

The model is intended to be predictive within and out of the sample and given the model fit to the observed data, is it still truly a good fit, despite there being omitted variables?

Thanks!

Best Answer

Ramsey RESET test is not about omitted variables, but about functional form. If your model is $y=\alpha+\beta x+\varepsilon$, the RESET test can say that $y=\alpha+\beta_1 x+\beta_2 x^2 +\varepsilon$ is a better model.

It is not unlikely that a RESET test rejects because of the omission of relevant variables, i.e. the inclusion of an additional variable may capture the nonlinearities, but you should try a different functional form at first.