Solved – Using MLE vs. OLS

least squaresmaximum likelihoodregression

When is it preferable to use Maximum Likelihood Estimation instead of Ordinary Least Squares? What are the strengths and limitations of each? I am trying to gather practical knowledge on where to use each in common situations.

Best Answer

As explained here, OLS is just a particular instance of MLE. Here is closely related question, with a derivation of OLS in terms of MLE.

The conditional distribution corresponds to your noise model (for OLS: Gaussian and the same distribution for all inputs). There are other options (t-Student to deal with outliers, or allow the noise distribution to depend on the input)