Solved – Minimizing the median absolute deviation or median absolute error

medianregressionrobustterminology

The median of a vector $\vec x$ is a scalar $a$ minimizing the mean of $|\vec x – a|$. Analogously, when quantile regression is used to estimate medians, it tries to minimize the mean of the absolute residuals.

But suppose we consider the median of the absolute deviations rather than their mean (or sum). The median of $\vec x$ need not minimize the median of $|\vec x – a|$. Building a regression model that tries to minimize the median of the absolute residuals has a certain intuitive appeal, in that median absolute error has a natural interpretation as a distance around true values that predictions are as likely as not to fall within.

This leads me to wonder:

Is there a name for the value $a$ that minimizes the median of $|\vec x – a|$? What about a regression model minimizing the median absolute residual?
Are there any better algorithms for calculating this value than using a generic function-minimizing routine like R's optim? How about algorithms for fitting this sort of regression model?

Best Answer

The shortest half is the shortest interval containing half the distribution or data (when dealing with populations or samples respectively). [Some authors call this interval of the shortest half the shorth, though the term seems to have been coined by Andrews et al (1972) who used it to refer to the mean of the observations in the shortest half, so it would more properly refer to that. Probably best to just explicitly say shortest half and mean of the shortest half to avoid that potential confusion]

The midpoint of the shortest half should minimize the median of the absolute deviations; you sometimes see it called "the midpoint of the shortest half", but it has another name (see below).

This is a one-dimensional version of a minimum volume estimator.

Because quantiles are equivariant to monotonic-increasing transformation, in one dimension we can see that minimizing the median of the absolute deviations is equivalent to minimizing the median of the squared deviations [or any other monotonic increasing function of them -- at least if we keep our definition of medians as interval-valued when they don't fall exactly at observations, otherwise they'll differ slightly but always lie between the same observations].

So the literature on least median of squares (LMS) estimation will probably be of some use to you here. e.g. see Rousseeuw & Leroy, 1987 [1], for example

There's often explicit code for LMS estimators (especially for regression, but if you only fit an intercept ... you should get the original thing you asked about) and sometimes code for producing estimators based on the shortest half (e.g. Nick Cox seems to have written one for Stata, for example)

So the alternative name I referred to earlier would be the "least median of squares estimate of location".

Sorry both terms seem to be such a mouthful; off the top of my head I don't know any reasonably unambiguous names that are shorter.

[1] Rousseeuw, P.J. and Leroy, A.M. (1987),
Robust Regression and Outlier Detection,
Wiley, New York.

Related Solutions

Solved – Finding a linear regression model that minimized percentage error in R

Easy. There are actually several choices, first is to take the logarithm of the y-axis values, that converts multiplication into addition thus it converts relative error into absolute error.

Second choice, that you didn't ask for, would be to do the regression minimizing a different norm. That is, usually one minimizes $||model-y_{data}||$, where $||.||$ is the norm, A.K.A. the L2 norm, A.K.A. the absolute value of a vector difference, A.K.A. the square root of the sum of squares of the difference. To do this for proportional modeling one minimizes $||\frac{model}{y_{data}}-1||$.

Solved – Quantile Regression vs OLS for homoscedasticity

Will the estimated slope coefficient $\beta_1$ always be the same for OLS and for QR for different quantiles?

No, of course not, because the empirical loss function being minimized differs in these different cases (OLS vs. QR for different quantiles).

I am well aware that in the presence of homoscedasticity all the slope parameters for different quantile regressions will be the same and that the QR models will differ only in the intercept.

No, not in finite samples. Here is an example taken from the help files of the quantreg package in R:

    library(quantreg)
    data(stackloss)
    rq(stack.loss ~ stack.x,tau=0.50) #median (l1) regression fit 
                                      # for the stackloss data.
    rq(stack.loss ~ stack.x,tau=0.25) #the 1st quartile

However, asymptotically they will all converge to the same true value.

But in the case of homoscedasticity, shouldn't outliers cancel each other out because positive errors are as likely as negative ones, rendering OLS and median QR slope coefficient equivalent?

No. First, perfect symmetry of errors is not guaranteed in any finite sample. Second, minimizing the sum of squares vs. absolute values will in general lead to different values even for symmetric errors.

Best Answer

Related Solutions

Solved – Finding a linear regression model that minimized percentage error in R

Solved – Quantile Regression vs OLS for homoscedasticity

Related Question