Ordinal Regression – What Is the Difference Between Ordinal Regression and Ranking?

ordinal-datarankingregression

In both ordinal regression and ranking you are learning from an ordered dependent variables, so my question is:

What is the difference in formulation (if any) between the ordinal regression problem and a learning to rank problem?

Best Answer

3 years after, I answer to my own question.

For me, the main difference is in what is the output of the models in the different problems. In ordinal regression, the task is to predict a label for a given sample, hence the output of a prediction is a label (as is the case for example in multiclass classification). On the other hand, in the problem of learning to rank, the output is an order of a sequence of samples. That is, a the output of a ranking model can be seen as a permutation that makes the samples to have labels as ordered as possible. Hence, unlike the ordinal regression model, the ranking algorithm is not able to predict a class label. Because of this, the input of a ranking model does not need to specify class labels, but only a partial order between the samples (see e.g. [0] for an application of this). In this sense, ranking is an easier problem than ordinal regression: from the numerical labels you can construct an order, but not necessarily the other way round.

This is better explained with an example. Suppose that we have the following pairs of (sample, label): $\{(x_1, 1), (x_2, 2), (x_3, 2)\}$. Given this input, a ranking model will predict an order of this sequence of samples. For example, for a ranking algorithms, the permutations $(1, 2, 3) \to (1, 2, 3)$ and $(1, 2, 3) \to (1, 3, 2)$ are predictions with perfect score since the labels of both sequences $\{(x_1, 1), (x_2, 2), (x_3, 2)\}$ and $\{(x_1, 1), (x_3, 2), (x_2, 2)\}$ are ordered. On the other hand, an ordinal regression would predict a label for each of the samples, and in this case the prediction (1, 2, 2) would give a perfect score, but not (1, 2, 3) or (1, 3, 2).

[0] Optimizing Search Engines using Clickthrough Data Thorsten Joachims

Related Solutions

Solved – Ordinal data in regression

Since your response is ordinal then you should use ordinal regression. At a very high level, the main difference ordinal regression and linear regression is that with linear regression the dependent variable is continuous and ordinal the dependent variable is ordinal.

Now you can usually use linear regression with an ordinal dependent variable but you will see that the diagnostic plots do not look good. When you say SPSS won't run the linear regression what do you mean? Are you getting an error?

Solved – Ordinal independent variables and ordinal regression method

1) You can either use the Order Logit regression or the Order probit regression.

I do not know whether this approach works in SPSS, but here there is a nice code for the Order Logit Regression in R.

library(MASS)
m <- polr(independentvar ~ var1 + var2 + var3, data = ghost291data, Hess=TRUE)

2) You get the following output:

A list of coefficients like for any regression
Two intercepts which indicate the differences between the different ordinal datas. You will get n-1 intercepts for n categories of the independent variables.

3) The Algorithm from the MASS package does the recoding for you.

Best Answer

Related Solutions

Solved – Ordinal data in regression

Solved – Ordinal independent variables and ordinal regression method

Related Question