Probability – How to Derive the 37-Percent Rule for Dating

combinatoricsprobabilityrecreational-mathematics

I am trying to prove the theoretical "37-percent rule" for dating. The setup, if I remember correctly, is this. Suppose that you will meet exactly $N$ potential mates in your life, and you will meet them one at a time, in a perfectly random order. The potential mates rank from best to worst (in a total ordering), and you want to maximize the probability that you end up with the best one. However, you can only tell how good the mates are relative to each other, so while you can fully rank the people you've already met, you can't say anything about the ones you have yet to meet. Also, for each potential mate, you can either stay with them forever or leave forever, i.e. there is no divorce or post-breakup dating.

The result I have heard, and which I am trying to prove, is that your best strategy is to wait and reject the first 37% of them ($1/e$, to be precise), and then marry the next one that is better than all you have previously met. The $1/e$ number presumably arises as the limit as $N \to \infty$.

Obviously, you should never marry someone who isn't strictly better than all the previous ones, because then your chances of picking the right one are $0$. Also, given a strategy that you wait through the first $K$ partners and then marry the next one that is the best so far, I calculate your expected chances of succeeding as
\begin{equation}
\frac{\displaystyle \sum_{M = K}^{N-1} \frac{M – 1 \choose K-1}{N – M}}{N \choose K}
\end{equation}

(Let $M$ be the maximum value among the first $K$ people you meet, where $1$ is the value of the worst person, $2$ is the next, and so on, with your desired mate having value $N$. Given $M$, your chances of winning are $\frac{1}{N – M}$, because the value $N$ must be the first to appear out of the highest $N – M$ values. The chances of the maximum being exactly $M$ are ${M – 1\choose K – 1}/{N \choose K}$.)

(The above formula doesn't technically work for $K = 0$, but the reasonable convention ${-1 \choose -1} = 1$ gives the desired value $\frac1N$.)

The two things I am unable to prove, and which I would like to see ideas for, are:

  1. Given that you never pick someone unless they are the best so far, how do you prove further that the best strategy must involve waiting for some $K$ people and then going for anyone else after that $K$?

  2. Why is the formula above optomized at $K = N / e$, and how could one show this?

Best Answer

The decision to marry or not need only be made when the current result is better than all previous results (otherwise the current choice is definitely suboptimal). The choice, at point $K$ to marry or not (provided the $K$th date is better than all before) does not depend on the previous dates in anay ways (aprt from them being worse than the $K$th) because all relatiev orders are equally likely. Therefore, any optimal strategy must consist of probabilities $p_k$ for $1\le k\le N$ such that when the $k$th date is better than all before, we marry (and stop) with probability $p_k$ and continue otherwise. Let $q_k$ be the probability of picking the best provided we picked none of the first $k$. Then the probability of picking the best provided we picked none of the first $k-1$ is $$ q_{k-1}=\frac{\frac1Np_k+\frac{N-k}{N}\frac1k(1-p_k)q_k+\frac{N-k}{N}(1-\frac1k)q_k}{P(\text{none of the first $k-1$ selected})}=\alpha_kp_k+\beta_k$$ ($k$th is best and we pick, or best is after $k$ and $k$ is best of first and we don't pick it, or best is after $k$ and $k$ is not best of first) and hence is clearly maximized at $p_k=0$ ($\alpha<0$) or $p_k=1$ ($\alpha>0$) or in a rare possibility $p_k$ does not matter at all ($\alpha=0$) and can wlog. be selected to be $0$ or $1$.

You can make yourself clear that haveing $p_k=1$ and $p_{k+1}=0$ cannot be optimal.

Related Question