Probability: Random stick breaking vs non random stick breaking

probability

If I broke a stick into n parts randomly, and broke another same length stick into n parts, with each part being equal.

If I pick a stick from both piles which is expected to be longer.

I couldn't figure out the basic to start, I can say on a average the length of a part on first stick is (1/N), and answer seems to be both will be same but I just don't feel right about it

Best Answer

If I broke a stick into n parts randomly

Start from there. What is the distribution function of the length of the sticks? What you need is some formal way to express what is the probability that each part of the stick has some length.

The answer is: It depends on how you split the stick.


EDIT: I made a mistake in one of the calculations. The answer is still "it depends" but the two examples I picked happen to yield the same result (shame on me). I'll cross out the parts that are invalid and add a third case where the results are actually different.

The possibilities are countless; here are my favorite two:

  1. You pick $N-1$ points on the stick independently with uniform probability. The case where any two are the same has probability 0, so I'll assume they are distinct. This determines $N$ parts.
  2. You pick $N$ real numbers to represent the length of each part (that their sum is the length of the stick) so all parts are treated equally. To formalize this we get a $(N-1)$-simplex (think the equivalent of a triangle in $N-1$ dimensions) as the space of possibilities. Since this is a bounded figure, it is possible to take a uniform density function on it.
  3. (New!) Start by splitting the stick in a random point picked with uniform probability. Set apart the first piece and keep going $N-1$ times.

Case 1 looks way nicer, doesn't it? Unfortunately it has the unfortunate effect that the length of the $N$ pieces do not have the same distribution: the ones in the middle are more likely to be shorter than the ones at the edges. EDIT: It so happens that this approach brings the same result in terms of probablity distributions for the length of the pieces as Case 2. The proof is not trivial though.

To visualize that think at $N=3$. Your choices for the two cuts can be represented with points on a square (each coordinate tells you where one cut is made). Compare the areas of the regions on that square where each individual piece is longer that $\frac13$ and multiply that for the probability of you picking that piece (I'll assume that is uniform). You will notice that the analysis for the two edge pieces will be completely symmetrical while the middle piece requires a special treatment.

In the end the probability that the stick you pick is longer than $\frac13$ is: $$\frac13*\frac49+\frac13*\frac49+\frac13*\frac49=\frac49$$

Case 2 has the added complexity of higher-dimensional geometry but it is symmetrical and it is thus easier to obtain a general formula. A bit of geometry will tell you that the probability of any piece to be longer than $\frac1N$ is: $$\left(\frac{N-1}N\right)^{N-1}$$ I do not include the computations because they are very easy to make for those that know high-dimension geometry (just use similitude) but too long to fit in an answer otherwise.

Case 3 is totally different. The first piece is likelier to be longer since it is taken out of the whole stick (average length $\frac12$) and subsequent parts are expected to keep getting shorter as they are taken out of a progressively smaller part (average length halves each step).


Naturally you can split the stick in lots of other ways. You could say you pick apart one piece at a time for $N$ times but split the longer half on the next step. You can be trying to keep it "fair" by being "close" to $\frac1N$? How hard are you trying to be "fair"? How good are you at it? The possibilities are endless.

Related Question