Expected number of pairs with Hamming distance $d$ for a sample of $k$ random bit strings of length $n$

binomial-coefficientscombinatoricsinformation theoryprobabilitystatistics

Say we were to uniformly sample $k$ times from a bit string with length $n$. What is the expected number of pairs with a Hamming distance $d$? In the limit of Hamming distance 0, I realize this becomes a birthday collision problem, but I am interested in the general case.

For example, say we generate 1000 random 10-bit strings, what is the expected number of strings with a Hamming distance of 3?

I found several related posts: Probability that a set of 'N' random binary strings are all at least a certain Hamming distance 'k' apart , birthday problem near misses , and Average number of strings with edit distance exactly 2
but I could not quite make the additional leap to answer my similar but not exactly the same question.

Thanks in advance for your help!

Best Answer

Actually this seems quite easy

Let $A^d_{i,j}$ be the event that two strings $s_i$ $s_j$ have distance $d$. Then, if both $s_i,s_j$ are random, we have $$P(A^d_{i,j})= \binom{n}{d} 2^{-n}$$

Then, if we sample $k$ random strings, the expected number of pairs with such distance is

$$ \frac{k(k-1)}{2} \binom{n}{d} 2^{-n}$$