Negative Binomial Distribution – Comparing with Binomial Distribution

binomial distributioncategorical datadata miningnegative-binomial-distribution

What is the difference between the negative binomial distribution and the binomial distribution?

I tried reading online, and I found that the negative binomial distribution is used when data points are discrete, but I think even the binomial distribution can be used for discrete data points.

Best Answer

The difference is what we are interested in. Both distributions are built from independent Bernoulli trials with fixed probability of success, p.

With the Binomial distribution, the random variable X is the number of successes observed in n trials. Because there are a fixed number of trials, the possible values of X are 0, 1, ..., n.

With the Negative Binomial distribution, the random variable Y is the number of trials until observed the r th success is observed. In this case, we keep increasing the number of trials until we reach r successes. The possible values of Y are r, r+1, r+2, ... with no upper bound. The Negative Binomial can also be defined in terms of the number of failures until the r th success, instead of the number of trials until the r th success. Wikipedia defines the Negative Binomial distribution in this manner.

So to summarize:

Binomial:

  • Fixed number of trials (n)
  • Fixed probability of success (p)
  • Random variable is X = Number of successes.
  • Possible values are 0 ≤ Xn

Negative Binomial:

  • Fixed number of successes (r)
  • Fixed probability of success (p)
  • Random variable is Y = Number of trials until the r th success.
  • Possible values are rY

Thanks to Ben Bolker for reminding me to mention the support of the two distributions. He answered a related question here.