Solved – Normalize histogram with different bin width

histogram, normalization

I know how to normalize a histogram (so that the area = 1) when all bins have the same width, but how do I do it when the bins have different widths? Any ideas?

Best Answer

The principle of a histogram is that for bins that touch (i.e. no overlaps or gaps), bar height should be proportional to (bin frequency)/(bin width). If bin width is constant, then it can be left out of the calculation, because dividing every bar by the same number does not change the relative heights.

It's conventional to present histograms as showing frequencies, percents, or proportions, and that's all possible and legitimate with constant bin width.

With varying bin width, however, the choices include (a) probability density, (b) frequency density, and (c) frequencies scaled to some standard bin width (which is really (b) with possibly differing magnitudes).
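To make the three options concrete, here is a minimal sketch in plain Python. The function name `bar_heights`, the `standard_width` parameter, and the example counts and bin edges are all made up for illustration; they are not from the Stata FAQ or any particular package.

```python
def bar_heights(counts, edges, standard_width=1.0):
    """Return three alternative bar heights for bins of unequal width.

    counts: frequency in each bin; edges: bin boundaries (one more than counts).
    standard_width is a hypothetical reference width used for option (c).
    """
    if len(edges) != len(counts) + 1:
        raise ValueError("need exactly one more edge than counts")
    widths = [hi - lo for lo, hi in zip(edges[:-1], edges[1:])]
    if any(w <= 0 for w in widths):
        raise ValueError("edges must be strictly increasing")
    total = sum(counts)
    if total == 0:
        raise ValueError("counts must not all be zero")

    # (b) frequency density: frequency per unit of x
    frequency_density = [c / w for c, w in zip(counts, widths)]
    # (a) probability density: frequency density divided by total frequency
    probability_density = [fd / total for fd in frequency_density]
    # (c) frequency per standard bin width: (b) rescaled by a constant
    scaled_frequency = [fd * standard_width for fd in frequency_density]
    return probability_density, frequency_density, scaled_frequency


# Made-up example: four bins of widths 1, 1, 3 and 5
counts = [10, 30, 40, 20]
edges = [0, 1, 2, 5, 10]
prob_density, freq_density, scaled_freq = bar_heights(counts, edges, standard_width=5.0)
```

All three lists have the same shape when plotted; only the scale of the vertical axis differs.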

The principles transcend software choice, but more discussion, and some references, can be found in this Stata FAQ.

Varying bin widths have no impact on the principle that the total area of the bars represents the total frequency or a total probability of 1. If you draw the individual bars correctly, that will be satisfied: for area = 1, each bar's height is its bin frequency divided by (total frequency × bin width).
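As a quick check of the area claim, the following sketch bins some raw values into unequal bins and sums height × width. The helper name `density_histogram`, the half-open bin convention, and the data are assumptions chosen for the example; values outside the outer edges are simply ignored here.

```python
from bisect import bisect_right


def density_histogram(data, edges):
    """Bin data into half-open bins [edges[i], edges[i+1]) and return
    (heights, widths), with heights scaled so the total bar area is 1.
    Values outside [edges[0], edges[-1]) are ignored (an assumption of this sketch).
    """
    widths = [hi - lo for lo, hi in zip(edges[:-1], edges[1:])]
    counts = [0] * len(widths)
    for x in data:
        if edges[0] <= x < edges[-1]:
            counts[bisect_right(edges, x) - 1] += 1
    n = sum(counts)
    if n == 0:
        raise ValueError("no data fell inside the bins")
    # height = frequency / (total frequency * bin width)
    heights = [c / (n * w) for c, w in zip(counts, widths)]
    return heights, widths


# Made-up data and bin edges with widths 1, 1, 3 and 5
data = [0.2, 0.7, 1.1, 1.5, 1.9, 2.5, 4.0, 6.0, 8.5, 9.9]
edges = [0, 1, 2, 5, 10]
heights, widths = density_histogram(data, edges)

# Sum of bar areas; equals 1 up to floating-point rounding
total_area = sum(h * w for h, w in zip(heights, widths))
```

The widths cancel in each product, so the areas reduce to the bin proportions, which always sum to 1 however the edges are chosen.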