Solved – Does SPSS rescale dendrograms

clusteringdendrogramrspss

A colleague and I have been clustering some data in SPSS (v19) and R (2.15), respectively. Using the same distance metric and agglomeration method, we get identical merge orders/agglomeration schedules in both programs, and the dendrograms have very similar shapes, but the actual height values are quite different. On this particular data set, the R dendrogram is about 150 units tall, but the SPSS dendrogram is only 25 units tall. This is somewhat $\ldots$ unsettling, obviously.

While thumbing through some manuscripts and webpages, I noticed that all but one of the SPSS-derived dendrograms were also exactly 25 units tall (example 1, 2, 3; we also have a pretty hefty pile of papers that are unfortunately all pay-walled, including the single counter-example).

The caption on the SPSS output says something about rescaling, but the documentation is oddly silent about if, how, and why SPSS might be rescaling the dendrograms.

Could someone please confirm that SPSS does rescales dendrograms (and rescales them onto [0,25])? For extra credit, is there a way to turn this rescaling off? It seems to cause SPSS to cut our dendrogram a few levels above the leaves.

Best Answer

Judging by examples on the web, it does rescale them.

I'm not sure changing the scaling would help with your problem of it cutting levels though.

Edit / addition (drawing on research from the OP)

The documentation here states (clicking on the dash-underlined "dendrogram"):

"The dendrogram rescales the actual distances to numbers between 0 and 25, preserving the ratio of the distances between steps."