Solved – use z-scores to compare data measured in different units

correlationstandardization

I am trying to compare the accuracy of two different methods of estimating biomass. This data is in the form of percentage cover (and is arcsine transformed). I also have the real biomass values, measured in grams, so three different measurements of the same thing, one of which (for the purpose of the study) I'm assuming provides the correct values, the other two are of unknown accuracy. What's the best way to approach this?

I've tried two separate correlation analyses, and then used Fisher's r-to-z transformation to compare the correlation coefficients.

I was wondering if it's possible to calculate an 'accuracy' value by standardising all the scores, then taking the absolute value of the difference between my (standardised) % cover data points and the (standardised) biomass value in grams. This would be useful because then I could see whether the accuracy changes over time, for example. But does it make sense?

I'm a statistics novice and could really do with some tips! I'm using SPSS.

enter image description here

enter image description here

Best Answer

If I understand this correctly, you have

$B$. Measured values of biomass in grams, which you regard as your best measurement.

$C_1$ and $C_2$. Two measures of percent cover, which you regard as proxies for biomass.

The main question is surely the relationships between $B$ on the one hand and $C_1$ and $C_2$ on the other. As $B$ and $C_1$, $C_2$ are in quite different units of measurement, you can only assess accuracy by establishing what functions best approximate $B$ as predicted from $C_1$ and $C_2$ respectively.

Calculating a correlation is pertinent to the extent that the relationships are approximately linear, and my guess would be to doubt that, especially as $B$ is unbounded (no mathematical limit on its upper values) whereas the $C$s are bounded. I'd expect some kind of nonlinear relation here.

Transforming percents or correlations are secondary details; it may be that arcsine transformations (by which you may well mean "arcsine of square root" as that is a common transformation here) help with nonlinearity, or it may not be. Nor is it clear that any standardisation will help at all: standardisation of each variable separately makes sense if there are linear relationships of the form $\hat B = a + bC$, but if so you are better off working directly with those relationships.

To give good advice we need to see plots of $B$ vs $C_1$ and $B$ vs $C_2$; there is, in my view, no correct method for this problem that can be given in abstraction. Listings of the data would be most helpful if they can be provided easily.