Hypothesis Testing – Creating the Null Hypothesis Statement for Wilcoxon Signed-Rank Test

hypothesis testingwilcoxon-signed-rank

I performed a one-sided Wilcoxon signed-rank (WSR) test. I had a matched pair of continuous random variables, say $X_1, X_2, …, X_n$ and $Y_1, Y_2, …, Y_n$. For my system, it is safe to assume that under the null hypothesis, $X_i$ and $Y_i$ are identically distributed for all $i$. Hence, the difference, say $U_i = X_i – Y_i$ should have a symmetrical distribution around 0, i.e., under the null hypothesis. Also, $U_i$'s are independent. Hence, I believe that assumptions of Wilcoxon signed-rank test are satisfied. So, I performed a one-sided test to check if $U_i$ tend to be greater than 0.

I am writing a report on my analysis but I am a bit confused about how exactly to phrase the test. Following are a couple statements that I have considered.

  1. I performed Wilcoxon signed-rank test to test null hypothesis against one-sided alternative that population mean of $U_i>0$.

  2. I performed Wilcoxon signed-rank test to test null hypothesis against one-sided alternative that population median of $U_i>0$.

I have noticed that WSR null is generally reported in terms of median. For one, I am not clear on why that is. Second, my understanding is that since $U_i$'s are symmetrically distributed about 0 under the null hypothesis, mean and median should be the same. So perhaps, it should be ok to describe the test in terms of mean. If it is so, I'd prefer to report in terms of mean because a lot of my readers might not know WSR. If they read a statement in terms of mean, they might find it easy to relate to as simply an alternative to two sample t-test.

If you have a suggestion for a better way of reporting the test, please let me know.

I have read several answers over Stackexchange but could not find exactly what I was looking for. Also, I read a paper (Li and Johnson, 2014, Pharmaceutical Statistics; DOI: 10.1002/pst.1628) that describes two different null hypothesis – one for "test of median" and another for "test of symmetry" with the name of Wilcoxon signed-rank test.

I simply used wilcox.test function in R as wilcox.test(X, Y, alternative = "greater", paired = T). How do I know if my test was a "test of median" or a "test of symmetry"?

Best Answer

This answer has been revised after being accepted, as I did not adequately appreciate Wilcoxon's critique of the sign test to extend the null hypothesis. I address the difference between the revised and previous answer at the end

The Wilcoxon sign rank test has these null and alternative hypotheses (see Snedecor, G. W. and Cochran, W. G. (1989) Statistical Methods, 8th edition. Iowa State University Press: Ames, IA.):

$\text{H}_{0}$: The magnitude of paired differences are symmetrically distributed about zero.

$\text{H}_{\text{A}}$: The magnitude of paired differences are either not symmetrically distributed, or are not distributed about zero, or both.

(The symmetric distribution about zero rapidly approaches a normal distribution as the sample size increases. See Belera, 2010.)

Many introductory texts motivate the signed rank test as a test of median difference, or more rarely in my experience, mean difference, without mentioning that two fairly strict assumptions are required for this interpretation:

  1. The distribution of both groups must have the same shape.

  2. The variance of both groups must be equal.

If both these assumptions are true, then the signed rank test can validly be interpreted as having a null hypothesis of equal medians (or equal means).

References
Bellera, C. A., Julien, M., and Hanley, J. A. (2010). Normal approximations to the distributions of the Wilcoxon statistics: Accurate to what $n$? Graphical insights. Journal of Statistics Education, 18(2):1–17.

Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics Bulletin, 1(6):80–83.


Motivation for my revised answer

My previously accepted answer was that the null and alternative hypotheses were in paired observations:

$$H_0:P(X_A>X_B)=0.5;H_A:P(X_A>X_B)≠0.5$$

These are null and alternative hypotheses about relative stochastic size (sometimes zeroth-order stochastic dominance). In plain language the null hypothesis is that the probability of a random observation from group $A$ exceeding the paired observation drawn from group $B$ is one half (i.e. a random observation in group $A$ has just as much probability of being greater than, as being less than its paired observation in group $B$).

In plain language the alternative hypothesis is that this probability is not one half (i.e. one of the groups is more likely to be greater than the other than less than the other).

But the sign rank can be false due to asymmetry (because of the magnitude of rank differences are larger in one direction than the other), so that even when $P(X_A>X_B)=0.5$ we reject the null if the magnitudes when $X_{A}>X_{B}$ are, say, much larger than when $X_{A}<X_{B}$. Magnitude of rank difference is therefore why the sign rank test must incorporate symmetry into the null as in my revision. My thanks to @SalMangiafico for his patient tutelage.