Solved – What to do when Kolmogorov-Smirnov test is significant for residuals of parametric test but skewness and kurtosis look normal

assumptionsdistributionsnormality-assumptionregressionresiduals

I have conducted a parametric test in a study, n=290. I want to assess whether the residuals of this test are normally distributed.

The skewness and kurtosis of the residuals are -0.017 and -0.438 respectively. I think this is considered as normal.
Unfortunately, the Kolmogorov-Smirnov of the residuals has a p-value of 0.021. I think this is considered not normal.

Question

What should I do when skewness and kurtosis look normal but Kolmogorov-Smirnov is significant?

Best Answer

In order to make sure that I can use parametric test, I need to make sure that my residual distribution is normal.

There is really no way to demonstrate that you have exact normality, but that's okay because approximate normality will generally be sufficient for hypothesis tests in regression to work the way you want.

However, when I refer to the value of skewness and kurtosis of the residual, it is -0.017 and -0.438 respectively, where i think this is considered as normal.

You can obtain values like that with residuals from a simple regression on normal data, but the kurtosis is just significant at the 5% level.

(Technical aside: I used simulation to assess the significance of the kurtosis of residuals here; not knowing the number of predictors, I did it for both independent normals and for one predictor at the given sample size, both showed essentially the same p-value; results should be similar for regression with small numbers of predictors.)

This doesn't actually suggest a problem with the inference when doing a regression or correlation, however. Your data won't be exactly normal; the essential question is 'are the data so badly non-normal that the inference no longer has the properties you wish?'

Unfortunately, when i do kolmogorov-smirnov, the significant value is 0.021, which indicates the residual is not normal.

What were the specified population mean and variance of the residuals for your KS test and how did you get such population values?

Could anybody please explain to me what to do.

I suggest you don't do a hypothesis test to assess the suitability of the assumption of normality, but instead to look at diagnostic displays that show you how badly non-normal the data are.

Some pointers -

See the points here

Also see the discussion on this question

See the comments under this answer, and the advice in this answer

Consider this advice

Related Solutions

Solved – How robust is ANOVA to violations of normality

Don't look at it as a binary thing: "either I can trust the results or I can't." Look at it as a spectrum. With all assumptions perfectly satisfied (including the in most cases crucial one of random sampling), statistics such as F- and p-values will allow you to make accurate sample-to-population inferences. The farther one gets from that situation, the more skeptical one should be about such results. You've got a substantial degree of nonnormality; that's one strike against accuracy. Now how about the other assumptions underlying the use of ANOVA? Size it all up the best you can, and document in a footnote or a technical section what you find. You also should look at this page, as @William pointed out.

As to your last question, I don't believe you need to change your strategy vis-a-vis multiple comparisons just because you move from a parametric to a nonparametric test. If you want to describe the rationale for your current approach, I'm sure people will be glad to comment on it.

Data Transformation – Methods to Increase Kurtosis and Skewness of Normal Random Variables

This can be done using the sinh-arcsinh transformation from

Jones, M. C. and Pewsey A. (2009). Sinh-arcsinh distributions. Biometrika 96: 761–780.

The transformation is defined as

$$H(x;\epsilon,\delta)=\sinh[\delta\sinh^{-1}(x)-\epsilon], \tag{$\star$}$$

where $\epsilon \in{\mathbb R}$ and $\delta \in {\mathbb R}_+$. When this transformation is applied to the normal CDF $S(x;\epsilon,\delta)=\Phi[H(x;\epsilon,\delta)]$, it produces a unimodal distribution whose parameters $(\epsilon,\delta)$ control skewness and kurtosis, respectively (Jones and Pewsey, 2009), in the sense of van Zwet (1969). In addition, if $\epsilon=0$ and $\delta=1$, we obtain the original normal distribution. See the following R code.

fs = function(x,epsilon,delta) dnorm(sinh(delta*asinh(x)-epsilon))*delta*cosh(delta*asinh(x)-epsilon)/sqrt(1+x^2)

vec = seq(-15,15,0.001)

plot(vec,fs(vec,0,1),type="l")
points(vec,fs(vec,1,1),type="l",col="red")
points(vec,fs(vec,2,1),type="l",col="blue")
points(vec,fs(vec,-1,1),type="l",col="red")
points(vec,fs(vec,-2,1),type="l",col="blue")

vec = seq(-5,5,0.001)

plot(vec,fs(vec,0,0.5),type="l",ylim=c(0,1))
points(vec,fs(vec,0,0.75),type="l",col="red")
points(vec,fs(vec,0,1),type="l",col="blue")
points(vec,fs(vec,0,1.25),type="l",col="red")
points(vec,fs(vec,0,1.5),type="l",col="blue")

Therefore, by choosing an appropriate sequence of parameters $(\epsilon_n,\delta_n)$, you can generate a sequence of distributions/transformations with different levels of skewness and kurtosis and make them look as similar or as different to the normal distribution as you want.

The following plot shows the outcome produced by the R code. For (i) $\epsilon=(-2,-1,0,1,2)$ and $\delta=1$, and (ii) $\epsilon=0$ and $\delta=(0.5,0.75,1,1.25,1.5)$.

enter image description here

Simulation of this distribution is straightforward given that you just have to transform a normal sample using the inverse of $(\star)$.

$$H^{-1}(x;\epsilon,\delta)=\sinh[\delta^{-1}(\sinh^{-1}(x)+\epsilon)]$$

Question

Best Answer

Related Solutions

Solved – How robust is ANOVA to violations of normality

Data Transformation – Methods to Increase Kurtosis and Skewness of Normal Random Variables

Related Question