Solved – Compare linear regression models (same and different response variables)

linear model, regression

How can I (1) compare two linear models fitted to different years, and (2) compare two models with different response variables?

My data have 4 variables: y_meas, x, year, y_calc. "y_meas" is a lab-measured response variable, and "y_calc" is an estimate of the same variable obtained from a standard calculation. "x" is a dosage, similar(ish) between the two years:

# create dataset
set.seed(100)
dat <- within(data.frame(x = rep(1:10, times = 2)),
              {
                year <- rep(1990:1991, each = 10)
                y_meas <- 0.5 * x * (1:20) + rnorm(20)
                y_calc <- 0.3 * x * (1:20) + rnorm(20)
                year <- factor(year)   # convert to a factor
              })

I have two related questions:

(1) Is there any difference between the slopes/intercepts of the models for 1990 and 1991?

m.1990 <- lm(y_meas ~ x, data = subset(dat, year == 1990))
m.1991 <- lm(y_meas ~ x, data = subset(dat, year == 1991))
anova(m.1990)
anova(m.1991)
# both models are significant

I can't run anova(m.1990, m.1991) because the models are fitted to different subsets of the data and are not nested. Do I need to use year as a dummy variable and run an ANCOVA? What would that look like (roughly)?
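My rough guess at that ANCOVA is the sketch below (year as a factor interacting with x), but I'd like confirmation:

# tentative ANCOVA: one model spanning both years
m.both <- lm(y_meas ~ x * year, data = dat)
anova(m.both)    # the x:year term would test whether the slopes differ
summary(m.both)  # year1991 gives the intercept shift, x:year1991 the slope shift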

(2) Assuming I can combine 1990 and 1991, can I compare the slope/intercept of 'y_meas ~ x' and 'y_calc ~ x'? Yes, these are two different response variables, but they are on the same scale.

Best Answer

Just put everything in one model:

library(reshape2)
# reshape to long format: 'variable' identifies y_meas vs. y_calc,
# 'value' holds the response
dat1 <- melt(dat, id.vars = c("x", "year"))

mod <- lm(value ~ x * variable * year, data = dat1)
anova(mod)

#Analysis of Variance Table
#
#Response: value
#                Df  Sum Sq Mean Sq   F value    Pr(>F)    
#x                1 13546.1 13546.1 1087.7872 < 2.2e-16 ***
#variable         1  1746.5  1746.5  140.2458 3.103e-13 ***
#year             1  4994.1  4994.1  401.0389 < 2.2e-16 ***
#x:variable       1   860.9   860.9   69.1300 1.687e-09 ***
#x:year           1  1399.1  1399.1  112.3510 5.352e-12 ***
#variable:year    1   292.1   292.1   23.4527 3.137e-05 ***
#x:variable:year  1    81.6    81.6    6.5533   0.01539 *  
#Residuals       32   398.5    12.5                        
#---
#Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

As you can see, you get the significant main effects and interactions you'd expect from your artificial data.
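A side note, not part of the original answer: unlike the two separate per-year fits, models within this combined framework are nested, so anova() can compare them directly, e.g. to test the three-way interaction as a whole:

# drop the three-way interaction; this reduced model is nested in 'mod'
mod2 <- lm(value ~ (x + variable + year)^2, data = dat1)
anova(mod2, mod)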

It's always recommended to plot the data along with the fitted model:

library(ggplot2)
# predict at the extremes of x only: two points suffice for straight lines
newdat <- expand.grid(x = c(1, 10), variable = c("y_calc", "y_meas"), year = factor(1990:1991))
newdat <- cbind(newdat, predict(mod, newdata = newdat, interval = "confidence"))

ggplot(dat1, aes(x = x, y = value, colour = year, shape = variable, linetype = variable)) +
  geom_ribbon(data = newdat, aes(y = fit, ymin = lwr, ymax = upr, fill = year, colour = NULL), alpha = 0.2) +
  geom_line(data = newdat, aes(y = fit)) +
  geom_point()

[Plot of the data and fitted lines with confidence bands]

This shows that the differences between the slopes matter most, which the summary table confirms:

summary(mod)

#Coefficients:
#                          Estimate Std. Error t value Pr(>|t|)    
#(Intercept)                -6.3443     2.4107  -2.632 0.012963 *  
#x                           3.2300     0.3885   8.314 1.69e-09 ***
#variabley_meas             -4.4852     3.4092  -1.316 0.197648    
#year1991                   -0.2361     3.4092  -0.069 0.945218    
#x:variabley_meas            2.2357     0.5494   4.069 0.000288 ***
#x:year1991                  3.1235     0.5494   5.685 2.71e-06 ***
#variabley_meas:year1991    -0.1319     4.8213  -0.027 0.978341    
#x:variabley_meas:year1991   1.9891     0.7770   2.560 0.015394 *  
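If you want the four fitted slopes themselves, rather than reading them off the treatment contrasts above, one option (not used in the original answer) is the emmeans package:

library(emmeans)
# estimated slope of x for each variable/year combination, with CIs
emtrends(mod, ~ variable * year, var = "x")
# pairwise differences between those slopes
pairs(emtrends(mod, ~ variable * year, var = "x"))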

Note that the usual diagnostics (in particular for variance homogeneity) should of course be performed.
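A minimal sketch of such checks, using base R's built-in diagnostic plots:

# residuals vs. fitted, Q-Q plot, scale-location, residuals vs. leverage
par(mfrow = c(2, 2))
plot(mod)
par(mfrow = c(1, 1))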