Solved – Interpret effect of adding random effects to ordinal regression (R – ordinal package – clmm)

fixed-effects-modelmixed modelordinal-datarrandom-effects-model

I know there are already lots of questions around this topic (especially this one and this one) but I haven't really seen anything that directly helps me (It will be obvious I'm not a great statistician, but I'll do my best to explain).

I am running an ordinal regression in R (clm and clmm). My response variable is a rating between 0 and 4. I have two types of explanatory variables: individual and scenario variables [let's say IVs and SVs].

Six different scenario variables (all dummies with at most 4 different values) represent potential collaboration scenarios that get rated by the respondent (between 0 and 4) creating the response variable. (Research design is a conjoint analysis; there are a total of 192 different scenarios possible)

On top of that I have a variety of individual characteristics about the respondent (age, gender, work experience, networking skills, …) all derived from a survey.

Every respondent rates between 3 and 16 different scenarios (average 8.1); every scenario is rated by at least 8 respondents. Every respondent and every scenario have a unique identifier (called IVid and SVid). So they are non nested within each other.

Thus the basic regression looks like this:

clm.base <- clm(rating ~ SVs + IVs, data = dt)

The hypothesis I am trying to test is that there are specific individual characteristics, that will influence the rating of the scenarios, independent of the actual content of the scenarios. Basically, some people are more or less favourable to all types of collaboration scenarios.

Now a reviewer of my paper asks me to include individual fixed effects (which in management [my field] basically means dummies for each individual). My assumption originally was that this would result in all individual variables being dropped. This is exactly what happens when I use another model (package lfe)

felm.complete <- felm(rating ~ SVs + IVs | SVid + IVid | 0 | IVid, data = dt)

In this regression basically all my variables are perfectly collinear as expected.
However, when I approximate this in the ordinal package, there is no perfect collinearity. I presume this is related that clmm adds so-called 'random effects'. The regression takes a couple of minutes to run but eventually returns results

clmm.complete <- clmm(rating ~ SVs + IVs + (1|SVid) + (1|IVid), data = dt)

Now, the results here are pretty useless:

All but one of my IVs are insignificant

I am trying to understand what exactly happens when adding the (1|IVid) term in the clmm model. If it basically adds something like an individual dummy than the fact almost everything is now insignificant is no surprise. The coefficients of the IVid dummies would capture the effect I am looking for (some people rate all scenarios higher or lower, regardless of scenario content) most accurately.

Now I wonder whether this interpretation is correct or whether the results I got from running the simple clm regression are just not reliable?

Concretely, I'd like to find out:

What happens when adding a random effect to clmm
A laymen explanation of how the Laplace approximation works
How to group errors around individuals when running clm
Is it possible to extract the coefficients of these random effects (1|id) for as far as there is such a thing?

Best Answer

I can't answer all of your questions, but this is what I know for sure.

Adding a random effect to clmm has much the same impact as adding a random term to a logistic regression model. Take a simple case with ordinal response $Y$, continuous covariate $X$ and grouping variable $G$. Write $p_k=Prob(Y \leq k)$, for some level of the ordinal response. The model posits that for subject $i$ in group $j$, $$\log \frac{p_k}{1-p_k}=\theta_k - \beta x_{ij} - g_j$$

Takeaways:

We are modeling a chain of logistic regressions where the binary response is: "less than or equal to a certain level" vs "greater than that level".
the $\theta$ parameter depends only on the ordinal response. It's like an intercept term, although there is one of these for each level.
$\beta$ is the slope parameter for the fixed effect $x$, and it does not depend on $k$. The impact of $x$ applies proportionally to all levels of the response.
$g_j$ is an unseen, grouping effect. clmm will estimate the value of $g_j$ and the variance parameter of its (assumed) normal distribution.
The minus signs are a convention of clm and clmm. In interpreting the parameters, negative values imply a greater probability for low ranking responses and positive values imply that the odds favour higher levels of the response. I do not believe that this convention is universal. SPSS, I'm looking at you.

In this simple model, the random effect increases or decreases the odds ratio by a factor of $e^{-g_j}$. This factor is independent of the level itself. Say, if $e^{-g_j}=0.5$ in some group, then all odds ratios are cut by a half for members of that group. This assumption is the weak link of the proportional odds model.

You could include random slopes as well, if you believe there is an interaction between the group and covariate $x$.

You can extract the random effects that the model estimates using method ranef, as in ranef(clmm.complete).

Laplace approximation is a method of numerical integration. Random effects models involve an iterative procedure to estimate the random effects and the process parameters, the EM algorithm. I believe that's where the Laplace approximation comes in, since during the estimation phase, you have to integrate over the unknown parameters. However, I don't know the details of the algorithm employed by clmm. Perhaps someone else can give a better answer.

Be aware that clmm requires an optimization to produce estimates. As part of the summary function, you get some information about whether or not the optimization succeeded. The gradient should be near zero and the Hessian index should be less than 10000.

Regarding the specifics of your model, I don't quite understand the difference between variable SV and variable SVid. In any case, adding (1|SVid) basically changes the odds ratio for each group indexed by SVid. The model has to estimate one parameter for each SV group, a variance and possibly some covariance terms with other parameter estimates. Your description of the study suggests a lot of groups and not much data in each, so that could lead to problems with instability and convergence of the model.

I'm not sure what you mean by "grouping errors around individuals".

Related Solutions

Solved – Time-invariant variables not being removed in Fixed Effects model. And feasibility of addional time dummies in Fixed Effect/Random modelling

Having an unbalanced panel is not a problem nowadays. In the past, when econometrics had to be done by hand, inverting matrices for unbalanced panels was more difficult but for computers this is not a problem. The only worry connected today with this is the question why the panel is unbalanced: is it due to attrition? If yes, is this attrition random or related to characteristics of the statistical units? For instance, in surveys people with higher education tend to be more responsive and stay in the panel longer for that reason.

Regarding the fixed effects model, have you checked whether the variables that are time-invariant in theory are actual not varying over time? Sometimes coding errors sneak in and then all the sudden a variable varies over time when it shouldn't. One way of checking this is to use the xtsum command which displays overall, between, and within summary statistics. The time-invariant variables should have a zero within standard deviation. If they don't then something went wrong in the coding.

Having a negative Hausman test statistics is a bad thing because the matrices that the test is built on are positive semi-definite and therefore the theoretical values of the test are positive. Negative values point towards model misspecification or a too small sample (related to this is this question).

If you cluster your standard errors you also need a modified version of the Hausman test. This is implemented in the xtoverid command. You can use it like this:

xtreg ln_r_prisperkg_Frst_102202 Dflere_mottak_tur i.landingsfylkekode i.kvartiler_ny markedsk_torsk gjenv_TAC_NØtorsk_år_prct lalder_fartøy i.fangstr r_minst_Frst_torsk gjenv_kvote_NØtorsk_fartøy_prct i.lengde_gruppering mobilitet, fe vce(cluster fartyid)
xtoverid

Rejecting the null rejects the validity of the assumptions underlying the random effects mode.

The xtset command only takes into account the unit id for fixed effects estimation. The time variable does not eliminate time fixed effects. So if you do

xtset id time
xtreg y x, fe

will give you the exact same results as

xtset id
xtreg y x, fe

The time variable is only specified for commands for which the sorting order of the data matters, for instance xtserial which tests for panel autocorrelation requires this. This has been discussed here. So if you want to include time fixed effects, you need to include the day dummies separately via i.day, for example. In this context, the season and year dummies make sense so it's good that you use them.

Solved – Main effects flip sign when adding interaction term in ordinal logistic regression

First, as AlexK pointed out, this flipping of signs can happen because of the nature of multiple regressions. The coefficient in a multiple regression gives you the marginal effect of that variable with all other terms in the model held constant.

Let's look at interaction effects, but in a highly simplified 2x2 design. The principle can of course be extended to continuous variables, it is just a lot easier to think about it this way. Imagine your data set looked like this, with the means in the cells and the marginal means around it.

                   GDP
                   Low   High
Corruption    Low   10      8   9
              High   8     20  14
                     9     14

As you can see, both GDP and Corruption have a positive main effect, because in both "High" groups, means are higher. The model would be:

y = b0 + b1*GDP + b2*Corr (with both IV either 0 or 1).

Estimating coefficients would lead to both b1 and b2 being positive, due to the high mean when both are 1. The model has no chance but to estimate coefficients in a way that fit the data best, and with this model, it will estimate two positive coefficients.

20 = b0 + b1*1 + b2*1
 8 = b0 + b1*0 + b2*1
 8 = b0 + b1*1 + b2*0
10 = b0 + b1*0 + b2*0

Let's include the interaction term (GDP*Corr), which will actually be 0 in all but the first equation.

20 = b0 + b1*1 + b2*1 + b3*1*1
 8 = b0 + b1*0 + b2*1 + b3*0*1
 8 = b0 + b1*1 + b2*0 + b3*1*0
10 = b0 + b1*0 + b2*0 + b3*0*0

Now the model has a chance to account for the "diagonal" effect of both IVs being "High" together. The high mean of 20 can now be accounted for by b3. The model can estimate marginal effects holding the interaction effect constant and show that the remaining (=marginal) effects reflected by coefficients b1 and b2 are actually negative.

I had to add: The interpretation of something like this can become tricky very quickly (imagine three-way interactions). If you have a significant interaction, be very careful how you interpret it and any remaining main effects!

Best Answer

Related Solutions

Solved – Time-invariant variables not being removed in Fixed Effects model. And feasibility of addional time dummies in Fixed Effect/Random modelling

Solved – Main effects flip sign when adding interaction term in ordinal logistic regression

Related Question