Solved – multcomp() vs emmeans() for multiple comparisons

Tags: multiple regression, multiple-comparisons, r

I have two sets of models. The first set contains a number of logistic models, fitted using glm(), with different binary dependent variables. The second set contains a number of linear models, fitted with lm(), with different continuous dependent variables.

All models are testing for differences between participants in one of four conditions (three treatments and one control) coded as a factor (with the control as the contrast), while controlling for a number of demographic variables, some of them factors and some of them continuous.

The sample size is roughly 800 per condition (the conditions aren't perfectly balanced), so about 3,200 in total.

In addition to comparing the treatment conditions to the control through summary(), however, I would also like to compare the treatment conditions to one another, to see if some are significantly more effective than others. My question is whether to use multcomp() or emmeans() to do this.

As I understand it (e.g., from here), the main difference is that emmeans() uses a t statistic (assuming that hasn’t changed from lsmeans()) while multcomp() uses a z statistic, and that the latter therefore tends to result in inappropriately small p values and short confidence intervals.

That would seem to recommend emmeans(). Or are there ever other considerations making multcomp() more appropriate?

Best Answer

For glm models, both use a z statistic. In general, there is little difference between using emmeans::contrast() and multcomp::glht() apart from the user interface. glht() is somewhat harder to use with multi-factor models because it has no convenient interface for specifying pairwise comparisons within limited groups or of marginal averages; on the other hand, you can specify comparisons in glht() the same way as in emmeans() by using emm() instead of mcp(). See ?emmeans::emm. Alternatively, you can convert an emmGrid object to a glht object with as.glht() and then summarize it in multcomp.
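As a rough illustration of the interfaces mentioned above, here is a minimal sketch on simulated data. The data frame dat, the variable names condition, age and y, and the effect sizes are all made up for the example; only the function calls come from the two packages.

```r
library(emmeans)
library(multcomp)

# Simulated stand-in for one of the logistic models (all names are hypothetical)
set.seed(1)
n <- 400
dat <- data.frame(
  condition = factor(sample(c("control", "t1", "t2", "t3"), n, replace = TRUE)),
  age       = rnorm(n, mean = 40, sd = 10)
)
dat$y <- rbinom(n, 1, plogis(-0.5 + 0.4 * (dat$condition != "control")))

fit <- glm(y ~ condition + age, family = binomial, data = dat)

# emmeans interface: all pairwise comparisons of the four conditions
emm_cond <- emmeans(fit, ~ condition)
pairs(emm_cond)

# multcomp interface with mcp()
summary(glht(fit, linfct = mcp(condition = "Tukey")))

# multcomp interface with an emmeans-style specification via emm()
summary(glht(fit, emm(pairwise ~ condition)))

# Convert the emmGrid of pairwise contrasts to a glht object
summary(as.glht(pairs(emm_cond)))
```

Note that pairs() and mcp(condition = "Tukey") may order each difference in opposite directions, so the estimates can differ in sign while describing the same comparison.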

The default multiplicity adjustment in glht() is the single-step method, and that same method is available as adjust = "mvt" in contrast(). One situation where you may prefer glht() is if you want to use one of the multi-step adjustment methods it offers, which are not available in contrast(). (The exception is that those in p.adjust.methods are available in both.)
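The adjustment options described above can be sketched as follows. This uses the built-in warpbreaks data purely as a convenient example model; the object names warp_lm, emm_t and g are illustrative.

```r
library(emmeans)
library(multcomp)

# Example model on a dataset that ships with R
warp_lm <- lm(breaks ~ wool + tension, data = warpbreaks)
emm_t   <- emmeans(warp_lm, ~ tension)

# Single-step adjustment in both packages
g <- glht(warp_lm, linfct = mcp(tension = "Tukey"))
summary(g)                                  # glht() default: single-step
contrast(emm_t, "pairwise", adjust = "mvt") # the same kind of adjustment in emmeans

# Multi-step adjustments offered by multcomp
summary(g, test = adjusted("Westfall"))
summary(g, test = adjusted("Shaffer"))

# Methods from p.adjust.methods work in both
summary(g, test = adjusted("holm"))
pairs(emm_t, adjust = "holm")
```

The "mvt" and single-step p values are computed by numerical integration, so they may differ slightly between runs and between the two packages.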

Related Solutions

Solved – How to interpret logistic regression coefficients with interactions between binary and continuous variables

The odds of being elected when you are not treated increase by a factor of $\exp(1.50083)\approx 4.49$, that is, by $(4.49-1)\times100\%=349\%$, if you move from a city with no one treated to a city where everyone is treated.

This effect of Treat.City increases by a factor of $\exp(2.80625)\approx16.55$, that is, by $(16.55-1)\times100\%=1555\%$, if one is treated. For more, see: http://maartenbuis.nl/publications/interactions.html

Given the large size of the effect, I will assume that Treat.City is not a percentage but a proportion. The effects will be more realistic and easier to interpret if you rescale Treat.City to percentages, so that each coefficient refers to a one-percentage-point change.
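The arithmetic in this answer can be checked with a few lines of base R. The two coefficients are the ones quoted above; the variable names are illustrative, and the rescaling step assumes, as the answer does, that Treat.City is currently a proportion between 0 and 1.

```r
# Coefficients quoted in the answer (log-odds scale)
b_city        <- 1.50083  # Treat.City effect for the untreated
b_interaction <- 2.80625  # change in that effect for the treated

# Odds ratio for Treat.City among the untreated, and as a percent change
or_city  <- exp(b_city)
pct_city <- (or_city - 1) * 100

# Factor by which the Treat.City odds ratio is multiplied for the treated
ratio_interaction <- exp(b_interaction)

# Odds ratio for Treat.City among the treated
or_city_treated <- or_city * ratio_interaction

# If Treat.City were rescaled from a proportion to percentages, the
# coefficient would shrink by a factor of 100 (per percentage point)
or_city_per_point <- exp(b_city / 100)

round(c(or_city, ratio_interaction), 2)
```

On the percentage scale the odds ratio per point is much closer to 1, which is the sense in which the rescaled effect is easier to interpret.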

Pairwise Comparisons Using emmeans in Mixed Three-Way Interactions

It shouldn't be necessary to fit a separate model just to do the post-hoc comparisons you want. You had tried:

emms <- emmeans(fit1b, ~ AB*C)
contrast(emms, interaction = "pairwise")

but you can get the same results from the original model through judicious use of by variables:

emms1 <- emmeans(fit1, ~ A*B | C)
con1 <- contrast(emms1, interaction = "pairwise")
pairs(con1, by = NULL)

The con1 results are the desired 1-d.f. interaction effects for each level of C (the by factor is remembered). Then we compare them pairwise, no longer using the by grouping. By default, a Tukey adjustment is made to the family of comparisons, but you may use a different method via adjust.
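For readers who want to try these steps, here is a self-contained sketch on simulated data. The model fit below is a plain lm() used as a hypothetical stand-in for fit1 (the original model is not shown), with two-level factors A and B and a three-level factor C.

```r
library(emmeans)

# Simulated balanced design; all names and values are made up
set.seed(42)
dat <- expand.grid(A = c("a1", "a2"), B = c("b1", "b2"),
                   C = c("c1", "c2", "c3"), rep = 1:10)
dat$y <- rnorm(nrow(dat))

fit <- lm(y ~ A * B * C, data = dat)  # stand-in for fit1

# A-by-B interaction contrast within each level of C
emms1 <- emmeans(fit, ~ A * B | C)
con1  <- contrast(emms1, interaction = "pairwise")
con1

# Compare those interaction contrasts across levels of C
pairs(con1, by = NULL)                   # default Tukey adjustment
pairs(con1, by = NULL, adjust = "holm")  # a different method via adjust
```

With two levels each for A and B, con1 holds one interaction contrast per level of C, and the final step compares those three contrasts pairwise.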
