Solved – Is the least squares dummy variable model better than the random effects model

categorical-data, fixed-effects-model, r-squared, random-effects-model, statistical-significance

I have a panel dataset with one dependent and twelve independent variables: 50 individuals observed over 100 days. Theoretically, most of the regressors should be significant. First, I checked for individual effects using the Breusch-Pagan Lagrangian multiplier test. As I found the effects to be significant, I performed the Hausman test and, based on it, chose the random effects model. I am getting at most 4 significant variables (by p-value) in all models (pooled, fixed, and random). The problem is the very low adjusted R-squared (0.01 to 0.02). When I use a least squares dummy variable (LSDV) model with all days as dummy variables, I find all the day dummies to be significant, though some explanatory variables are not, and the adjusted R-squared becomes approximately 0.90. Since there are 100 days, there are 100 dummy variables. I also find the R-squared to be about 0.90 when I instead use the 50 individual dummies in the LSDV model. My questions are:

  1. Is the difference between the RE model's and the LSDV model's R-squared due to unobserved heterogeneity that is correlated with the regressors across time periods?
  2. Is there a limit to the number of dummy variables one can include?
  3. Is my LSDV model better than the FE or RE model?

Kindly also guide me to appropriate literature. The analysis is conducted in R using the plm package.
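For reference, a minimal sketch of that workflow with plm might look like the following; the data frame `pdat` and the column names (`id`, `day`, `y`, `x1`, ..., `x12`) are placeholders for the actual data, and only three of the twelve regressors are written out:

```r
library(plm)

# pdat: panel data frame with columns id (50 individuals), day (100 days),
# the dependent variable y, and regressors x1, ..., x12 (placeholder names)
pdat <- pdata.frame(pdat, index = c("id", "day"))

fml <- y ~ x1 + x2 + x3              # extend with all twelve regressors

pooled <- plm(fml, data = pdat, model = "pooling")
fe     <- plm(fml, data = pdat, model = "within")   # fixed effects
re     <- plm(fml, data = pdat, model = "random")   # random effects

plmtest(pooled, type = "bp")         # Breusch-Pagan LM test for individual effects
phtest(fe, re)                       # Hausman test: FE vs RE

# LSDV with the 100 day dummies, estimated by plain OLS
lsdv_day <- lm(y ~ x1 + x2 + x3 + factor(day), data = pdat)
summary(lsdv_day)$adj.r.squared      # the ~0.90 adjusted R-squared reported above
```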

Best Answer

Here is a start at an answer.

In your situation, where $i$ indexes observational units and $t$ indexes time, the model is

$$y_{it} = \beta X_{it} + \gamma_i + \epsilon_{it},$$

where $\gamma_i$ is the unobserved heterogeneity. LSDV should produce the same coefficients as the fixed effects (FE) estimator; however, the standard errors will be different. FE is better than LSDV in the asymptotic setting where the number of individuals grows while the number of time periods is fixed; LSDV is better in the opposite case, that is, when the sample grows while the number of observational units stays (more or less) fixed.
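The coefficient equivalence is easy to verify; a quick check along the lines below (reusing the placeholder names from the sketch in the question) should show the same slope estimates from the within estimator and from OLS with individual dummies:

```r
library(plm)

# within (FE) estimator vs. LSDV with individual dummies (placeholder names as above)
fe   <- plm(y ~ x1 + x2 + x3, data = pdat, model = "within")
lsdv <- lm(y ~ x1 + x2 + x3 + factor(id), data = pdat)

all.equal(coef(fe), coef(lsdv)[names(coef(fe))])   # TRUE up to numerical error
fixef(fe)                                          # the estimated gamma_i absorbed by the dummies
```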

My understanding of random effects (RE) is that if certain assumptions about the covariance structure hold, then RE is more efficient (smaller standard errors) than FE and LSDV; all three estimators are consistent. However, FE and LSDV are agnostic about those covariance assumptions, so when the assumptions are violated, RE becomes an inconsistent estimator while FE and LSDV remain consistent. This is the scenario you were testing for with the Hausman test. In Mostly Harmless Econometrics, footnote 2 on page 223, Angrist and Pischke say that they prefer FE (OLS) to RE (GLS) as "GLS requires stronger assumptions than OLS and the resulting efficiency gain is likely to be modest, while finite sample properties might be worse."
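If you do stick with FE along those lines, a common companion choice is to report the FE point estimates with standard errors clustered by individual rather than leaning on GLS-style assumptions; one way to do that with plm (again with the placeholder names) is:

```r
library(plm)
library(lmtest)   # for coeftest()

fe <- plm(y ~ x1 + x2 + x3, data = pdat, model = "within")

# FE point estimates with standard errors clustered by individual
coeftest(fe, vcov = vcovHC(fe, method = "arellano", cluster = "group"))
```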

As far as the low adjusted $R^2$ goes, it is not as big a concern as you might think if your goal is to estimate the parameters (the $\beta$s). To see this, consider the five assumptions laid out in Wooldridge's Introductory Econometrics: A Modern Approach (or his graduate text) for BLUE, or their asymptotic analogues: $R^2$ does not appear in any of the conditions for unbiased or consistent (and efficient) estimation.

However, if your goal is model fitting, then the model producing higher $R^2$ might be something to consider.

To throw in one extra bit of information: you have a long time dimension (100 periods), so you might consider estimators that model time more seriously. One of these is the Arellano-Bond estimator; see this blog post for a competing method together with useful references. A second option is multi-way clustering as in Cameron, Gelbach, and Miller. I suspect that an Arellano-Bond approach will be more appropriate in your situation.
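If you want to experiment in that direction, plm itself offers pgmm() for Arellano-Bond style difference GMM and vcovDC() for double clustering over individuals and time; a very rough sketch, with the same placeholder names and a lag structure chosen purely for illustration, might be:

```r
library(plm)
library(lmtest)

# Arellano-Bond style difference GMM; the single lag of y and the instrument
# lags 2:10 are illustrative choices, not a recommendation
ab <- pgmm(y ~ lag(y, 1) + x1 + x2 + x3 | lag(y, 2:10),
           data = pdat, effect = "individual", model = "twosteps")
summary(ab)

# Two-way (individual and time) clustering in the spirit of Cameron, Gelbach & Miller
fe <- plm(y ~ x1 + x2 + x3, data = pdat, model = "within")
coeftest(fe, vcov = vcovDC(fe))
```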