Differencing estimator in two-way fixed effects model

differencesfixed-effects-modelpanel dataregression

Consider a simple linear panel regression model with two-way fixed effects:
$$Y_{it}=\alpha_i+\mu_t+X_{it}'\beta+u_{it}, \quad i=1,…,n, \ t=1,…,T,$$
where $n$ and $T$ represent the total units in individual and time dimensions, respectively. Variables $\alpha_i$ and $\mu_t $ are unobserved individual and time effects, respectively, which are correlated with (time variant) regressors $X_{it}\in \mathcal{R}^p$. The random noise $u_{it}$ satisfies $E(u_{it}|\alpha_i,\mu_t,X_{it})=0$. When both $n$ and $T$ are large, the least square dummy variable approach becomes infeasible for a consistent estimator of $\beta$. One popular method is the demean approach that wipes out $\alpha_i$ and $\mu_t $.

Question: If we instead use differencing estimator, can the estimator for $\beta$ be still consistent? Namely, we perform differencing on both sides of the model above to have
$$\ddot{\Delta}Y_{it}=\ddot{\Delta}X_{it}'\beta+\ddot{\Delta}u_{it}, \quad i=2,…,n, \ t=2,…,T,$$
where $\ddot{\Delta}Y_{it}=Y_{it}-Y_{i,t-1}-Y_{i-1,t}+Y_{i-1,t-1}$ and similarly for $\ddot{\Delta}X_{it} $ and $\ddot{\Delta}u_{it}$. I am sure that the $\beta$ can be estimated through OLS if we assume $E(\ddot{\Delta}u_{it}|\alpha_i,\mu_t,X_{it})=0.$

However, in such differencing process, $Y_{i-1,t}$ is not uniquely defined, because the order of $i=1,…,n$ can be arbitrary. For instance, if $i=3$ stands for the U.S., then $i-1=2$ can be any other country in a dataset, like China or Canada. I am not sure if this is a problem for the differencing estimator. I found no closed reference from the literature, so any help would be appreciated.

Thanks.

Best Answer

The extension of the first differencing for one-way fixed effect to two-way fixed effects involves variables observed at $i-1$. Since the order of $i=1,...,n$ is trivial, the estimates given a sample can be different with different orders of $i$. In other words, there will be $n!$ different choices of the order, which can be huge in application.

Here, I propose to revise the differencing transformation $\ddot\Delta Y_{it}$ as $$\ddot\Delta Y_{it}=Y_{it}-Y_{i,t-1}-Y_{1,t}+Y_{1,t-1},$$ where the variables at $i-1$ in the original post is now replaced by the first individual at $t$ and $t-1$, i.e., $Y_{1,t}$ and $Y_{1,t-1}.$ While the decision of who is going to be the first individual is an open question, it decreases the number of the order choice from $n!$ to $n$. Through a simulation, I observe that the difference of the estimate given a total of $n$ different choice of the first individual is reasonably small, and vanishes as sample size rises.

Any other thoughts are very welcome.

Related Question