Solved – Panel data: difference between time effects and cross-sectional dependence

fixed-effects-modelpanel data

I am currently learning (by myself) about the analysis of panel data.

What I have seen so far is that a fixed effects model allows us to control for idiosyncratic differences between entities. Moreover, dependencies that universally affect all entities can be controlled by adding time effects.

Here is my question: Are cross sectional dependencies completely controlled when I add time effects to the model ? Or do I need to account for cross sectional dependencies even after I have fixed time effects ?

(Background to the question: One of my study guides explains that time effects are those phenomena that affect all entities over time, such as federal laws in a study local state policies. But then another study guide states that this as an example of a cross sectional dependence which has to be checked even if time effects have been added to the model! That leaves me wondering about the difference of cross sectional dependencies and time effects)

Many thanks for your help!

Best Answer

I'm just thinking out loud here,

Suppose you have industry-county-year level data, your outcome is $Y_{ict}$, and you are interested in the effect of some variable $x_{ict}$.

In your strategy you would correctly think you can use:

(1) industry-county (panel) fixed effects to control for time invariant confounding factors across these panels as well as the average difference in time varying covariates across industry-county pairs

(2) year fixed effects to control for shocks that are common to all industries and counties in a given year

However what if there are shocks that are common across some counties in regions indexed by $r$, yet are both time varying and different across regions?

That is, perhaps the true data generating process is

$Y_{ict}=\underbrace{\theta_{ic}}_\text{panel fixed effect}+\underbrace{\theta_t}_\text{year fixed effect}+\underbrace{\theta_{rt}}_\text{regional shocks}+\underbrace{\beta}_\text{parameter of interest} X_{ict}+\underbrace{\epsilon_{ict}}_\text{idiosyncratic shock}$

But you estimate a model

$Y_{ict}=\theta_{ic}+\theta_t+\beta X_{ict}+\epsilon_{ict}$

which does not attempt to proxy for this regional shock, then,to to the degree that $Cov(\theta_{rt},X_{ict})\neq 0$, I believe your estimate $\hat{\beta}$ would in part reflect the variation in $\theta_{rt}$ that covaries with $X_{ict}$.

That is,

$plim \; \hat{\beta} =\underbrace{ \beta}_\text{true parameter} + \underbrace{\frac{Cov(X_{ict},\theta_{rt})}{Var(X_{ict}}}_\text{bias}$

to solve this I believe it is possible that you could

(1) Cluster your standard errors at the geographical level where you think there may be correlated disturbances

and

2) Find an instrument $Z_{ict}$ for $X_{ict}$ that is strongly correlated with $X_{ict}$ (relevant) that has an effect on the outcome only through its effect on $X_{ict}$ and not through $\theta_{rt}$ influencing $Z_{ict}$ or through $Z_{ict}$ influencing $Y_{ict}$ directly (excludibility).

Related Question