I have been telling students that you cannot log-transform a 0/1 independent variable. My reason: the log of 0 is undefined. Am I wrong?
Solved – Log transformation of binary explanatory variable in regression
Tags: binary-data, dataset, instrumental-variables
Related Solutions
Take a look at McCullagh and Nelder (1989), Generalized Linear Models, 2nd ed., Section 2.5 (pp. 40–43), on iteratively reweighted least squares.
Let $y$ be the 0/1 outcome and let $\eta = g(\mu)$ be the link function. You never calculate $g(y)$ directly, but work with an adjusted dependent variable $$z = \hat{\eta}_0 + (y-\hat{\mu}_0) \left(\frac{d\eta}{d\mu}\right)_0$$ where $\hat{\eta}_0$ is the current estimate of the linear predictor, $X\hat{\beta}_0$, and $\hat{\mu}_0 = g^{-1}(\hat{\eta}_0)$. So that avoids the problem with $g(0)$ and $g(1)$ being $\pm \infty$.
For the logit link, $\eta = \ln[\mu / (1-\mu)]$, you'll find that $d\eta/d\mu = 1/[\mu(1-\mu)]$ and so you would have $$z = \hat{\eta}_0 + \frac{y-\hat{\mu}_0}{\hat{\mu}_0 (1 - \hat{\mu}_0)}$$
You further calculate weights $$w_0^{-1} = \left(\frac{d\eta}{d\mu}\right)^2_0 v_0$$ where $v_0 = V(\mu_0)$ comes from the mean/variance relationship, which for the binary case would be $V(\mu) = \mu(1-\mu)$. For the logit link, since $d\eta/d\mu = 1/[\mu(1-\mu)]$, you end up with weights $w_0 = \mu_0(1-\mu_0)$.
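To make that last step explicit, substitute the logit-link derivative and the binomial variance into the weight formula (just algebra, nothing new): $$w_0 = \left[\left(\frac{d\eta}{d\mu}\right)^2_0 v_0\right]^{-1} = \left[\frac{\mu_0(1-\mu_0)}{[\mu_0(1-\mu_0)]^2}\right]^{-1} = \mu_0(1-\mu_0)$$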
A key concern is the choice of starting values. You might look at the R source code to see what it does. I wrote down in a notebook to start with $\tilde{\mu} = 1/4$ if $y = 0$ and $\tilde{\mu} = 3/4$ if $y=1$, but I didn't record a source.
To spell out the iterative algorithm a bit more, focusing on the logit link (a runnable sketch follows the steps):
At the start you do the following:
- Start with initial "fitted" values, say $\hat{\mu}^{(0)}_i = $ 1/4 or 3/4 according to whether $y_i = $ 0 or 1
- Calculate $\hat{\eta}^{(0)}_i = \ln[\hat{\mu}^{(0)}_i/(1-\hat{\mu}^{(0)}_i)]$
- Calculate $z^{(0)}_i = \hat{\eta}^{(0)}_i + [y_i-\hat{\mu}^{(0)}_i]/[\hat{\mu}^{(0)}_i (1 - \hat{\mu}^{(0)}_i)]$
- Calculate the weights $w^{(0)}_i = \hat{\mu}^{(0)}_i (1-\hat{\mu}^{(0)}_i)$
- Regress the $z^{(0)}_i$ on $X$ using weights $w^{(0)}_i$, to get initial estimates $\hat{\beta}^{(0)}$
Then, at each iteration, you do the following:
- Calculate $\hat{\eta}^{(s)}_i = X \hat{\beta}^{(s-1)}$
- Calculate $\hat{\mu}^{(s)}_i = \exp(\hat{\eta}^{(s)}_i)/[1+\exp(\hat{\eta}^{(s)}_i)]$
- Calculate $z^{(s)}_i = \hat{\eta}^{(s)}_i + [y_i-\hat{\mu}^{(s)}_i]/[\hat{\mu}^{(s)}_i (1 - \hat{\mu}^{(s)}_i)]$
- Calculate the weights $w^{(s)}_i = \hat{\mu}^{(s)}_i (1-\hat{\mu}^{(s)}_i)$
- Regress the $z^{(s)}_i$ on $X$ using weights $w^{(s)}_i$, to get revised estimates $\hat{\beta}^{(s)}$
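For concreteness, here is a minimal runnable sketch of those steps in plain NumPy (the function name, tolerance, and convergence rule are my own illustrative choices, not from McCullagh and Nelder):

```python
import numpy as np

def irls_logit(X, y, tol=1e-8, max_iter=25):
    """Logistic regression via IRLS, following the steps listed above.

    X: (n, p) design matrix (include a column of ones for an intercept).
    y: (n,) array of 0/1 outcomes.
    """
    # Starting values: mu = 1/4 or 3/4 according to whether y = 0 or 1
    mu = np.where(y == 1, 0.75, 0.25)
    eta = np.log(mu / (1 - mu))
    beta = np.zeros(X.shape[1])

    for _ in range(max_iter):
        # Adjusted dependent variable and weights for the logit link
        z = eta + (y - mu) / (mu * (1 - mu))
        w = mu * (1 - mu)
        # Weighted least squares: solve (X' W X) beta = X' W z
        beta_new = np.linalg.solve(X.T @ (X * w[:, None]), X.T @ (w * z))
        if np.max(np.abs(beta_new - beta)) < tol:
            return beta_new
        beta = beta_new
        eta = X @ beta
        mu = 1.0 / (1.0 + np.exp(-eta))
    return beta
```

On simulated data this converges in a handful of iterations, and you can check the result against an off-the-shelf implementation such as statsmodels' `sm.Logit(y, X).fit()`.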
This is all just for regular logistic regression. For the local logistic regression version, there is some discussion in Chapter 4 of Loader (1999) Local regression and likelihood (but frankly, I didn't really follow it).
A Google search for "local logistic regression IRLS" revealed these notes from Patrick Breheny, which say (p. 8):
The weight given to an observation $i$ in a given iteration of the IRLS algorithm is then a product of the weight coming from the quadratic approximation to the likelihood and the weight coming from the kernel ($w_i = w_{1i} w_{2i}$)
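Translated into the same notation as the sketch above, the combined weight might look like this (the tricube kernel and the bandwidth are my illustrative assumptions; Breheny's notes only specify that the two weights are multiplied):

```python
import numpy as np

def combined_weights(x, x0, mu, bandwidth):
    """w_i = w1_i * w2_i, as in Breheny's notes.

    w1: IRLS weight from the quadratic approximation, mu * (1 - mu)
    w2: kernel weight for the distance of x_i from the fitting point x0
        (tricube kernel here; the kernel and bandwidth are illustrative)
    """
    w1 = mu * (1 - mu)
    u = np.abs(x - x0) / bandwidth
    w2 = np.where(u < 1, (1 - u**3) ** 3, 0.0)
    return w1 * w2
```

Inside the IRLS loop you would then use these combined weights in the weighted least-squares step, refitting separately at each target point $x_0$.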
Probit coefficients don't have a straightforward interpretation like the ones in logit (see this question). Use marginal effects (in Stata, the margins command) to obtain the change in $Pr(y)$ for a one-unit increase in $x$ when interpreting the coefficients of a probit.
If your predictor were not log-transformed, you could stop here, as you would have the change in $Pr(y)$ for a one-unit increase in $x$. Log-transformed variables require one more step, because a one-unit increase in $\log(x)$ means multiplying $x$ by the base of the logarithm (in this case $e$).
In this case, the marginal effects you would obtain for your log-transformed predictor would show the change in $Pr(y)$ for a 2.7182818285-fold change in $x$. Not very pretty. Your best bet is to use a logarithm with base 2 or base 10, so that you can interpret the marginal effects as the change in $Pr(y)$ for a two-fold or a ten-fold change in $x$. See this discussion on Statalist for another example.
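As a sketch of the base change (using Python's statsmodels as a stand-in for Stata's margins; the simulated data and coefficient values are made up for illustration):

```python
import numpy as np
from scipy.stats import norm
import statsmodels.api as sm

rng = np.random.default_rng(1)
x = rng.lognormal(size=500)
# Simulate a probit model that is linear in ln(x)
y = (rng.random(500) < norm.cdf(-0.5 + 0.8 * np.log(x))).astype(int)

# Fit with log2(x) instead of ln(x): same model, coefficient rescaled by ln(2)
fit = sm.Probit(y, sm.add_constant(np.log2(x))).fit(disp=0)

# Average marginal effect: change in Pr(y = 1) per unit of log2(x),
# i.e. per doubling of x
print(fit.get_margeff().summary())
```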
Best Answer
You're right, and not just because log zero is not defined.
Any one-to-one transformation of $0$ and $1$ to $a$ and $b$ is just a linear rescaling. This holds for any rule or function, even a nonlinear one (say $\log(x + c)$), because all that matters is where it sends $0$ and $1$. Think of it geometrically: any transformation that keeps the two values distinct defines just two points in the plane (original value on one axis, transformed value on the other), and two points determine a line, so on the data the transformation is indistinguishable from a linear one.
So it could not possibly do anything to improve whatever was thought to be a problem.
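A quick numerical check of this point (a minimal sketch; the simulated data and the choice of $\log(x + 1)$ as the transformation are just for illustration):

```python
import numpy as np

rng = np.random.default_rng(2)
d = rng.integers(0, 2, size=100).astype(float)  # a 0/1 predictor
y = 1.0 + 2.0 * d + rng.normal(size=100)

def fitted(x, y):
    """OLS fitted values from a regression of y on an intercept and x."""
    X = np.column_stack([np.ones_like(x), x])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return X @ beta

# log(d + 1) sends {0, 1} to {0, log 2}: just a linear rescaling,
# so the fitted values (and hence the fit) are unchanged
print(np.allclose(fitted(d, y), fitted(np.log(d + 1.0), y)))  # True
```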
For example, contrary to a surprisingly common myth, there are no strict assumptions about the marginal distributions of predictors (which is not to say that, say, 665 zeros and 1 one for a predictor is not a situation needing care and attention).
(0, 1) predictors are fine and convenient because they lead to clean parameterisations and explanations of changes in level and slope.