Solved – Log transformation results in a negative coefficient

data transformationlogarithmlogisticregression coefficients

I am running a logistic regression with 7 independent variables. One of these variables is income. If I don't log-transform income and run the regression, it results in a positive coefficient, however, when I log-transform it, it results in a negative coefficient.

Why would this happen?

Best Answer

I can think of several reasons why this could happen.

  1. The relationship between the logit of the DV and the IV is nonmonotonic, either with the original IV or the log of the IV or both, and taking the log changes the sign of the linear relationship. In this case, you should model the relationship using either polynomial terms or a spline.

  2. The IV has outliers that affect things differently when it is logged.

  3. The IV is nearly colinear with one of the other IVs, either when it is logged or in its original form.

In any case, you should decide whether to log the IV based on substantive grounds and then work on finding a good model. Income is often logged because we think about income multiplicatively rather than additively. That is, if you are making \$20,000 and get a \$5,000 raise, that's huge. If you are making \$200,000 and get a \$5,000 raise, it's not.

Related Question