Solved – Comparing coefficients in logistic regression, with different samples

categorical datalogisticregressionregression coefficientsstata

I have a problem comparing the coefficients of my logistic regression models, in Stata.

I have a dependent variable (DV) 'being an entrepreneur' and multiple independent variables (IV) such as age, gender, education, parental occupation and entrepreneurial education. Except for the IV age, all the IVs are dummy variables.

When running the logistic regression model (logit D1A expl_age expl_gen expl_edu expl_par expl_ent), I wanted to check how the model would look like for individuals who had entrepreneurial education (expl_ent=1) and who didn't had this education (expl_ent=0) by running the command:

logit D1A expl_age i.expl_gen i.expl_edu i.expl_par if expl_ent==1
logit D1A expl_age i.expl_gen i.expl_edu i.expl_par if expl_ent==0

This gives me two different models, with two different sample sizes. Now, I want to check if the $β$ for the dummy variable expl_par in the model with only individuals who had entrepreneurship education is significantly different from the $β$ for the same variable but in the model with only individuals who didn't had entrepreneurship education.

More formally: How can I check whether the coefficient of variable x in model A, is significantly different from the coefficient of variable x in model B? As far as I'm concerned a t-test would do the job, but I can't get my head around on how to do just that.

Best Answer

If you are only interested in allowing that one variable to changed based on education, you can use an interaction term between expl_par and expl_ent in a single model. Then you can just see if the p-value for the interaction term is significant. This method constrains all other variables to take the same value for each level of education, but you can add additional interaction terms if you have enough data/believe other variables should differ. I don't know Stata very well so I'm not sure of the code. Depending on the software, typically either you create a new variable that equals expl_par*expl_ent and then include that variable in the model, or you can just do the multiplication right in the model statement.

Related Question