Solved – Education in years, ordinal or continuous variable

ordinal-dataregression

enter image description here

I am struggling with figuring out whether the variable "Education in years" should be used as an ordinal variable or as a continuous variable, or would either work? I haven't coded the data myself.

What I would like to do with the 'education' variable is to run a linear regression with it as a dependent variable. My independent variables are gender, age, occupation etc. Would this be possible with the education data I have? What would you suggest?

Best Answer

The first thing I would do is check out how the variable was coded, even though you didn't do it yourself, and to check out if education in actual years is available. I also wonder what 1 year of education means (probably it means "less than high school" but ... check it out to be sure).

Next, I'm wondering about some of your proposed independent variables. Age and gender make sense, but education can't be dependent on occupation (unless you mean parents' education).

As to your actual question, I think education here is best treated as ordinal or maybe even multinomial. There are various models for ordinal dependent variables, but by far the most common is ordinal logistic regression which depends on the assumption of proportional odds. That seems likely to be violated.

Related Question