Solved – Deep Learning for Regression

categorical datadeep learningmachine learningneural networksregression

I have dataset of 15 features and the goal is to estimate a best fitting curve (regression). Now I want to use a deep learning technique for this purpose. Now I see that there are many architectures that can be used such as CNN, DBM or auto-encoder, etc.. Which one to go for. And I could not find any examples for how to use the same. A sample code would be of great help. Thanks

P.S. Some of the features are categorical. How do we handle that as well?

Best Answer

For standard regression I recommend using Multi-Layer Perceptron (MLP) with Mean Square Error (MSE) loss function. The model can be defined in Keras as follows:

model = Sequential()
model.add(Dense(20, input_dim=13, init='normal', activation='relu'))
model.add(Dense(10, init='normal', activation='relu'))
model.add(Dense(1, init='normal'))

model.compile(loss='mean_squared_error', optimizer='adam')

See this article for a complete example. For a TensorFlow implementation, consider the following code.

Related Solutions

Solved – Why convolutional neural networks belong to deep learning

First, mind that deep learning is a buzz term. There is not even a consensus of a formal definition in the research community. A discussion of the term does not lead anywhere, really. It's just a word.

That being said, convolutional nets are deep because they rely on multiple layers of feature extraction, as you said. They extract features from the input to predict an outcome.

What you refer to is a "generative" approach, i.e. the features are used to create the observation (a picture, not a class label). That is what made deep learning popular, but it is in no way limited to that.

Solved – Using categorical features for regression problem in deep learning

In general, binary data types are used to represent membership in particular categories. Suppose you have some data in a row-column format, so that the columns are features and rows are observations. Perhaps you're interested in the relationship between height and gender. So your data look like

Height    Gender
6 ft      Male
5 ft 4 in Female

And so on. Clearly, there's not a great numerical representation of "male" and "female." But another way to look at the problem is as a question of membership, to answer the question "is this person male?" or, equivalently, "is this person female?" In this way, we can take the categories "male" and "female" and translate them into binary, numerical quantities (conventionally, $1$ and $0$). Whichever we choose to treat as 1 or 0 is irrelevant from a mathematical standpoint. Importantly, it's standard practice to expand a variable with $k$ categories into $k-1$ binary columns. This is because you don't want to make your columns linearly dependent with an intercept column of all $1$s, and make it impossible to uniquely estimate these quantities. I don't know what the particular details of your so-called "deep learning" regression are, but it probably has something like an intercept.

You've asked @Sheep

Do you know the underlying process for these kind of variables?

But this is a fundamentally unanswerable question without actually doing the analysis. Run the regression and find out!

Does it make sense to estimate a variable( my target variable) based on categorical variables?

Maybe. If the two quantities are completely unrelated -- for example, political party in power in a given territory and number of hurricanes over the Atlantic ocean -- then it probably wouldn't make sense. Instead of looking at your research as a purely rote, quantitative exercise, I would encourage you to think critically about what your data represent, and what the underlying causal mechanism is. Is there a biological process at work? Are people acting according to their self-interest? What do we know about the climate that casues hurricanes?

Best Answer

Related Solutions

Solved – Why convolutional neural networks belong to deep learning

Solved – Using categorical features for regression problem in deep learning

Related Question