Scaling variables without hindering binary values of the data set

I am trying to prepare a data set to train an ANN model, so I need to apply scaling. However, some of my variables are continuous and some are already in binary form. Below is an example of what a given row from my X_train data set looks like:

array([[0.0, 1.0, 654, 1, 40, 5, 105683.63, 1, 1, 0, 173617.09]])

I have applied the following code to normalize my values:

from sklearn.preprocessing import StandardScaler

sc = StandardScaler()
X_train = sc.fit_transform(X_train)

However, this returns an array in which the binary values have been scaled as well. Is there a way to avoid this?

Thank you in advance!

You should use a Pipeline with a ColumnTransformer for mixed types. Here is a good example of how to apply different preprocessing and feature extraction pipelines to different subsets of features.
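A minimal sketch of the idea: pass a `ColumnTransformer` only the column indices of the continuous features and set `remainder="passthrough"` so the binary columns are left untouched. The index list below is an assumption based on the example row in the question (columns 2, 4, 5, 6, and 10 look continuous); adjust it to match your actual data.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler

# Two example rows shaped like the one in the question (second row is made up)
X_train = np.array([
    [0.0, 1.0, 654, 1, 40, 5, 105683.63, 1, 1, 0, 173617.09],
    [1.0, 0.0, 720, 0, 35, 2,  80000.00, 0, 1, 1,  50000.00],
])

# Assumed indices of the continuous columns; the rest are binary
continuous_cols = [2, 4, 5, 6, 10]

ct = ColumnTransformer(
    transformers=[("scale", StandardScaler(), continuous_cols)],
    remainder="passthrough",  # binary columns pass through unchanged
)

X_train_scaled = ct.fit_transform(X_train)
```

Note that `ColumnTransformer` reorders the output: the transformed (scaled) columns come first, followed by the passthrough columns, so the binary features end up at the end of each row.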
