
Keras Activation Functions Tanh Vs Sigmoid

I have an LSTM that uses binary data, i.e. the labels are all 0s or 1s.

This would lead me to use a sigmoid activation function, but when I do, the model significantly underperforms the same model with a tanh activation function on the same data.

Why would a tanh activation function produce better accuracy even though the data is not in the (-1, 1) output range of tanh?

Sigmoid activation function accuracy: Training: 60.32 %, Validation: 72.98 %

Tanh activation function accuracy: Training: 83.41 %, Validation: 82.82 %

All the rest of the code is exactly the same.

Thanks.

Convergence is usually faster if the average of each input variable over the training set is close to zero, and tanh has zero-mean output, whereas sigmoid's output is always positive. It's likely your data is normalized and has a mean near zero, so zero-centered activations propagate better through the hidden layers.
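To illustrate the zero-centering point, here is a minimal standalone sketch (plain Python, no Keras required) comparing the mean output of the two activations over inputs symmetric around zero:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Symmetric, zero-mean inputs in [-5, 5], mimicking normalized data.
xs = [x / 10.0 for x in range(-50, 51)]

sig_mean = sum(sigmoid(x) for x in xs) / len(xs)
tanh_mean = sum(math.tanh(x) for x in xs) / len(xs)

print(round(sig_mean, 3))   # 0.5: sigmoid outputs are biased positive
print(round(tanh_mean, 3))  # 0.0: tanh outputs stay zero-centered
```

Since sigmoid(x) + sigmoid(-x) = 1 and tanh is an odd function, the sigmoid outputs average exactly 0.5 while the tanh outputs average exactly 0 on symmetric inputs; successive layers fed sigmoid outputs therefore receive inputs with a positive mean.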

Ref: https://medium.com/analytics-vidhya/activation-functions-why-tanh-outperforms-logistic-sigmoid-3f26469ac0d1

Another factor is gradient magnitude: sigmoid's derivative is at most 0.25, while tanh's derivative reaches 1.0, so gradients shrink less per layer (and per time step in an LSTM) with tanh. When gradients are already diminishing, the stronger tanh gradients tend to help training.
