
Why are Sigmoid or Tanh not preferred as activation functions in the hidden layers of a neural network?

1 Answer


A common problem with the Sigmoid and Tanh functions is that they saturate: for inputs with a large magnitude, the output flattens out and the gradient becomes nearly zero. Once saturated, the learning algorithm can make only tiny updates to the weights, so the model effectively stops improving. This is the vanishing gradient problem, and it is why Sigmoid and Tanh are avoided in the hidden layers of deep networks. The problem can be addressed by using the Rectified Linear Unit (ReLU) activation instead, since its gradient is 1 for all positive inputs and does not saturate there.
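As a quick illustration (a minimal NumPy sketch, not part of the original answer), the derivatives below show how the sigmoid and tanh gradients collapse toward zero for large |x|, while the ReLU gradient stays at 1 for positive inputs:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Element-wise derivatives of each activation.
def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1.0 - s)          # peaks at 0.25 when x = 0, shrinks toward 0 elsewhere

def tanh_grad(x):
    return 1.0 - np.tanh(x) ** 2  # peaks at 1 when x = 0, shrinks toward 0 elsewhere

def relu_grad(x):
    return (x > 0).astype(float)  # exactly 1 for every positive input

xs = np.array([-10.0, -2.0, 0.0, 2.0, 10.0])
print("sigmoid'(x):", sigmoid_grad(xs))  # near zero at x = -10 and x = 10
print("tanh'(x):   ", tanh_grad(xs))     # near zero at x = -10 and x = 10
print("relu'(x):   ", relu_grad(xs))     # stays at 1 for all positive x
```

Multiplying many of these near-zero gradients together during backpropagation is what makes the overall gradient vanish in deep Sigmoid/Tanh networks, whereas ReLU's constant gradient of 1 on the positive side keeps it from shrinking.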
