Some common reasons why the loss stops decreasing after a few epochs are:
a) The model is under-fitting: it doesn't have enough capacity to fit the training data, so the loss plateaus at a high value.
b) The learning rate is too large, so the updates overshoot the minimum and the loss oscillates or diverges instead of decreasing.
c) The initialization is improper (e.g., initializing all the weights to 0 means the gradients are zero everywhere, so the network never learns any function).
d) The regularisation hyper-parameter is too large, so the penalty term dominates the loss and pushes the weights toward zero instead of fitting the data.
e) The classic case of vanishing gradients: gradients shrink as they are back-propagated through many layers, so the early layers barely update.
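Point c) is easy to demonstrate. Here is a minimal NumPy sketch (the network shape, data, and learning rate are made up for illustration) of a two-layer ReLU network with all-zero initialization: every gradient comes out exactly zero, so the weights never move no matter how many steps you train.

```python
import numpy as np

# Tiny 2-layer ReLU network with all-zero initialization (illustrative sketch).
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3))   # 4 samples, 3 features
y = rng.normal(size=(4, 1))   # regression targets

W1 = np.zeros((3, 5))         # all-zero init: the problem case
W2 = np.zeros((5, 1))

for step in range(100):
    h = np.maximum(0, x @ W1)            # hidden activations are all zero
    pred = h @ W2
    grad_pred = 2 * (pred - y) / len(x)  # gradient of the MSE loss w.r.t. pred
    dW2 = h.T @ grad_pred                # zero, because h is zero
    dh = grad_pred @ W2.T                # zero, because W2 is zero
    dW1 = x.T @ (dh * (h > 0))           # zero as well
    W1 -= 0.1 * dW1
    W2 -= 0.1 * dW2

# Every gradient is exactly zero, so the weights never move and the loss never drops.
print(np.abs(W1).max(), np.abs(W2).max())  # → 0.0 0.0
```

Random (e.g., Xavier/He) initialization breaks this dead state by making the hidden activations, and hence the gradients, nonzero.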