Training Deep Neural Networks

32 / 49

In deep neural networks that don’t use Batch Normalization, the upper layers will often end up having inputs with very different scales, so using Momentum optimization helps a lot.

See Answer

Note - Having trouble with the assessment engine? Follow the steps listed here

No hints are availble for this assesment

Loading comments...