Training Deep Neural Networks

With the SELU activation function, even a 100-layer deep neural network preserves a mean of roughly 0 and a standard deviation of roughly 1 across all layers, which avoids the exploding/vanishing gradients problem.
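
The statement can be checked numerically with a small simulation. The sketch below is illustrative only and not part of the original question. It assumes a plain stack of dense layers without biases, standardized inputs, and LeCun normal weight initialization (variance 1 / fan_in), the conditions under which SELU is usually described as self-normalizing. The names `selu` and `simulate`, the layer width, and the sample count are hypothetical choices.

```python
import numpy as np

# Standard SELU constants (alpha and scale), rounded as commonly published.
ALPHA = 1.6732632423543772
SCALE = 1.0507009873554805


def selu(z):
    """SELU: scale * z for z > 0, scale * alpha * (exp(z) - 1) otherwise."""
    # np.minimum keeps expm1 away from large positive inputs, whose
    # result np.where would discard anyway.
    return SCALE * np.where(z > 0, z, ALPHA * np.expm1(np.minimum(z, 0.0)))


def simulate(n_layers=100, width=100, n_samples=500, seed=42):
    """Push standardized inputs through n_layers dense SELU layers.

    Assumptions (not from the original text): no biases, every layer has
    the same width, and weights use LeCun normal init (std = sqrt(1/fan_in)).
    Returns a list of (mean, std) of the activations after each layer.
    """
    rng = np.random.default_rng(seed)
    z = rng.standard_normal((n_samples, width))  # inputs: mean 0, std 1
    stats = []
    for _ in range(n_layers):
        w = rng.normal(scale=np.sqrt(1.0 / width), size=(width, width))
        z = selu(z @ w)
        stats.append((float(z.mean()), float(z.std())))
    return stats


if __name__ == "__main__":
    stats = simulate()
    for layer in (0, 24, 49, 74, 99):
        mean, std = stats[layer]
        print(f"layer {layer + 1:3d}: mean {mean:+.2f}, std {std:.2f}")
```

Running the script prints the activation mean and standard deviation at a few depths, which should stay near 0 and 1 under the stated assumptions. Swapping in a different initialization or unstandardized inputs is a quick way to see where the property stops holding.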
