Training Deep Neural Networks

With the SELU activation function, even a 100-layer-deep neural network preserves roughly a mean of 0 and a standard deviation of 1 across all layers, avoiding the exploding/vanishing gradients problem. Note that this self-normalizing property only holds under certain conditions: the inputs must be standardized, the weights initialized with LeCun normal initialization, and the network composed of plain, sequential dense layers.
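
This claim can be checked empirically. Below is a minimal NumPy sketch (not part of the course material) that pushes standardized random inputs through 100 dense layers using LeCun normal initialization and the SELU activation, then prints the activation statistics along the way. The layer width (512), batch size (1,000), and random seed are arbitrary choices for the demo; the SELU constants are the published values from Klambauer et al. (2017), "Self-Normalizing Neural Networks".

```python
import numpy as np

# SELU constants from Klambauer et al. (2017)
ALPHA = 1.6732632423543772
SCALE = 1.0507009873554805

def selu(x):
    # SELU: scale * x for x > 0, scale * alpha * (exp(x) - 1) otherwise
    return SCALE * np.where(x > 0, x, ALPHA * np.expm1(x))

rng = np.random.default_rng(42)
n_units = 512    # width of every layer (arbitrary for this demo)
n_layers = 100

# Standardized inputs: mean 0, standard deviation 1
x = rng.standard_normal((1000, n_units))

for layer in range(1, n_layers + 1):
    # LeCun normal initialization: std = sqrt(1 / fan_in)
    w = rng.normal(0.0, np.sqrt(1.0 / n_units), size=(n_units, n_units))
    x = selu(x @ w)
    if layer == 1 or layer % 20 == 0:
        print(f"layer {layer:3d}: mean={x.mean():+.3f}, std={x.std():.3f}")
```

Running this should show the mean staying near 0 and the standard deviation near 1 even at layer 100. Swapping `selu` for, say, `np.tanh` makes the activation statistics drift layer by layer, which is exactly the vanishing-signal behavior that SELU's self-normalization is designed to avoid.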
