MCQ - With SELU activation function Even a 100 layer deep neural network Preserves roughly mean 0 and standard deviation 1 across all layers Avoiding the exploding/vanishing gradients problem

Previous Index Next

With SELU activation function even a 100 layer deep neural network preserves roughly mean 0 and standard deviation 1 across all layers avoiding the exploding/vanishing gradients problem.

True
False

See Answer

Note - Having trouble with the assessment engine? Follow the steps listed here

Training Deep Neural Networks

XP

Loading comments...