MCQ - Nesterov Optimizer | Automated hands-on| CloudxLab

Training Deep Neural Networks

You are currently auditing this course.

34 / 49

In Nesterov Optimizer, the only difference from vanilla Momentum optimization is that the gradient is measured at theta + theta*m rather than at theta, where theta represents the current parameters/weights and m is the momentum.

True
False

See Answer

Previous Index Next

Training Deep Neural Networks

XP

Loading comments...