Login using Social Account
     Continue with GoogleLogin using your credentials
In Nesterov Optimizer, the only difference from vanilla Momentum optimization is that the gradient is measured at theta + theta*m rather than at theta, where theta represents the current parameters/weights and m is the momentum.
Taking you to the next exercise in seconds...
Want to create exercises like this yourself? Click here.
No hints are availble for this assesment
Loading comments...