The only difference from vanilla Momentum optimization is that the gradient is measured at theta + theta*m rather than at ?.
Note - Having trouble with the assessment engine? Follow the steps listed here
No hints are availble for this assesment
Answer is not availble for this assesment
Loading comments...