In this video, we will understand:
what is credit assignment problem
what is policy gradients algorithm
how to implement policy gradients algorithm in training the agent, to play the CartPole game
Taking you to the next exercise in seconds...
Want to create exercises like this yourself? Click here.
Please login to comment
Be the first one to comment!