REINFORCE

Open REINFORCE.ipynb to see an implementation of REINFORCE (also known as Monte Carlo Policy Gradients) with OpenAI Gym's Cartpole environment.

Try to change the parameters in the notebook, to see if you can get the agent to train faster!

Provide feedback