Skip to content

Latest commit

 

History

History
15 lines (8 loc) · 485 Bytes

File metadata and controls

15 lines (8 loc) · 485 Bytes

REINFORCE

Instructions

Open REINFORCE.ipynb to see an implementation of REINFORCE (also known as Monte Carlo Policy Gradients) with OpenAI Gym's Cartpole environment.

Try to change the parameters in the notebook, to see if you can get the agent to train faster!

Results

Trained Agent