This project is aimed to re-implement and learn a mobile phone game, Compound Big Watermelon, which recently has been very popular in China.
In this game, we need to decide where to drop the new ball in order to remove more balls. We can remove a ball by collision with other balls of same level. The game will finish if the balls pile up over the end line.
The link to play the original game online:Compound Big Watermelon
The original code of the game: Repository(daxigua)
To learn the policy, we use mainly 3 methods, you can run the following code to implement them:
- Policy Gradient : run
python run_policy_gradient.pyin your terminal. - Policy Search - (1+1)-SA-ES: open the
Policy_Search_Agent_SAES.ipynbfile and run codes according to the order. - Policy Search - CEM: open the
Policy_Search_Agent_CEMfile and run codes according to the order.
You can change the configuration of the game in the Config.py file, including the balls setting, the screen size and so on.
