Reinforcement Learning for a Stochastic Ball Merging Game: Daxigua

Introduction

This project re-implements a mobile game, Compound Big Watermelon (Daxigua), which has recently become very popular in China, and trains agents to learn to play it.

In this game, the player decides where to drop each new ball so as to remove as many balls as possible. A ball is removed when it collides with another ball of the same level. The game ends when the balls pile up above the end line.
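The rules above can be illustrated with a minimal sketch. It is not taken from the project's Ball.py or Game.py: the class `SketchBall`, the functions `try_merge` and `is_game_over`, the radius growth factor, and the maximum level are all hypothetical names and values chosen for illustration. It also assumes that two colliding balls of the same level are replaced by one ball of the next level, as in the original game.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Optional


@dataclass
class SketchBall:
    """Hypothetical ball model; not the project's Ball class."""
    x: float       # horizontal position of the centre
    y: float       # height of the centre above the floor
    radius: float
    level: int


def try_merge(a: SketchBall, b: SketchBall, max_level: int = 10) -> Optional[SketchBall]:
    """Return the merged ball if a and b touch and share a level, else None."""
    touching = hypot(a.x - b.x, a.y - b.y) <= a.radius + b.radius
    if a.level != b.level or not touching or a.level >= max_level:
        return None
    # Assumption: the merged ball appears at the midpoint, one level higher,
    # with a radius scaled by an arbitrary placeholder factor.
    return SketchBall((a.x + b.x) / 2, (a.y + b.y) / 2, a.radius * 1.25, a.level + 1)


def is_game_over(balls: List[SketchBall], end_line_y: float) -> bool:
    """The game ends once any ball reaches above the end line."""
    return any(ball.y + ball.radius > end_line_y for ball in balls)
```

In a full simulation, a check like `try_merge` would run for each colliding pair after every physics step, and `is_game_over` once the balls have settled.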

The original game can be played online: Compound Big Watermelon

The original source code of the game: Repository (daxigua)

How to Run

We mainly use three methods to learn the policy. Each can be run as follows:

Policy Gradient: run python run_policy_gradient.py in your terminal.
Policy Search - (1+1)-SA-ES: open the Policy_Search_Agent_SAES.ipynb notebook and run the cells in order.
Policy Search - CEM: open the Policy_Search_Agent_CEM.ipynb notebook and run the cells in order.
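For readers unfamiliar with the two policy search methods, the sketch below shows the general idea of each as a black-box optimizer over a parameter vector theta. It is not the code in CEM_policy_search.py or the notebooks. The objective `toy_return` is a stand-in for the average game score of a policy, and all function names, hyperparameters, and the step-size learning rate `tau` are assumptions chosen for illustration.

```python
import numpy as np


def toy_return(theta):
    """Stand-in for the average game score; maximal (0) at theta = [1, -2]."""
    return -float(np.sum((theta - np.array([1.0, -2.0])) ** 2))


def cem(score, dim, iterations=50, pop_size=50, elite_frac=0.2, seed=0):
    """Cross-entropy method: refit a diagonal Gaussian to the best samples."""
    rng = np.random.default_rng(seed)
    mean, std = np.zeros(dim), np.ones(dim)
    n_elite = max(1, int(pop_size * elite_frac))
    for _ in range(iterations):
        samples = mean + std * rng.standard_normal((pop_size, dim))
        scores = np.array([score(s) for s in samples])
        elites = samples[np.argsort(scores)[-n_elite:]]  # highest scores
        mean = elites.mean(axis=0)
        std = elites.std(axis=0) + 1e-3  # small floor keeps exploration alive
    return mean


def one_plus_one_sa_es(score, dim, iterations=2000, sigma=1.0, seed=0):
    """(1+1)-ES with a self-adapted step size: one parent, one child per step."""
    rng = np.random.default_rng(seed)
    tau = 1.0 / np.sqrt(dim)  # assumed learning rate for the step size
    theta = np.zeros(dim)
    best = score(theta)
    for _ in range(iterations):
        # Mutate the step size first, then use it to mutate the parameters.
        child_sigma = sigma * np.exp(tau * rng.standard_normal())
        child = theta + child_sigma * rng.standard_normal(dim)
        child_score = score(child)
        if child_score >= best:  # the child replaces the parent only if not worse
            theta, best, sigma = child, child_score, child_sigma
    return theta


theta_cem = cem(toy_return, dim=2)
theta_es = one_plus_one_sa_es(toy_return, dim=2)
```

To apply either routine to the game, `toy_return` would be replaced by a function that plays one or more episodes with the policy parameterized by theta and returns the mean score.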

You can change the game configuration in Config.py, including the ball settings, the screen size, and so on.
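As a rough illustration of what such settings might look like, here is a hypothetical sketch. The class name `GameConfigSketch`, every field name, and every value are placeholders invented for this example and do not reflect the actual contents of Config.py. Check that file for the real names before changing anything.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class GameConfigSketch:
    """Hypothetical settings; names and values are placeholders."""
    screen_width: int = 400
    screen_height: int = 600
    end_line_y: int = 100  # placeholder position of the end line
    ball_radii: Tuple[int, ...] = (10, 15, 20, 30, 40)  # one radius per level

    def __post_init__(self):
        # Basic sanity checks so that an edited configuration fails early.
        if any(b <= a for a, b in zip(self.ball_radii, self.ball_radii[1:])):
            raise ValueError("ball radii must increase with level")
        if 2 * max(self.ball_radii) > self.screen_width:
            raise ValueError("largest ball does not fit on the screen")


default_config = GameConfigSketch()
```

Validating values at construction time is a design choice of this sketch; it makes a mistyped radius or screen size surface immediately rather than during training.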



AutomnePAN/INF581-project-daxigua
