Replicating the UNREAL algorithm described in Google DeepMind's paper "Reinforcement Learning with Unsupervised Auxiliary Tasks."
https://arxiv.org/pdf/1611.05397.pdf
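The UNREAL agent optimizes the base A3C loss together with weighted auxiliary losses for pixel control, value function replay, and reward prediction. Below is a minimal sketch of that combined objective as described in the paper; the function name, argument names, and default weights are illustrative and are not taken from this repo's code.

```python
# Minimal sketch of the UNREAL objective (illustrative names, not the repo's actual code):
# the base A3C loss plus weighted auxiliary-task losses.
def unreal_loss(a3c_loss, pixel_control_loss, value_replay_loss, reward_prediction_loss,
                lambda_pc=1.0, lambda_vr=1.0, lambda_rp=1.0):
    # L_UNREAL = L_A3C + lambda_PC * L_PC + lambda_VR * L_VR + lambda_RP * L_RP
    return (a3c_loss
            + lambda_pc * pixel_control_loss
            + lambda_vr * value_replay_loss
            + lambda_rp * reward_prediction_loss)
```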
Implemented with TensorFlow and the DeepMind Lab environment, using the following levels:
- seekavoid_arena_01
- stairway_to_melon
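For reference, here is a minimal sketch, assuming the standard deepmind_lab Python API (not code from this repo), of how one of these levels can be created and stepped:

```python
import numpy as np
import deepmind_lab

# Create the level with 84x84 RGB observations (config values must be strings).
env = deepmind_lab.Lab(
    "seekavoid_arena_01",      # or "stairway_to_melon"
    ["RGB_INTERLACED"],
    config={"width": "84", "height": "84"})

env.reset()
action = np.zeros([7], dtype=np.intc)          # 7-dimensional discrete action vector
reward = env.step(action, num_steps=4)         # repeat the action for 4 frames
frame = env.observations()["RGB_INTERLACED"]   # HxWx3 uint8 frame
```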
All weights of the convolution layers and the LSTM layer are shared between the base A3C network and the auxiliary task networks, as sketched below.
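A minimal sketch of that sharing pattern, assuming TF 1.x variable scopes; the scope and variable names are illustrative, not the repo's actual code:

```python
import tensorflow as tf

def shared_conv(frames, reuse):
    # Conv weights live in a single variable scope, so the base A3C network and the
    # auxiliary-task networks (pixel control, reward prediction, value replay) all
    # operate on the same parameters; the LSTM layer is shared the same way.
    with tf.variable_scope("shared_torso", reuse=reuse):
        w1 = tf.get_variable("conv1_w", [8, 8, 3, 16])
        b1 = tf.get_variable("conv1_b", [16], initializer=tf.zeros_initializer())
        h1 = tf.nn.relu(tf.nn.conv2d(frames, w1, [1, 4, 4, 1], "VALID") + b1)
        w2 = tf.get_variable("conv2_w", [4, 4, 16, 32])
        b2 = tf.get_variable("conv2_b", [32], initializer=tf.zeros_initializer())
        return tf.nn.relu(tf.nn.conv2d(h1, w2, [1, 2, 2, 1], "VALID") + b2)

frames = tf.placeholder(tf.float32, [None, 84, 84, 3])
base_features = shared_conv(frames, reuse=False)  # first call creates the variables
aux_features = shared_conv(frames, reuse=True)    # later calls reuse the same weights
```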
- TensorFlow (Tested with r1.0)
- DeepMind Lab
- numpy
- cv2
- pygame
- matplotlib
Score plot of the DeepMind Lab "seekavoid_arena_01" environment.
First, download and install DeepMind Lab:
$ git clone https://github.com/deepmind/lab.git
Then build it following the build instructions: https://github.com/deepmind/lab/blob/master/docs/build.md
Clone this repo in the lab directory:
$ cd lab
$ git clone https://github.com/miyosuda/unreal.git
Add this Bazel instruction at the end of the lab/BUILD file:
package(default_visibility = ["//visibility:public"])
Then run this bazel command to start training:
$ bazel run //unreal:train --define headless=osmesa
To show the result after training, run this command:
$ bazel run //unreal:display --define headless=osmesa