Distributed Implementation of Evolution Strategies as a Scalable Alternative to Reinforcement Learning (ES): https://arxiv.org/abs/1703.03864. In particular, The policy structure can be extended to deep neural network from linear struture (https://github.com/modestyachts/ARS) in this implementation.