bkj / pbt
Population Based Training, Figure 2
☆25Updated 7 years ago
Alternatives and similar repositories for pbt:
Users that are interested in pbt are comparing it to the libraries listed below
- ☆56Updated 6 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 7 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Updated 7 years ago
- Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer☆54Updated 6 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- Tensorflow implementation of Wasserstein GAN - arxiv: https://arxiv.org/abs/1701.07875☆21Updated 7 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- ICML 2017 accepted papers on arXiv.org☆17Updated 7 years ago
- Population Based Training (in PyTorch with sqlite3). Status: Unsupported☆166Updated 7 years ago
- Distributed A3C☆34Updated 7 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆30Updated 7 years ago
- Atari gauntlet for RL agents☆29Updated 7 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Updated 7 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 8 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 7 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 5 years ago
- A pytorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta)☆59Updated 7 years ago
- Deep reinforcement learning in ViZDoom (using Tensorflow)☆19Updated 7 years ago
- Cleaned original source code from my NIPS publication☆154Updated 7 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆54Updated 7 years ago
- Tensorflow Implementation of GAN modeling for sequential data☆68Updated 7 years ago
- Professor Forcing, NIPS'16☆46Updated 7 years ago
- ☆38Updated 7 years ago
- ☆25Updated 7 years ago
- ☆53Updated 7 years ago
- Easy TensorFlow logging for quick prototypes☆110Updated 3 years ago
- Helpful files for Visual Doom AI Competition 2017☆44Updated 6 years ago
- An implementation of zoneout regularizer on LSTM-RNN by Tensorflow☆24Updated 7 years ago