ARVILab / open_alpha_zero
☆26Updated 5 years ago
Alternatives and similar repositories for open_alpha_zero:
Users that are interested in open_alpha_zero are comparing it to the libraries listed below
- ☆50Updated 6 years ago
- Deep Reinforcement Learning library for humans☆299Updated 7 years ago
- RL experiments☆70Updated 2 years ago
- A reinforcement learning framework☆154Updated 6 years ago
- Simple Reinforcement Learning Framework☆34Updated 7 years ago
- Yandex SDA classes on deep learning. Version of year 2017☆116Updated 7 years ago
- Direct Future Prediction (DFP ) in Keras☆109Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Updated 6 years ago
- ☆77Updated 7 years ago
- ☆66Updated 7 years ago
- An implementation of the ideas from this paper https://arxiv.org/pdf/1803.10122.pdf☆283Updated 2 years ago
- ☆15Updated 2 years ago
- Deep Q-networks made easy☆15Updated 6 years ago
- My code for Telstra Network Disruptions Kaggle competition☆74Updated 8 years ago
- UDScourse for Kyiv students☆41Updated 6 years ago
- Publicly releasable baselines for the Retro contest☆127Updated 6 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Updated 6 years ago
- ☆23Updated 5 years ago
- ☆18Updated 6 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆374Updated 2 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 9 years ago
- Keras implementation of DQN on ViZDoom environment☆53Updated 8 years ago
- ☆107Updated 6 years ago
- Full World Models Implementation in Chainer☆165Updated 6 years ago
- Sberbank Holdem Challenge 2017. Хакатон по написанию игровых ботов на основе машинного обучения.☆28Updated 6 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated last year
- Wrapper for OpenAI Retro envs for parallel execution☆27Updated 6 years ago
- Easy TensorFlow logging for quick prototypes☆110Updated 3 years ago