AntonOsika / agz
AlphaGo Zero Reimplementation. MCTS Self Play library.
☆25 · Updated 2 years ago
Alternatives and similar repositories for agz:
Users who are interested in agz are comparing it to the repositories listed below:
- A collection of code investigating the use of information theory for abstractions in RL ☆16 · Updated 6 years ago
- Reinforcement learning algorithms to play Poker ☆15 · Updated 3 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 · Updated 7 years ago
- Logarithmic Reinforcement Learning ☆26 · Updated last year
- Implementation of restricted Boltzmann machine, deep Boltzmann machine, deep belief network, and deep restricted Boltzmann network models… ☆13 · Updated 4 years ago
- ☆18 · Updated 6 years ago
- Code implementation of: "Graying the black box: Understanding DQNs" ☆20 · Updated 8 years ago
- The Machine Learning Toybox for testing the behavior of autonomous agents. ☆27 · Updated 2 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆30 · Updated 4 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post ☆46 · Updated 6 years ago
- ☆19 · Updated 3 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning ☆20 · Updated 8 years ago
- A simulation environment for the creation and observation of ML models based on PyTorch ☆9 · Updated 6 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym. ☆30 · Updated 6 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆21 · Updated 4 years ago
- Source code for "A deep dive into reinforcement learning" ☆12 · Updated 5 years ago
- Multi-agent active perception with prediction rewards ☆11 · Updated 4 years ago
- Neuronal Circuit Policies ☆40 · Updated 2 years ago
- Training (hopefully) safe agents in gridworlds ☆25 · Updated 5 years ago
- Code for "Meta-Learning Priors for Efficient Online Bayesian Regression" by James Harrison, Apoorva Sharma, and Marco Pavone ☆54 · Updated 2 years ago
- Keras implementation of Curiosity-driven Exploration by Self-supervised Prediction ☆8 · Updated 7 years ago
- ☆21 · Updated 4 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆146 · Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆94 · Updated 6 years ago
- The Tensorflow code and a DeepMind Lab wrapper for my article "Meta-Reinforcement Learning" on FloydHub. ☆37 · Updated 6 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch. ☆12 · Updated 7 years ago
- Imagination Augmented Agents TensorFlow ☆26 · Updated 4 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆31 · Updated 5 years ago
- Gym wrapper for Vizdoom environments ☆12 · Updated 6 years ago
- Fictitious Self-play & Reinforcement Learning ☆18 · Updated 7 years ago