google-deepmind / symplectic-gradient-adjustment
A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"
☆154Updated 6 years ago
Alternatives and similar repositories for symplectic-gradient-adjustment:
Users that are interested in symplectic-gradient-adjustment are comparing it to the libraries listed below
- Guided Evolutionary Strategies☆269Updated last year
- Velocity in deep-learning research☆276Updated 2 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆374Updated 2 years ago
- NIPS 2017 Value Prediction Network☆165Updated 7 years ago
- Augmented environments with RL☆103Updated 5 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 5 years ago
- ☆117Updated 4 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆151Updated 7 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆201Updated 4 years ago
- 2019 talk at GECCO☆68Updated 5 years ago
- Easy TensorFlow logging for quick prototypes☆110Updated 3 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆188Updated 2 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆93Updated 6 years ago
- ☆133Updated 7 years ago
- Highly Modular and Scalable Reinforcement Learning☆113Updated 5 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- ☆182Updated 7 months ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆93Updated 4 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆199Updated 3 years ago
- Code for: Implicit Competitive Regularization in GANs☆114Updated 3 years ago
- Full World Models Implementation in Chainer☆165Updated 6 years ago
- Implementation of Spectral Inference Networks, ICLR 2019☆171Updated 5 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 6 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆116Updated 5 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Updated 5 years ago
- ☆85Updated 4 years ago
- ICLR Reproducibility Challenge 2019☆220Updated 5 years ago
- Optimizing control variates for black-box gradient estimation☆162Updated 5 years ago