google-deepmind / symplectic-gradient-adjustmentLinks
A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"
☆153Updated 7 years ago
Alternatives and similar repositories for symplectic-gradient-adjustment
Users that are interested in symplectic-gradient-adjustment are comparing it to the libraries listed below
Sorting:
- Guided Evolutionary Strategies☆272Updated 2 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 3 years ago
- Velocity in deep-learning research☆279Updated 3 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Updated 3 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆154Updated 8 years ago
- [ICML-18] Codes for the custom games we built to compare RL agents with humans☆66Updated 7 years ago
- Code for the paper "Evolved Policy Gradients"☆253Updated 7 years ago
- Augmented environments with RL☆103Updated 6 years ago
- ☆119Updated 5 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆53Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆116Updated 6 years ago
- NIPS 2017 Value Prediction Network☆166Updated 7 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆24Updated 2 years ago
- Full World Models Implementation in Chainer☆168Updated 7 years ago
- ☆134Updated 8 years ago
- Some hard problems for reinforcement learning.☆32Updated 7 years ago
- ☆182Updated last year
- 2019 talk at GECCO☆68Updated 6 years ago
- Implementation of Spectral Inference Networks, ICLR 2019☆172Updated 6 years ago
- Original PyTorch implementation of the Leap meta-learner (https://arxiv.org/abs/1812.01054) along with code for running the Omniglot expe…☆148Updated 2 years ago
- ☆219Updated 7 years ago
- Publicly releasable baselines for the Retro contest☆129Updated 7 years ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆203Updated 5 years ago
- AI-ON Consciousness Prior☆97Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- Tensorflow Implementation of Interaction Networks for Learning about Objects, Relations and Physics☆158Updated 8 years ago
- General Game Playing with Schema Networks☆41Updated 3 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago
- A reinforcement learning framework☆157Updated 7 years ago