distillpub / post--understanding-rl-visionLinks
Understanding RL vision Distill article
☆24Updated 2 years ago
Alternatives and similar repositories for post--understanding-rl-vision
Users that are interested in post--understanding-rl-vision are comparing it to the libraries listed below
Sorting:
- Generalised UDRL☆37Updated 3 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- Variational Reinforcement Learning☆16Updated last year
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Updated last year
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 5 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆64Updated last year
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- Reward Learning by Simulating the Past☆45Updated 6 years ago
- ☆28Updated 3 years ago
- ☆19Updated 4 years ago
- ☆38Updated last year
- Made for a reading group at the Center for Safe AGI.☆12Updated 2 years ago
- ☆31Updated 6 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Updated 5 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Updated last year
- Using Rainbow implementation in Chainer RL for Slime Volleyball Pixel Environment☆23Updated 5 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Updated 2 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆22Updated 3 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- ☆16Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆20Updated 4 years ago
- ☆23Updated 3 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Updated 4 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 4 years ago