distillpub / post--understanding-rl-vision
Understanding RL vision Distill article
☆23Updated 2 years ago
Alternatives and similar repositories for post--understanding-rl-vision:
Users that are interested in post--understanding-rl-vision are comparing it to the libraries listed below
- Variational Reinforcement Learning☆16Updated 8 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- Generalised UDRL☆37Updated 2 years ago
- ☆28Updated 2 years ago
- ☆14Updated 5 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- ☆13Updated 8 months ago
- ☆16Updated 4 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago
- ☆36Updated last year
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆26Updated 9 months ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- ☆19Updated 3 years ago
- flexible meta-learning in jax☆12Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- Codes for Evolving Plastic ANNs☆13Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- ☆17Updated 3 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"☆19Updated 2 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Updated last year
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 4 years ago
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆26Updated 3 years ago
- A2C is a special case of PPO!☆20Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago