distillpub / post--understanding-rl-vision
Distill article "Understanding RL Vision"
☆23 Updated 2 years ago
Alternatives and similar repositories for post--understanding-rl-vision
Users interested in post--understanding-rl-vision are comparing it to the repositories listed below:
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆21 Updated 4 years ago
- ☆14 Updated 5 years ago
- Variational Reinforcement Learning ☆16 Updated 7 months ago
- A first bare-bones parallelized implementation of Go Explore as described by the Uber Engineering blog post ☆46 Updated 6 years ago
- ☆28 Updated 2 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- Reward Learning by Simulating the Past ☆44 Updated 5 years ago
- Automatically generate simple meta-learning tasks from a very large space ☆15 Updated last year
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 Updated 4 years ago
- ☆17 Updated 3 years ago
- A framework for implementing equivariant DL ☆10 Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch ☆17 Updated 2 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch ☆26 Updated 3 years ago
- Official repository for the paper "Goal-Conditioned Generators of Deep Policies" ☆11 Updated 2 years ago
- ☆18 Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 Updated 4 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles. ☆26 Updated 4 years ago
- ☆19 Updated 3 years ago
- Gym wrapper for pysc2 ☆10 Updated 2 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic… ☆11 Updated 4 years ago
- Causal Analysis of Agent Behavior for AI Safety ☆17 Updated last year
- ☆16 Updated 4 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 Updated 6 years ago
- A web based platform for collecting human actions in reinforcement learning environments ☆28 Updated last year
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning" ☆19 Updated 2 years ago
- Code for Deep Reinforcement and InfoMax Learning (NeurIPS 2020) ☆10 Updated 4 years ago
- TaskMet: Task-driven Metric Learning for Model Learning ☆19 Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 Updated last year
- ☆31 Updated 2 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration" ☆21 Updated 7 years ago