uber-research / Evolvability-ES
☆14Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Evolvability-ES
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- Variational Reinforcement Learning☆16Updated 4 months ago
- Generalised UDRL☆37Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Understanding RL vision Distill article☆23Updated last year
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- ☆35Updated 6 years ago
- GPT implementation in Flax☆18Updated 2 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 2 years ago
- Code for "Learning Inductive Biases with Simple Neural Networks" (Feinman & Lake, 2018).☆21Updated 5 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- ☆18Updated 3 years ago
- ☆29Updated 2 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆27Updated 4 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆25Updated 2 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- SeqGAN but with more bells and whistles☆24Updated 6 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 5 years ago
- ☆17Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago