microsoft / MAMBALinks
Imitation learning from multiple experts
☆12Updated 2 years ago
Alternatives and similar repositories for MAMBA
Users that are interested in MAMBA are comparing it to the libraries listed below
Sorting:
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆25Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- Implements the Messenger environment and EMMA model.☆23Updated 2 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- ☆19Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- ☆42Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 7 months ago
- Generalised UDRL☆37Updated 3 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Updated 4 years ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- INTeractive learning via REPresentatIon Discovery☆34Updated last year
- ☆10Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆16Updated last year
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"☆53Updated last year
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- ☆20Updated 2 years ago
- ☆32Updated 10 months ago
- ☆31Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- ☆15Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆54Updated 7 months ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆33Updated 2 years ago