microsoft / MAMBALinks
Imitation learning from multiple experts
☆12Updated 2 years ago
Alternatives and similar repositories for MAMBA
Users that are interested in MAMBA are comparing it to the libraries listed below
Sorting:
- Implements the Messenger environment and EMMA model.☆23Updated last year
- ☆42Updated 3 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- ☆10Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆14Updated last year
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Updated 4 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆25Updated 2 years ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆15Updated last year
- ☆20Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆18Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆53Updated 7 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 9 months ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 9 months ago
- A simple and easy to use implementation of the soft actor-critic algorithm.☆15Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆42Updated 7 months ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 11 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆10Updated last year
- ☆17Updated 3 years ago
- ☆32Updated 9 months ago
- ☆13Updated 10 months ago
- ☆17Updated last year