microsoft / MAMBA
Imitation learning from multiple experts
☆12 Updated 3 years ago
Alternatives and similar repositories for MAMBA
Users who are interested in MAMBA are comparing it to the libraries listed below.
- ☆55 Updated 10 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 Updated 2 years ago
- INTeractive learning via REPresentatIon Discovery ☆34 Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆22 Updated 4 years ago
- Generalised UDRL ☆37 Updated 3 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers" ☆54 Updated last year
- ☆42 Updated 3 years ago
- ☆37 Updated 2 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022) ☆42 Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 Updated 2 years ago
- Sandbox environment for generalizable agent research ☆26 Updated 3 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023) ☆32 Updated 2 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE) ☆30 Updated last year
- Causal Analysis of Agent Behavior for AI Safety ☆18 Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆59 Updated 11 months ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆22 Updated 4 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- ☆32 Updated last year
- Reinforcement learning library in JAX. ☆100 Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆105 Updated 3 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆83 Updated 3 years ago
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- ☆15 Updated 3 years ago
- PyTorch Package For Quasimetric Learning ☆42 Updated 10 months ago
- Public Release of Plan2vec Implementation in pyTorch ☆57 Updated 2 years ago
- Vectorization techniques for fast population-based training. ☆56 Updated 3 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆158 Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆66 Updated 4 years ago