microsoft / MAMBA
Imitation learning from multiple experts
☆12 Updated 2 years ago
Alternatives and similar repositories for MAMBA
Users that are interested in MAMBA are comparing it to the libraries listed below
- ☆54 Updated 8 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆22 Updated 4 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022) ☆43 Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 Updated 2 years ago
- ☆42 Updated 3 years ago
- Sandbox environment for generalizable agent research ☆25 Updated 2 years ago
- INTeractive learning via REPresentatIon Discovery ☆34 Updated last year
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers" ☆53 Updated last year
- ☆44 Updated 9 months ago
- Vectorization techniques for fast population-based training. ☆56 Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function ☆13 Updated 2 years ago
- PyTorch Package For Quasimetric Learning ☆42 Updated 8 months ago
- ☆32 Updated 11 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 5 years ago
- A web based platform for collecting human actions in reinforcement learning environments ☆30 Updated last year
- Generalised UDRL ☆37 Updated 3 years ago
- ☆28 Updated 2 years ago
- ☆50 Updated 3 years ago
- Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching. ☆21 Updated 3 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆37 Updated 4 years ago
- Implements the Messenger environment and EMMA model. ☆23 Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 Updated 3 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆82 Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020) ☆39 Updated 4 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆23 Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆54 Updated 3 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 Updated 3 years ago
- ☆17 Updated last year