microsoft / MAMBA
Imitation learning from multiple experts
☆13 · Updated 3 years ago
Alternatives and similar repositories for MAMBA
Users who are interested in MAMBA are comparing it to the libraries listed below.
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆23 · Updated 5 years ago
- ☆57 · Updated last year
- GPT implementation in Flax ☆18 · Updated 4 years ago
- Sandbox environment for generalizable agent research ☆26 · Updated 3 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 · Updated 2 years ago
- Generalised UDRL ☆37 · Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆29 · Updated 3 years ago
- Vectorization techniques for fast population-based training. ☆57 · Updated 3 years ago
- PyTorch Package For Quasimetric Learning ☆45 · Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 · Updated 4 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆106 · Updated 3 years ago
- ☆42 · Updated 3 years ago
- Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers" ☆54 · Updated 2 years ago
- Repo to reproduce the First-Explore paper results ☆39 · Updated last year
- INTeractive learning via REPresentatIon Discovery ☆36 · Updated last year
- ☆31 · Updated 3 years ago
- Platform to run interactive Reinforcement Learning agents in a Minecraft Server ☆56 · Updated last year
- Public Release of Plan2vec Implementation in pyTorch ☆57 · Updated 3 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022) ☆42 · Updated 2 years ago
- Clockwork VAEs in JAX/Flax ☆32 · Updated 4 years ago
- Data-driven offline simulation for online reinforcement learning: benchmark and baselines ☆31 · Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆47 · Updated 4 years ago
- ☆33 · Updated last year
- ☆46 · Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Updated 5 years ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control. ☆162 · Updated last month
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles ☆89 · Updated 2 weeks ago
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen… ☆52 · Updated 5 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 · Updated 4 years ago