microsoft / MAMBA
Imitation learning from multiple experts
☆12 Updated 2 years ago
Alternatives and similar repositories for MAMBA:
Users who are interested in MAMBA are comparing it to the repositories listed below:
- Sandbox environment for generalizable agent research ☆24 Updated 2 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum games ☆19 Updated 5 months ago
- INTeractive learning via REPresentatIon Discovery ☆33 Updated 7 months ago
- ☆17 Updated last year
- ☆42 Updated 2 years ago
- Reinforcement Learning via Latent State Decoding ☆30 Updated last year
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery ☆13 Updated last year
- Model-Based Reinforcement Learning via Latent-Space Collocation. ☆32 Updated last year
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024) ☆12 Updated this week
- ☆11 Updated 2 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆22 Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 Updated 3 years ago
- Variational Reinforcement Learning ☆16 Updated 5 months ago
- ☆43 Updated 3 months ago
- ☆17 Updated 2 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated last year
- Causal Analysis of Agent Behavior for AI Safety ☆17 Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆20 Updated 4 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693) ☆18 Updated last year
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper ☆13 Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 4 years ago
- Repo to reproduce the First-Explore paper results ☆37 Updated 3 weeks ago
- ☆13 Updated 6 months ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆37 Updated 3 years ago
- ☆32 Updated 5 months ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆21 Updated 3 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 Updated 6 months ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 Updated 3 years ago