microsoft / MAMBA
Imitation learning from multiple experts
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for MAMBA
- Sandbox environment for generalizable agent research☆23Updated 2 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆17Updated 3 months ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆18Updated 3 months ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆21Updated 3 years ago
- Vectorization techniques for fast population-based training.☆54Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆24Updated last year
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- ☆17Updated last year
- Code repository accompanying the Heuristic Guided RL NeurIPS'21 paper☆12Updated 2 years ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- ☆17Updated 5 months ago
- ☆42Updated 2 years ago
- INTeractive learning via REPresentatIon Discovery☆34Updated 5 months ago
- PyTorch Package For Quasimetric Learning☆42Updated 3 weeks ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆29Updated 3 months ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆43Updated 10 months ago
- A PyTorch Implementation of Skipper☆20Updated last month
- ☆41Updated last month
- ☆29Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆16Updated last year
- ☆30Updated 3 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 4 months ago
- GPT implementation in Flax☆18Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆11Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆11Updated last year
- My Body Is A Cage☆38Updated 3 years ago