microsoft / MAMBA
Imitation learning from multiple experts
☆12Updated 2 years ago
Alternatives and similar repositories for MAMBA:
Users that are interested in MAMBA are comparing it to the libraries listed below
- INTeractive learning via REPresentatIon Discovery☆33Updated 10 months ago
- ☆53Updated 4 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Generalised UDRL☆37Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆16Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Updated 3 years ago
- ☆19Updated 3 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆25Updated 2 years ago
- ☆42Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆25Updated 9 months ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- ☆16Updated 3 years ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆14Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆20Updated 8 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆25Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆15Updated 2 years ago
- Sandbox environment for generalizable agent research☆24Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Updated 4 years ago
- ☆32Updated 7 months ago