microsoft / MAMBALinks

Imitation learning from multiple experts

☆12

Alternatives and similar repositories for MAMBA

Users that are interested in MAMBA are comparing it to the libraries listed below

Sorting:

ml-jku / helm
☆54Updated 8 months ago
epignatelli / discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆22Updated 4 years ago
brentyi / minGPT-flax
GPT implementation in Flax
☆18Updated 3 years ago
ademiadeniji / irm
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆43Updated last year
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆42Updated 2 years ago
orybkin / lexa-benchmark
☆42Updated 3 years ago
microsoft / segar
Sandbox environment for generalizable agent research
☆25Updated 2 years ago
microsoft / Intrepid
INTeractive learning via REPresentatIon Discovery
☆34Updated last year
microsoft / smart
Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"
☆53Updated last year
google-deepmind / csuite
☆44Updated 9 months ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56Updated 2 years ago
instadeepai / outer-value-function-meta-rl
Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
☆13Updated 2 years ago
quasimetric-learning / torch-quasimetric
PyTorch Package For Quasimetric Learning
☆42Updated 8 months ago
google-deepmind / active_ops
☆32Updated 11 months ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
Farama-Foundation / CrowdPlay
A web based platform for collecting human actions in reinforcement learning environments
☆30Updated last year
Kaixhin / GUDRL
Generalised UDRL
☆37Updated 3 years ago
google-deepmind / zipfian_environments
☆28Updated 2 years ago
rail-berkeley / design-bench
☆50Updated 3 years ago
gkswamy98 / pillbox
Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching.
☆21Updated 3 years ago
ruizhaogit / music
Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)
☆37Updated 4 years ago
ahjwang / messenger-emma
Implements the Messenger environment and EMMA model.
☆23Updated 2 years ago
younggyoseo / RE3
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆69Updated 3 years ago
denisyarats / proto
Proto-RL: Reinforcement Learning with Prototypical Representations
☆82Updated 3 years ago
xingchenwan / bgpbt
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆28Updated 2 years ago
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39Updated 4 years ago
sahandrez / homomorphic_policy_gradient
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆23Updated last year
mila-iqia / SGI
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆54Updated 3 years ago
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20Updated 3 years ago
hychen-naza / LEAP
☆17Updated last year