Neo-X / SMiRL_CodeLinks
☆19Updated 3 years ago
Alternatives and similar repositories for SMiRL_Code
Users that are interested in SMiRL_Code are comparing it to the libraries listed below
Sorting:
- ☆58Updated 2 years ago
- Simple maze environments using mujoco-py☆57Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 2 years ago
- The Starcraft Multi-Agent challenge lite☆41Updated last year
- An open source benchmark for Multi Agent Reinforcement Learning☆30Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 4 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Updated 2 years ago
- ☆46Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆68Updated 2 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆39Updated 3 years ago
- Advantage weighted Actor Critic for Offline RL☆50Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 5 years ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆46Updated 7 months ago
- ☆17Updated 3 years ago
- ☆54Updated last year
- Conservative Q Learning on top of SAC☆132Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆30Updated 2 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆67Updated 2 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆15Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆55Updated 4 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆68Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Updated 9 months ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆48Updated 3 years ago
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)☆33Updated 5 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆81Updated 2 years ago