google-deepmind / constrained_optidice
☆10 · Updated 2 years ago
Alternatives and similar repositories for constrained_optidice
Users who are interested in constrained_optidice are comparing it to the libraries listed below:
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆95 · Updated 10 months ago
- ☆56 · Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆64 · Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆28 · Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems. ☆70 · Updated 2 years ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat… ☆38 · Updated last year
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆28 · Updated 2 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning ☆35 · Updated 2 years ago
- PyTorch Implementation of COPA for coordinating teams that can dynamically change. ☆21 · Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆57 · Updated 2 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data". ☆43 · Updated 8 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆37 · Updated 4 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆144 · Updated last year
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆32 · Updated 7 months ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS… ☆74 · Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO). ☆86 · Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 · Updated 2 years ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22) ☆22 · Updated 3 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future" ☆45 · Updated 3 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆27 · Updated 3 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 · Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆170 · Updated 8 months ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation ☆15 · Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆57 · Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆170 · Updated 3 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆53 · Updated 3 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020 ☆45 · Updated last year
- An open source benchmark for Multi Agent Reinforcement Learning ☆30 · Updated 2 years ago
- MATE: the Multi-Agent Tracking Environment. ☆45 · Updated 2 years ago
- Conservative Q Learning on top of SAC ☆132 · Updated 2 years ago