google-deepmind / constrained_optidice
☆10 · Updated 2 years ago
Alternatives and similar repositories for constrained_optidice
Users that are interested in constrained_optidice are comparing it to the libraries listed below
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆100 · Updated 11 months ago
- ☆57 · Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆65 · Updated last year
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆31 · Updated 8 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems. ☆70 · Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆35 · Updated 4 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆60 · Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆179 · Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆172 · Updated 9 months ago
- Synthetic Experience Replay ☆100 · Updated last year
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future" ☆46 · Updated 3 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆35 · Updated 2 years ago
- Author's PyTorch implementation of ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆88 · Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆35 · Updated 2 years ago
- Model-Based Offline Reinforcement Learning ☆51 · Updated 4 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆209 · Updated 11 months ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL ☆43 · Updated 11 months ago
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning ☆26 · Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ☆27 · Updated 3 years ago
- Code for NeurIPS 2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆37 · Updated 6 months ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆66 · Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆26 · Updated 2 years ago
- Implementations of SAILR, PDO, and CSC ☆31 · Updated last year
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation ☆15 · Updated 2 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning" ☆57 · Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆148 · Updated last year
- An open source benchmark for Multi Agent Reinforcement Learning ☆30 · Updated 2 years ago
- Code for FOCAL Paper Published at ICLR 2021 ☆51 · Updated last year
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆27 · Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆185 · Updated 3 years ago