google-deepmind / constrained_optidice
☆8 Updated 2 years ago
Related projects:
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆23 Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆65 Updated last week
- ☆51 Updated last year
- ☆46 Updated last year
- Simple maze environments using mujoco-py ☆52 Updated 8 months ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆59 Updated 2 months ago
- Code for "On Uncertainty in Deep State Space Models for Model-Based Reinforcement Learning" (TMLR, 2022) ☆13 Updated last year
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) ☆34 Updated 11 months ago
- ☆30 Updated last month
- ☆29 Updated 3 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆29 Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆71 Updated 10 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆21 Updated last year
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021) ☆17 Updated 3 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆33 Updated 3 years ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆31 Updated 6 months ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023) ☆9 Updated last year
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future" ☆39 Updated 2 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat… ☆49 Updated last year
- ☆12 Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆45 Updated 3 months ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 Updated last year
- CORRO code ☆33 Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022) ☆32 Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆49 Updated 11 months ago
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3 ☆24 Updated 8 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆108 Updated last year
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022) ☆24 Updated 2 years ago
- This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal R… ☆30 Updated last year
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation ☆13 Updated last year