google-deepmind / constrained_optidice
☆8 Updated 2 years ago
Alternatives and similar repositories for constrained_optidice:
Users who are interested in constrained_optidice are comparing it to the libraries listed below:
- 🔥 Datasets and env wrappers for offline safe reinforcement learning ☆84 Updated 4 months ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022 ☆26 Updated last year
- ☆32 Updated 5 months ago
- ☆53 Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆33 Updated 3 years ago
- ☆47 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆67 Updated 6 months ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021) ☆20 Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆34 Updated 2 years ago
- This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor… ☆26 Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆52 Updated 7 months ago
- ☆26 Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆25 Updated 2 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future" ☆43 Updated 2 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆32 Updated 2 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation ☆13 Updated last year
- ☆29 Updated 2 years ago
- Implementations of SAILR, PDO, and CSC ☆31 Updated 6 months ago
- ☆31 Updated 3 years ago
- Simple maze environments using mujoco-py ☆54 Updated last year
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL) ☆25 Updated 3 years ago
- ☆12 Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆38 Updated 2 months ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆26 Updated 2 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat… ☆49 Updated 2 years ago
- ☆45 Updated this week
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning ☆22 Updated last year
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021 ☆31 Updated 2 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning ☆20 Updated 2 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning ☆13 Updated 2 years ago