sff1019 / opirl
☆9Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for opirl
- behavior cloning from observation☆35Updated 3 years ago
- ☆31Updated 3 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆58Updated 4 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551 ) with PyTorch☆43Updated 4 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆39Updated 2 years ago
- ☆52Updated last year
- ☆34Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆25Updated last year
- Advantage weighted Actor Critic for Offline RL☆47Updated 2 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆48Updated 3 years ago
- Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603 ) with PyT…☆27Updated 4 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆13Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆49Updated last year
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆51Updated 5 months ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)☆51Updated 3 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated last year
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆50Updated 2 years ago
- ☆53Updated 3 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆34Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆62Updated 4 months ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 2 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆41Updated 5 years ago
- Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21☆23Updated 3 years ago
- Distributional Soft Actor Critic☆49Updated 4 years ago
- NeurIPS Reproducibility Challenge 2019☆20Updated 4 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆36Updated 8 months ago