hari-sikchi / AWAC
Advantage weighted Actor Critic for Offline RL
☆47Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for AWAC
- Source files to replicate experiments in my ICLR 2022 paper.☆62Updated 4 months ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆48Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆49Updated last year
- ☆52Updated last year
- ☆53Updated 3 years ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆35Updated last year
- Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21☆23Updated 3 years ago
- behavior cloning from observation☆35Updated 3 years ago
- ☆17Updated 2 years ago
- ☆31Updated 3 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 2 years ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- ☆22Updated 9 months ago
- RL Algorithms for Visual Continuous Control☆32Updated last year
- Skill-based Model-based Reinforcement Learning (CoRL 2022)☆52Updated 2 years ago
- A collection of manipulation tasks with the fetch robot☆21Updated 3 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551 ) with PyTorch☆43Updated 4 years ago
- PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot…☆37Updated last year
- EARL: Environment for Autonomous Reinforcement Learning☆34Updated last year
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆22Updated 2 weeks ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆50Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated last year
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆33Updated last year
- Official release of CompoSuite, a compositional RL benchmark☆46Updated 9 months ago
- ☆54Updated 8 months ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆26Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- ☆21Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning