dsbrown1331 / CoRL2019-DREX
Code and project page for the D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrations", presented at CoRL 2019.
☆49 · Updated last year
Related projects
Alternatives and complementary repositories for CoRL2019-DREX
- A collection of manipulation tasks with the Fetch robot · ☆21 · Updated 3 years ago
- Source files to replicate experiments in my ICLR 2022 paper. · ☆62 · Updated 4 months ago
- Official codebase for LEAP: Planning with Goal Conditioned Policies · ☆50 · Updated 2 years ago
- Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21 · ☆23 · Updated 3 years ago
- Accompanying code for the NeurIPS submission "Goal-conditioned Imitation Learning" · ☆67 · Updated last year
- Residual policy learning · ☆58 · Updated 5 years ago
- Advantage weighted Actor Critic for Offline RL · ☆47 · Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] · ☆34 · Updated 2 years ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020) · ☆75 · Updated 11 months ago
- [NeurIPS 2021] PyTorch Code for Accelerating Robotic Reinforcement Learning with Parameterized Action Primitives · ☆72 · Updated 2 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] · ☆48 · Updated 3 years ago
- EARL: Environment for Autonomous Reinforcement Learning · ☆34 · Updated last year
- Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments (CoRL 2020) · ☆72 · Updated last year
- Official release of CompoSuite, a compositional RL benchmark · ☆46 · Updated 9 months ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020) · ☆43 · Updated 2 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" · ☆28 · Updated 5 years ago
- Behavior cloning from observation · ☆35 · Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation · ☆25 · Updated last year
- A standalone library to randomize various OpenAI Gym Environments · ☆60 · Updated 5 years ago
- Simple maze environments using mujoco-py · ☆52 · Updated 10 months ago
- Official repository for the paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) · ☆35 · Updated last year