fiberleif / POfD
Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.
☆14 Updated 5 years ago
Alternatives and similar repositories for POfD
Users interested in POfD are comparing it to the repositories listed below
- behavior cloning from observation ☆34 Updated 4 years ago
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:… ☆19 Updated 3 years ago
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight) ☆31 Updated 5 years ago
- ☆14 Updated 5 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch ☆33 Updated 4 years ago
- CORRO code ☆35 Updated 2 years ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021 ☆34 Updated 2 years ago
- ☆55 Updated 2 years ago
- Assignments for CS294-112. ☆30 Updated 5 years ago
- PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot… ☆41 Updated last year
- Advantage weighted Actor Critic for Offline RL ☆50 Updated 2 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆68 Updated last year
- DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU. ☆33 Updated 3 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ… ☆31 Updated 2 years ago
- ☆49 Updated 3 years ago
- ☆17 Updated 3 years ago
- ☆39 Updated 2 years ago
- ☆44 Updated 4 years ago
- Reading List ☆33 Updated last year
- curriculum ☆24 Updated 2 years ago
- ☆31 Updated 4 years ago
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules. ☆16 Updated 3 years ago
- ☆38 Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆62 Updated 11 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆85 Updated 2 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code ☆17 Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆40 Updated 4 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆31 Updated 2 years ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable"). ☆31 Updated last year