fiberleif / POfDLinks

Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.

☆15

Alternatives and similar repositories for POfD

Users that are interested in POfD are comparing it to the libraries listed below

Sorting:

montaserFath / BCO
behavior cloning from observation
☆35Updated 4 years ago
PKU-RL / CORRO
CORRO code
☆35Updated 2 years ago
rohitrango / BC-regularized-GAIL
Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch
☆33Updated 4 years ago
compsciencelab / ppo_D
This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…
☆19Updated 3 years ago
trzhang0116 / HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot…
☆42Updated last year
cychai1995 / DDPGfD
DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.
☆33Updated 3 years ago
Mee321 / policy-distillation
☆14Updated 5 years ago
shlee94 / Off2OnRL
☆56Updated 2 years ago
TonghanWang / EITI-EDTI
Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)
☆33Updated 5 years ago
YangRui2015 / Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
☆88Updated 4 years ago
hari-sikchi / AWAC
Advantage weighted Actor Critic for Offline RL
☆50Updated 2 years ago
martius-lab / HiTS
Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021
☆34Updated 2 years ago
YangRui2015 / Modular_HER
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
☆16Updated 4 years ago
rraileanu / idaac
☆53Updated last year
MAS-anony / ASN
☆32Updated 2 years ago
IouJenLiu / CMAE
☆49Updated 3 years ago
feidieufo / homework
Assignments for CS294-112.
☆30Updated 5 years ago
Cranial-XIX / marl-copa
PyTorch Implementation of COPA for coordinating teams that can dynamically change.
☆21Updated 3 years ago
shariqiqbal2810 / REFIL
Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021
☆65Updated 4 years ago
apexrl / bmpo
Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>
☆23Updated 2 years ago
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆68Updated last year
illidanlab / opolo-code
☆31Updated 4 years ago
TJU-DRL-LAB / self-supervised-rl
☆39Updated 3 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆52Updated 2 years ago
PKU-RL / I2C
☆44Updated 4 years ago
jiayu-ch15 / Variational-Automatic-Curriculum-Learning
curriculum
☆25Updated 2 years ago
quantumiracle / Benchmark-Efficient-Reinforcement-Learning-with-Demonstrations
Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…
☆31Updated 2 years ago
Haichao-Zhang / PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
☆57Updated 2 years ago
morning9393 / Optimal-Baseline-for-Multi-agent-Policy-Gradients
☆28Updated 3 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago