wyjung0625 / p3sLinks

Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning

☆22

Alternatives and similar repositories for p3s

Users that are interested in p3s are comparing it to the libraries listed below

Sorting:

jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆44Updated 4 years ago
tesslerc / GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22Updated 5 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆51Updated 2 years ago
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆34Updated 6 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆27Updated 5 years ago
gyh75520 / Relational_DRL
Implementation of Relational Deep Reinforcement Learning
☆25Updated 5 years ago
wendelinboehmer / dcg
☆75Updated last year
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆161Updated 5 years ago
ying-wen / malib_deprecated
A Multi-agent Learning Framework
☆62Updated 4 years ago
tesslerc / ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…
☆46Updated 6 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆40Updated 6 years ago
ermongroup / MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
☆73Updated 2 years ago
robintyh1 / onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
☆30Updated 5 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆103Updated 4 years ago
mxu34 / mbrl-gpmm
☆26Updated 5 years ago
uoe-agents / derl
The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆27Updated 3 years ago
liuzuxin / safe-mbrl
Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method
☆66Updated 2 years ago
apexrl / bmpo
Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>
☆23Updated 2 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55Updated 6 years ago
thanard / me-trpo
☆92Updated last year
Knoxantropicen / model-based-meta-rl
Self-implemented code for Model-Based Meta-Reinforcement Learning
☆17Updated 6 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆32Updated 5 years ago
tjuHaoXiaotian / GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
☆31Updated 6 years ago
ermongroup / multiagent-gail
☆84Updated 6 years ago
Hwhitetooth / lirpg
☆61Updated 7 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28Updated 6 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago