BY571 / D4PGLinks

PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.

☆23

Alternatives and similar repositories for D4PG

Users that are interested in D4PG are comparing it to the libraries listed below

Sorting:

ThibautTheate / Risk-Sensitive-Policy-with-Distributional-Reinforcement-Learning
Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…
☆15Updated 2 years ago
karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42Updated 11 months ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆133Updated this week
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆90Updated 2 years ago
quantumiracle / nash-dqn
Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…
☆20Updated 2 years ago
BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆45Updated 4 years ago
pairlab / d2rl
Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"
☆39Updated 4 years ago
felix-kerkhoff / DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
☆29Updated 2 years ago
danielwillemsen / MAMBPO
DecentralizedLearning
☆24Updated 2 years ago
seolhokim / DistributedRL-Pytorch-Ray
Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)
☆27Updated 3 years ago
marcbrittain / Prioritized-Sequence-Experience-Replay
Prioritized Sequence Experience Replay
☆10Updated 3 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50Updated this week
Valarzz / DLPA
☆22Updated last year
crisbodnar / pderl
Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020
☆52Updated 11 months ago
quantumiracle / MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
☆49Updated last year
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 3 months ago
yashbonde / Transformer-RL
Experiments to train transformer network to master reinforcement learning environments.
☆32Updated 4 years ago
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆85Updated last year
twni2016 / Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆32Updated 3 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆102Updated 4 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 5 months ago
alirezakazemipour / Distributional-RL
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
☆17Updated 3 weeks ago
jqueeney / geppo
Generalized Proximal Policy Optimization with Sample Reuse (GePPO)
☆24Updated last year
xtma / dsac
Distributional Soft Actor Critic
☆58Updated 5 years ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41Updated 3 years ago
Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆82Updated 2 years ago
apourchot / ERL-pytorch
Combining Evolutionary Algorithms and deep Reinforcement Learning
☆16Updated 6 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
mit-gfx / PGMORL
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
☆119Updated 4 years ago