BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.
☆18Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for D4PG
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated last year
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆42Updated 4 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆37Updated 3 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆26Updated 4 years ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆132Updated 3 months ago
- DecentralizedLearning☆21Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆36Updated 3 weeks ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆39Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆51Updated 5 months ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆33Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆20Updated last year
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- Experiments with transformer based RL algorithms☆22Updated 4 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆79Updated last year
- ☆21Updated 7 months ago
- This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor…☆25Updated last year
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…