BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.
☆22Updated 3 years ago
Alternatives and similar repositories for D4PG:
Users that are interested in D4PG are comparing it to the libraries listed below
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated 2 years ago
- DecentralizedLearning☆24Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆83Updated last year
- Distributional Soft Actor Critic☆52Updated 4 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆41Updated 8 months ago
- ☆9Updated 5 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆21Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 8 months ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆39Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆48Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆56Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆60Updated 9 months ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆26Updated 5 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 6 years ago
- Collection of OpenAI parametrized action-space environments.☆64Updated 2 weeks ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆27Updated 4 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆43Updated 5 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆60Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆53Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 3 weeks ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆30Updated 3 years ago
- Implementation of Off Policy Adversarial Inverse Reinforcement Learning☆22Updated 4 years ago