BY571 / D4PGLinks
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.
☆24Updated 4 years ago
Alternatives and similar repositories for D4PG
Users that are interested in D4PG are comparing it to the libraries listed below
Sorting:
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated 2 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆88Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆135Updated last month
- Collection of OpenAI parametrized action-space environments.☆66Updated 7 months ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Updated 4 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆55Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆89Updated last year
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- DecentralizedLearning☆25Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Experiments with reinforcement learning and recurrent neural networks☆115Updated 2 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆146Updated 3 years ago
- ☆26Updated last year
- Combining Evolutionary Algorithms and deep RL in various ways☆105Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- Soft Actor-Critic with advanced features☆51Updated last week
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 5 months ago
- Experiments with transformer based RL algorithms☆22Updated 5 years ago
- Decision Transformer: A brand new Offline RL Pattern.