BY571 / D4PG
PyTorch implementation of D4PG that uses the state-of-the-art IQN critic instead of C51. The implementation also includes the Munchausen RL and D2RL extensions, which can be added to D4PG to improve its performance.
☆22 Updated 3 years ago
Alternatives and similar repositories for D4PG:
Users who are interested in D4PG are comparing it to the repositories listed below.
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R… ☆15 Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆83 Updated last year
- DecentralizedLearning ☆24 Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated 8 months ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala) ☆26 Updated 2 years ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent … ☆20 Updated 4 months ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆38 Updated 4 years ago
- Evolution-based Soft Actor-Critic (ESAC) ☆41 Updated 8 months ago
- using recurrent networks (LSTM) to solve POMDPs ☆35 Updated 6 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆39 Updated 4 years ago
- Collection of OpenAI parametrized action-space environments. ☆64 Updated last week
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control". ☆26 Updated 5 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆71 Updated last year
- Distributional Soft Actor Critic ☆52 Updated 4 years ago
- Prioritized Sequence Experience Replay ☆10 Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆66 Updated 2 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments ☆27 Updated 4 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic ☆40 Updated 6 years ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. ☆26 Updated 8 months ago
- Decision Transformer: A brand new Offline RL Pattern. ☆35 Updated 3 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆87 Updated 2 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918… ☆43 Updated 5 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆60 Updated 9 months ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning ☆56 Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments. ☆32 Updated 4 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement… ☆19 Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆60 Updated last year
- The implementation of LSTM-TD3. ☆77 Updated 2 years ago
- ☆20 Updated 9 months ago
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆34 Updated 3 years ago