BY571 / D4PGLinks
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2RL which can be added to D4PG to improve its performance.
☆22Updated 4 years ago
Alternatives and similar repositories for D4PG
Users that are interested in D4PG are comparing it to the libraries listed below
Sorting:
- Official implementation of the algorithmic approach presented in the research paper entitled "Risk-Sensitive Policy with Distributional R…☆15Updated 2 years ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆85Updated last year
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆29Updated 2 years ago
- DecentralizedLearning☆24Updated 2 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated 10 months ago
- Distributional Soft Actor Critic☆53Updated 5 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Updated last year
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆28Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 10 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆49Updated 3 weeks ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆26Updated 5 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆41Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- Collection of OpenAI parametrized action-space environments.☆65Updated 2 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆14Updated 3 years ago
- ☆28Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 4 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆24Updated last year
- using recurrent networks(LSTM) to solve POMDPs☆35Updated 6 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- ☆74Updated last year
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆33Updated 6 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆44Updated 6 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago