alirezakazemipour / Distributional-RLLinks
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
☆18Updated last month
Alternatives and similar repositories for Distributional-RL
Users that are interested in Distributional-RL are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆36Updated 3 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆29Updated 2 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆81Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- Collection of OpenAI parametrized action-space environments.☆65Updated 4 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆165Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆37Updated 5 months ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆25Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆39Updated 2 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆23Updated 8 months ago
- Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation☆35Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆23Updated 4 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- DecentralizedLearning☆24Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆134Updated 3 weeks ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆105Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆183Updated last year
- ☆49Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆51Updated 2 months ago
- Solving POMDP using Recurrent networks☆90Updated 5 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆53Updated 4 years ago
- Soft Actor-Critic with advanced features☆50Updated last week
- A collection of Meta-Reinforcement Learning algorithms in PyTorch☆48Updated last year
- Revisiting Discrete Gradient Estimation in MADDPG☆25Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆143Updated 6 years ago
- MADDPG in Ray/RLlib☆54Updated 5 years ago