MoMe36 / BranchingDQN
BranchingDQN
☆50Updated 6 years ago
Alternatives and similar repositories for BranchingDQN:
Users that are interested in BranchingDQN are comparing it to the libraries listed below
- ☆85Updated 3 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆116Updated 2 years ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆94Updated 2 years ago
- ☆75Updated 5 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆83Updated 7 years ago
- ☆40Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- Code for Weighted QMIX☆136Updated 4 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆71Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆102Updated 2 years ago
- There will be updates later☆84Updated 5 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆72Updated last year
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆21Updated 5 months ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆59Updated 5 years ago
- Multi-Objective Deep Reinforcement Learning☆45Updated 8 years ago
- ☆47Updated 5 years ago
- pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆53Updated 2 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- PyTorch implementation of Constrained Policy Optimization☆54Updated 3 years ago
- ☆96Updated 3 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆153Updated 2 years ago
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆66Updated 2 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- ☆93Updated 4 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Multi-Objective Reinforcement Learning☆271Updated 3 years ago
- Collection of OpenAI parametrized action-space environments.☆64Updated last month
- A clean and robust Pytorch implementation of SAC on discrete action space☆35Updated 6 months ago