ChangyWen / wolpertinger_ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.
☆66Updated 2 years ago
Alternatives and similar repositories for wolpertinger_ddpg:
Users that are interested in wolpertinger_ddpg are comparing it to the libraries listed below
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆69Updated 5 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆175Updated 6 years ago
- ☆73Updated 5 years ago
- ☆83Updated 3 years ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆91Updated last year
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…☆83Updated 7 years ago
- The code for maddpg using pytorch☆165Updated 4 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆116Updated 3 months ago
- Code for Weighted QMIX☆129Updated 4 years ago
- ☆120Updated 2 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆112Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆202Updated 5 years ago
- Multi-Objective Reinforcement Learning☆261Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆64Updated 5 years ago
- Solving POMDP using Recurrent networks☆85Updated 4 years ago
- BranchingDQN☆49Updated 6 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- There will be updates later☆84Updated 5 years ago
- DGN Code☆342Updated last year
- pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"☆51Updated 2 years ago
- The pytorch implementation of DGN on grid world and Starcraft☆137Updated 3 years ago
- ☆41Updated 5 years ago
- MADDPG in Ray/RLlib☆52Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆115Updated 2 years ago
- ☆91Updated 4 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆149Updated last year
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)☆108Updated 2 years ago