jw1401 / PPO-Tensorflow-2.0
Proximal Policy Optimization with TensorFlow 2.0
☆30 Updated 4 years ago
Related projects:
- ☆70 Updated 4 years ago
- Implementations of all kinds of DQN reinforcement learning with PyTorch ☆86 Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆63 Updated 5 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data. ☆98 Updated 3 years ago
- MADDPG in Ray/RLlib ☆50 Updated 4 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆110 Updated 2 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO). ☆62 Updated 6 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆91 Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆92 Updated 4 years ago
- A collection of resources on model-based reinforcement learning, including code links, papers, slides, and more. ☆34 Updated 4 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic. ☆27 Updated 3 years ago
- ☆39 Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆52 Updated 4 years ago
- There will be updates later ☆79 Updated 5 years ago
- PyTorch implementation of Deep Reinforcement Algorithm ☆30 Updated 2 years ago
- Scalable multi-agent reinforcement learning ☆53 Updated 6 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆128 Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆44 Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆25 Updated 4 years ago
- An implementation of CommNet ☆29 Updated 6 years ago
- Code for Weighted QMIX ☆119 Updated 3 years ago
- ☆41 Updated 5 years ago
- A collection of offline reinforcement learning algorithms. ☆153 Updated 3 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆87 Updated 3 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning ☆54 Updated 2 years ago
- A clean and robust PyTorch implementation of PPO on continuous action space. ☆115 Updated 3 months ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment ☆101 Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated last month
- Uses TensorFlow to implement MADDPG (simple_tag) ☆17 Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆98 Updated 3 years ago