jxx123 / rl-tf2Links
My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0
☆30Updated 3 years ago
Alternatives and similar repositories for rl-tf2
Users that are interested in rl-tf2 are comparing it to the libraries listed below
Sorting:
- A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm☆354Updated 4 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆104Updated 3 years ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆204Updated 3 years ago
- 多智能体强化学习☆100Updated 6 years ago
- Tutorial for Reinforcement Learning☆187Updated 3 years ago
- TD3 in Pytorch☆34Updated 3 years ago
- RL algorithms☆142Updated 4 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆83Updated 4 months ago
- implementation of MADDPG using PettingZoo and PyTorch☆154Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆91Updated 2 years ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆167Updated last year
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.☆492Updated 2 years ago
- D3QN Pytorch☆62Updated 3 years ago
- Deep Q-learning (DQN) for Multi-agent Reinforcement Learning (RL)☆348Updated 5 years ago
- Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.☆625Updated 2 years ago
- DRLib:a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.☆553Updated last year
- The code for maddpg using pytorch☆170Updated 4 years ago
- A Collection of Multi-Agent Reinforcement Learning (MARL) Resources☆247Updated 2 years ago
- Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-C…☆642Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆161Updated last year
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)