jxx123 / rl-tf2
My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0
☆28Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for rl-tf2
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆91Updated 2 years ago
- TD3 in Pytorch☆27Updated 2 years ago
- RL algorithms☆139Updated 3 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆39Updated 2 years ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆83Updated 8 months ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆75Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆59Updated 5 months ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆130Updated last year
- 多智能体强化学习☆83Updated 5 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆82Updated last year
- D3QN Pytorch☆51Updated 2 years ago
- my code for paper Parameterized-DQN☆20Updated 3 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆243Updated this week
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆89Updated 3 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆28Updated 2 years ago
- Use Multi-Agent Deep Deterministic Policy Gradient(DDPG) algorithm to find reasonable paths for ships☆28Updated 2 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆109Updated last year
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆49Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆109Updated 6 months ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆36Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆73Updated 9 months ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆178Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆120Updated 5 months ago
- The code for maddpg using pytorch☆162Updated 4 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆48Updated 3 years ago
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆25Updated last year
- multi-turtlebot3 collision avoidance and navigation via DDPG-LSTM with Prioritized Experience Replay on ROS☆65Updated 2 years ago
- A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm☆298Updated 3 years ago
- Source code of paper "A Hierarchical Reinforcement Learning Algorithm Based on Attention Mechanism for UAV Autonomous Navigation"☆10Updated 2 years ago
- ☆185Updated last year