libenfan / DQN_pong
use DQN(pytorch) to play pong
☆10Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for DQN_pong
- The code for maddpg using pytorch☆162Updated 4 years ago
- RL algorithms☆139Updated 3 years ago
- A Collection of Multi-Agent Reinforcement Learning (MARL) Resources☆208Updated 2 years ago
- ☆158Updated last year
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆179Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆82Updated last year
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where…☆23Updated 4 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆90Updated 3 years ago
- 多智能体强化学习☆83Updated 5 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆49Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆109Updated 6 months ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆53Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆111Updated last year
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆28Updated 2 years ago
- D3QN Pytorch☆52Updated 2 years ago
- pytorch实现的一些MARL算法☆64Updated 3 years ago
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.☆411Updated last year
- Tutorial for Reinforcement Learning☆172Updated 2 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆91Updated 2 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆130Updated last year
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆243Updated this week
- ☆185Updated last year
- A plotter for reinforcement learning (RL)☆207Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆55Updated 2 years ago
- Reinforcement learning☆20Updated this week
- Deep recurrent Q learning on CartPole-v1 environment☆73Updated 9 months ago
- Lightweight version of MAPPO to help you quickly migrate to your local environment.☆498Updated last month
- ☆55Updated 10 months ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆191Updated 5 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆129Updated 10 months ago