xiaohaomao / Reinforcment-Leanring-algorithm
Implementations of classic reinforcement learning algorithms (offline/online learning, Q-learning, DQN) on the CartPole balancing game and several Atari games (CartPole, Pong, Boxing, MsPacman)
☆29Updated 6 years ago
Alternatives and similar repositories for Reinforcment-Leanring-algorithm
Users that are interested in Reinforcment-Leanring-algorithm are comparing it to the libraries listed below
- Port of an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs).☆21Updated 4 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- ☆47Updated 5 years ago
- Some MARL algorithms implemented in PyTorch☆66Updated 4 years ago
- An implementation of the policy gradient RL algorithm in PyTorch☆38Updated 4 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆310Updated 2 years ago
- Chinese edition of the OpenAI team's deep reinforcement learning tutorial☆29Updated 5 years ago
- ☆124Updated 3 years ago
- Reinforcement Learning | tensorflow implementation of DQN, Dueling DQN and Double DQN performed on Atari Breakout☆94Updated 6 years ago
- ☆166Updated last year
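- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》☆262Updated 6 years ago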
- My internship project at CASIA. 🤗☆4Updated 6 years ago
- Implements PPO-clip and PPO-penalty on Atari; described as the only open-source implementation of PPO-penalty☆56Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environments☆106Updated 6 years ago
- ☆33Updated 7 years ago
- Advantage actor-critic reinforcement learning for OpenAI Gym CartPole☆65Updated 7 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆52Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆177Updated 10 months ago
- Solve BipedalWalkerHardcore-v2 with TD3☆89Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago
- Code for Weighted QMIX☆136Updated 4 years ago
- The [Hands-on Reinforcement Learning] series, based on PyTorch.☆55Updated 4 years ago
- Some multi-agent environments from 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆128Updated 2 years ago
- Multi-agent reinforcement learning☆98Updated 6 years ago
- Implementation of benchmark RL algorithms☆467Updated 2 years ago
- Reinforcement learning interview questions (to be continued)☆35Updated 5 years ago
- Proximal Policy Optimization with Tensorflow 2.0☆31Updated 5 years ago
- Tutorial for Reinforcement Learning☆184Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Distributed RL implementation using PyTorch and Ray (Ape-X, A3C, Distributed PPO (DPPO), IMPALA)☆28Updated 2 years ago