coach00 / Reinforcement-LearningLinks
coach的rl练习,主要有q_learning,sarsa,REINFORCE,DQN,Actor-Critic,DDPG,使用tensorflow2.0编写
☆9Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning
Users that are interested in Reinforcement-Learning are comparing it to the libraries listed below
Sorting:
- 强化学习面试(未完待续)☆35Updated 5 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆16Updated 3 years ago
- an implementation of ATOC☆14Updated 3 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆15Updated 5 years ago
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow☆15Updated 7 years ago
- my code for paper Parameterized-DQN☆22Updated 4 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- Deep Q Network for Multi-agent RL☆15Updated 4 years ago
- ☆14Updated 4 years ago
- Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents☆8Updated 3 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆48Updated 5 years ago
- IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DYMA-Cl, G2ANet, and MADDPG☆18Updated 3 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".☆53Updated 4 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆19Updated 3 years ago
- Implementation of Pareto Deep Q Networks in a multi-objective Gym Reinforcement Learning Environment☆16Updated 2 years ago
- Implementation of DyMA-CL, MARL algorithm☆27Updated 5 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 4 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 5 years ago
- The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where…☆27Updated 5 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆35Updated 3 years ago
- 用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境☆24Updated 7 years ago
- ☆85Updated 3 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆19Updated 7 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 4 months ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆19Updated 4 years ago
- 在PyTorch上重构multi-agent deep deterministic policy gradient(MADDPG),将https://github.com/xuemei-ye/maddpg-mpe 修改到自己电脑上可运行。因为本人笔记本没有CUDA,实验速度…☆13Updated 6 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆22Updated 2 years ago
- ☆29Updated 6 years ago
- ☆40Updated 3 years ago