talebolano / example_of_reinforcement_lreaning_by_pytorch
Some reinforcement learning examples implemented in PyTorch
☆36Updated 6 years ago
Alternatives and similar repositories for example_of_reinforcement_lreaning_by_pytorch:
Users who are interested in example_of_reinforcement_lreaning_by_pytorch are comparing it to the libraries listed below.
- The [Hands-on Reinforcement Learning] (动手学强化学习) series, based on PyTorch.☆54Updated 3 years ago
- My DRL library with TensorFlow 1.14, based on OpenAI Spinning Up☆61Updated 4 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 5 years ago
- RL algorithms☆141Updated 4 years ago
- PyTorch implementation of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆57Updated 3 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆99Updated 2 years ago
- Multi-agent reinforcement learning☆92Updated 6 years ago
- Matrix games solved with Nash Q-learning in reinforcement learning☆30Updated 6 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆27Updated 5 years ago
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆29Updated 3 years ago
- Chinese documentation for Tianshou (天授)☆58Updated 4 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- A collection of multi agent environments based on OpenAI gym.☆23Updated last year
- Simple code for reinforcement learning☆19Updated 4 years ago
- Reproduces DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete, A2C, A3C☆19Updated 4 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others☆196Updated 2 years ago
- The code for MADDPG using PyTorch☆168Updated 4 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Tutorial for Reinforcement Learning☆185Updated 3 years ago
- Implementations of reinforcement learning algorithms in PyTorch☆33Updated 3 years ago
- ☆164Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020)☆59Updated 4 years ago
- practice☆9Updated 4 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆308Updated 2 years ago
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆92Updated 4 years ago
- Hello😜☆31Updated 4 years ago
- A clean and robust PyTorch implementation of PPO on continuous action space.☆143Updated 10 months ago
- ☆45Updated 5 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆81Updated 4 years ago