talebolano / example_of_reinforcement_lreaning_by_pytorchLinks
一些利用pytorch编程实现的强化学习例子
☆36Updated 6 years ago
Alternatives and similar repositories for example_of_reinforcement_lreaning_by_pytorch
Users that are interested in example_of_reinforcement_lreaning_by_pytorch are comparing it to the libraries listed below
Sorting:
- RL algorithms☆142Updated 4 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 4 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆19Updated 4 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆53Updated 5 years ago
- practice☆9Updated 4 years ago
- 强化学习中纳什Qlearning 实现矩阵博弈☆30Updated 6 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆90Updated 2 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- qmix☆22Updated 5 years ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 4 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- ☆13Updated 5 years ago
- [动手学强化学习]系列,基于pytorch。☆55Updated 4 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆19Updated 3 years ago
- 天授中文文档☆58Updated 6 months ago
- A collection of multi agent environments based on OpenAI gym.☆26Updated last year
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 5 years ago
- Nash Q Learning☆31Updated 4 years ago
- 多智能体强化学习☆99Updated 6 years ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆201Updated 3 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆100Updated 3 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆86Updated 4 years ago
- ☆124Updated 3 years ago
- Deep Q Network for Multi-agent RL☆15Updated 4 years ago
- RLlib超参数详解(中文)☆18Updated 3 years ago
- The code for maddpg using pytorch☆169Updated 4 years ago
- ☆14Updated 4 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 4 months ago