sunnyswag / RL_notes_and_codes
Notes and code from studying reinforcement learning
☆10Updated 4 years ago
Alternatives and similar repositories for RL_notes_and_codes
Users that are interested in RL_notes_and_codes are comparing it to the libraries listed below
- ☆30Updated 2 years ago
- MADDPG in Ray/RLlib☆54Updated 5 years ago
- Simple Reinforcement learning tutorials☆15Updated 5 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆37Updated 3 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆28Updated 3 years ago
- Implementation of the policy gradient RL algorithm with PyTorch☆38Updated 4 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆27Updated 4 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆19Updated 3 years ago
- ☆76Updated 5 years ago
- ☆13Updated 5 years ago
- ☆41Updated 5 years ago
- DQN example code for chapter 4☆43Updated 2 years ago
- ☆8Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆141Updated 6 years ago
- PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression☆28Updated 4 years ago
- ☆33Updated 7 years ago
- ☆39Updated 2 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- Implementation of GAIL with PyTorch☆14Updated 5 years ago
- ☆166Updated last year
- Reproduces DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete, A2C, A3C☆19Updated 4 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆64Updated 5 years ago
- Deep Reinforcement Learning for Nash Equilibria☆42Updated 2 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆61Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Proximal Policy Optimization with Tensorflow 2.0☆31Updated 5 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆72Updated last year
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions☆30Updated 4 years ago