zhengsizuo / DRL_udacity
An explainable and modified version of the Udacity DRL homework
☆26Updated 4 years ago
Alternatives and similar repositories for DRL_udacity
Users that are interested in DRL_udacity are comparing it to the libraries listed below
- ☆165Updated last year
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- Detailed explanation of RLlib hyperparameters (in Chinese)☆18Updated 3 years ago
- ☆27Updated 4 years ago
- rl-papers☆47Updated 2 years ago
- Tianshou Chinese documentation☆58Updated 5 months ago
- ☆38Updated 2 years ago
- Implements PPO-clip and PPO-penalty on Atari; described as the only open-source implementation of PPO-penalty☆56Updated 6 years ago
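- A collection of resources on model-based reinforcement learning, including links to code, papers, slides, and more.☆42Updated 4 years ago
- The [Hands-on Reinforcement Learning] series, based on PyTorch.☆54Updated 3 years ago
- DQN example code from chapter 4☆43Updated 2 years ago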
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆92Updated 4 years ago
- ☆124Updated 3 years ago
- ☆42Updated 2 years ago
- Hello😜☆31Updated 4 years ago
- PyTorch implementation of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆57Updated 3 years ago
- My DRL library with TensorFlow 1.14, based on OpenAI Spinning Up☆61Updated 4 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆72Updated last year
- Simple code for reinforcement learning☆20Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆28Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”☆24Updated 2 years ago
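- Companion code for the book 多智能体即时策略对抗方法与实践 (Multi-Agent Real-Time Strategy Confrontation: Methods and Practice), by Su Jiongming, Liu Hongfu, Chen Shaofei, and Xiang Fengtao, Science Press, November 2019☆31Updated 4 years ago
- Reinforcement learning: Chinese notes and resources, mainly Python examples, progressing from basic to advanced☆102Updated 4 years ago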
- Decision Transformer: A brand new Offline RL Pattern.☆36Updated 3 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- A pack of reinforcement learning algorithms.☆84Updated 3 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆53Updated 5 years ago
- A collection of offline reinforcement learning algorithms.☆181Updated 5 months ago