zhengsizuo / DRL_udacity
A explaintable and modified version of udacity DRL homework
☆26Updated 4 years ago
Alternatives and similar repositories for DRL_udacity:
Users that are interested in DRL_udacity are comparing it to the libraries listed below
- ☆159Updated last year
- ☆38Updated 2 years ago
- rl-papers☆47Updated last year
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆56Updated 3 years ago
- ☆41Updated last month
- A pack of reinforcement learning algorithms.☆82Updated 3 years ago
- reinforcement learning☆45Updated 4 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆69Updated last year
- ☆41Updated 2 years ago
- 多智能体即时策略对抗方法与实践 苏炯铭 刘鸿福 陈少飞 项凤涛 编著 科学出版社 2019.11 随书代码☆32Updated 4 years ago
- DQN examples codes in chapter 4☆42Updated last year
- ☆122Updated 3 years ago
- RL algorithms☆140Updated 3 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- RLlib超参数详解(中文)☆16Updated 2 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆92Updated 4 years ago
- 天授中文文档☆55Updated last month
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆83Updated last year
- simple code to reinforcement learning☆19Updated 4 years ago
- ☆76Updated 3 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆55Updated 4 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆40Updated 4 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 3 years ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆162Updated last year
- Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clari…☆26Updated 7 months ago
- ☆90Updated 2 years ago