zhengsizuo / DRL_udacity
A explaintable and modified version of udacity DRL homework
☆26Updated 4 years ago
Alternatives and similar repositories for DRL_udacity:
Users that are interested in DRL_udacity are comparing it to the libraries listed below
- ☆162Updated last year
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- ☆38Updated 2 years ago
- rl-papers☆48Updated 2 years ago
- ☆122Updated 3 years ago
- RLlib超参数详解(中文)☆16Updated 3 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- A pack of reinforcement learning algorithms.☆83Updated 3 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆28Updated 5 years ago
- ☆20Updated 7 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 4 years ago
- Hello😜☆31Updated 4 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆71Updated last year
- simple code to reinforcement learning☆19Updated 4 years ago
- ☆42Updated 2 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 4 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆97Updated 4 years ago
- ☆16Updated 3 years ago
- ☆26Updated 4 years ago
- 天授中文文档☆56Updated 3 months ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- ☆42Updated last week
- Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clari…☆28Updated 9 months ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆41Updated 4 years ago
- DQN examples codes in chapter 4☆43Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆129Updated last year
- 此项目中将上传我在B站《强化学习理论基础》系列视频中的板书、参考资料等内容。☆76Updated 2 years ago