zhengsizuo / DRL_udacity
A explaintable and modified version of udacity DRL homework
☆26Updated 4 years ago
Alternatives and similar repositories for DRL_udacity:
Users that are interested in DRL_udacity are comparing it to the libraries listed below
- ☆159Updated last year
- rl-papers☆48Updated last year
- Learning Resources And Links Of Reinforcement Learning (updating)☆245Updated 3 years ago
- 天授中文文档☆55Updated 2 months ago
- RLlib超参数详解(中文)☆16Updated 3 years ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆167Updated 5 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- ☆41Updated 2 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- Hello😜☆31Updated 4 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆93Updated 4 years ago
- ☆122Updated 3 years ago
- Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clari…☆26Updated 8 months ago
- RL-code for beginners. Enjoying!☆113Updated 4 years ago
- ☆20Updated 6 years ago
- A pack of reinforcement learning algorithms.☆82Updated 3 years ago
- ☆45Updated 5 years ago
- My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗☆3Updated 6 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆18Updated 2 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆56Updated 3 years ago
- ☆38Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆84Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆128Updated last year
- DQN examples codes in chapter 4☆43Updated last year
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆123Updated last week
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 3 years ago
- RL algorithms☆140Updated 3 years ago