zhengsizuo / DRL_udacity
A explaintable and modified version of udacity DRL homework
☆26Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for DRL_udacity
- ☆158Updated last year
- RLlib超参数详解(中文)☆14Updated 2 years ago
- ☆38Updated 2 years ago
- DQN examples codes in chapter 4☆41Updated last year
- rl-papers☆44Updated last year
- ☆41Updated 2 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆39Updated 4 years ago
- ☆26Updated 4 years ago
- [动手学强化学习]系列,基于pytorch。☆54Updated 3 years ago
- ☆42Updated 6 months ago
- simple code to reinforcement learning☆19Updated 4 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆54Updated 10 months ago
- ☆45Updated 5 years ago
- ☆121Updated 3 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆27Updated 5 years ago
- Hello😜☆30Updated 4 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆55Updated 4 years ago
- 天授中文文档☆55Updated 2 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆53Updated 3 years ago
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆37Updated 2 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆59Updated 3 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 3 years ago
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning”☆22Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆82Updated 4 years ago
- A pack of reinforcement learning algorithms.☆81Updated 3 years ago
- My internship project in 𝖢𝖠𝖲𝖨𝖠. 🤗☆2Updated 5 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆34Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆82Updated last year