deep-reinforcement-learning-book / Chapter4-DQN
DQN example code for Chapter 4
☆44Updated 2 years ago
Alternatives and similar repositories for Chapter4-DQN
Users who are interested in Chapter4-DQN are comparing it to the libraries listed below
- [Hands-on Reinforcement Learning] series, based on PyTorch.☆59Updated 4 years ago
- Basic reinforcement learning algorithms, including DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆96Updated 4 years ago
- Deep recurrent Q-learning on the CartPole-v1 environment☆94Updated 2 years ago
- ☆128Updated 4 years ago
- Decision Transformer: A brand new Offline RL Pattern.☆37Updated 3 years ago
- Hierarchical-DQN in pytorch (not actively maintained)☆73Updated 8 years ago
- ☆27Updated 5 years ago
- PyTorch implementation of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆58Updated 4 years ago
- ☆39Updated 3 years ago
- ☆174Updated 2 years ago
- Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020)☆58Updated 5 years ago
- Transformer in RL for decision-making☆103Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆55Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆169Updated last year
- Collection of OpenAI parametrized action-space environments.☆68Updated 10 months ago
- Simple code for reinforcement learning☆21Updated 5 years ago
- ☆42Updated 6 years ago
- DSAC; Distributional Soft Actor-Critic☆136Updated 11 months ago
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆43Updated 3 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆29Updated 6 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆29Updated 8 months ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆64Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆218Updated last year
- Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016)☆35Updated 6 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆39Updated 2 years ago
- Paper list for constrained policy optimization in reinforcement learning.☆73Updated 2 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Updated last year
- ☆43Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆72Updated last year