shakti365 / soft-actor-critic
TF2 Implementation of the Soft Actor-Critic Algorithm
☆44Updated last year
Related projects: ⓘ
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 3 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆92Updated 4 years ago
- Implementation of Soft Actor Critic☆37Updated 3 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆92Updated 2 years ago
- Soft Actor-Critic with advanced features☆47Updated 3 weeks ago
- Pytorch implementation of Soft Actor-Critic☆18Updated 4 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 4 years ago
- ☆91Updated 3 years ago
- ☆80Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated last month
- Pytorch implementation of distributed deep reinforcement learning☆72Updated 2 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆128Updated 5 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆131Updated last year
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆77Updated last year
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆92Updated 5 years ago
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆72Updated 4 years ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆88Updated 5 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆98Updated 3 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆158Updated last month
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆40Updated 3 years ago
- Implementation of Linear Inverse Reinforcement Learning Algorithm (IRL) on Mountain Car Environment.☆29Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆53Updated last year
- research and implementations of Deep RL agents and their applications☆46Updated 2 weeks ago
- ☆69Updated 3 months ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆60Updated 3 years ago
- PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression☆25Updated 4 years ago