BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆48 Updated this week
Alternatives and similar repositories for Advanced-Soft-Actor-Critic:
Users interested in Advanced-Soft-Actor-Critic are comparing it to the repositories listed below.
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆48 Updated 2 years ago
- ☆74 Updated 9 months ago
- PyTorch IMPALA implementation ☆26 Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated 7 months ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN ☆34 Updated 3 years ago
- ☆53 Updated last year
- Random Network Distillation (RND) algorithm in PyTorch ☆49 Updated 6 years ago
- PyTorch implementation of distributed deep reinforcement learning ☆75 Updated 2 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆102 Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated last week
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- Deep RL agents with PyTorch ☆35 Updated 3 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆60 Updated 5 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 Updated 4 years ago
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆50 Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆52 Updated 2 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019) ☆65 Updated 5 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆71 Updated 7 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆175 Updated 2 years ago
- Learning Individual Intrinsic Reward in MARL ☆62 Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- Distributional Soft Actor Critic ☆52 Updated 4 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆83 Updated last year
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning ☆66 Updated 5 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆26 Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆81 Updated 2 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆65 Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆38 Updated 4 years ago
- ☆97 Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC). ☆103 Updated 4 years ago