karush17 / esacLinks
Evolution-based Soft Actor-Critic (ESAC)
☆42Updated 11 months ago
Alternatives and similar repositories for esac
Users that are interested in esac are comparing it to the libraries listed below
Sorting:
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Updated last year
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆23Updated 4 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14Updated 4 years ago
- DecentralizedLearning☆24Updated 2 years ago
- ☆49Updated 3 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆32Updated 3 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- Distributional Soft Actor Critic☆55Updated 5 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆52Updated 11 months ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆50Updated last month
- PyTorch IMPALA implementation☆26Updated 5 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆24Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆85Updated last year
- Collection of OpenAI parametrized action-space environments.☆65Updated 3 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 4 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- POMDP wrappers for OpenAI Gym☆15Updated 5 years ago
- ☆52Updated 6 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆24Updated 2 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆35Updated 3 years ago