twni2016 / Meta-SAC
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆30Updated 3 years ago
Alternatives and similar repositories for Meta-SAC:
Users who are interested in Meta-SAC are comparing it to the libraries listed below
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- ☆73Updated 8 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆26Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆117Updated 4 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆52Updated 4 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- ☆54Updated 11 months ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆63Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆71Updated last year
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Behavior cloning from observation☆35Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Deep Implicit Coordination Graphs☆41Updated 8 months ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆162Updated 2 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆33Updated 4 years ago
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆44Updated 3 years ago
- This repository contains the implementation for the paper "Exploration via Hierarchical Meta Reinforcement Learning".☆60Updated 5 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 5 years ago
- ☆26Updated 6 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆146Updated 3 years ago
- ☆55Updated 2 years ago
- ☆28Updated 3 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆31Updated 2 years ago