twni2016 / Meta-SACLinks
Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020
☆33Updated 4 years ago
Alternatives and similar repositories for Meta-SAC
Users that are interested in Meta-SAC are comparing it to the libraries listed below
Sorting:
- Code for "Multi-task Reinforcement Learning with Soft Modularization"☆122Updated 4 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆68Updated 5 years ago
- Implementation of Relational Deep Reinforcement Learning☆25Updated 5 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆106Updated 3 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆51Updated 4 years ago
- ☆54Updated last year
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020☆55Updated last year
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 7 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 4 months ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"☆48Updated 6 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆153Updated 4 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆46Updated 4 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Updated 3 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆62Updated 6 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆130Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆85Updated 3 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆129Updated 4 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆43Updated 3 years ago
- An unofficial implementation for online decision transformer☆40Updated 3 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆76Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆32Updated 2 years ago
- ☆40Updated 4 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆61Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆69Updated 4 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆94Updated last year
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago