thanhnguyentang / mmdrlLinks
Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354
☆27Updated 4 years ago
Alternatives and similar repositories for mmdrl
Users that are interested in mmdrl are comparing it to the libraries listed below
Sorting:
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 3 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆88Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 6 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 5 years ago
- ☆18Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 3 months ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- ☆31Updated 6 years ago
- ☆14Updated 4 years ago
- Mirror Descent Policy Optimization☆40Updated 5 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 5 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆69Updated 4 years ago
- ☆131Updated last year
- Code for Sibling Rivalry and experiments presented in associated paper☆17Updated 5 months ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Updated 6 years ago
- ☆54Updated last year
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- Code for the paper "Batch size invariance for policy optimization"☆53Updated 2 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆35Updated 4 years ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆76Updated 2 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Updated 2 years ago
- ☆19Updated 2 years ago
- V-MPO torch version with DMLab30 and GTrXL☆13Updated 4 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆38Updated 7 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆53Updated 5 months ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Updated 7 years ago