kngwyu / Rainy
Deep RL agents with PyTorch
☆36Updated 2 years ago
Related projects: ⓘ
- PyTorch IMPALA implementation☆24Updated 5 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 3 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 4 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 3 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆92Updated 2 years ago
- ☆69Updated 3 months ago
- Soft Actor-Critic with advanced features☆47Updated 3 weeks ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆36Updated last year
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆40Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆36Updated 4 years ago
- Implementation of the Option-Critic Architecture☆37Updated 5 years ago
- ☆16Updated last year
- Pytorch implementation of distributed deep reinforcement learning☆72Updated 2 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆44Updated 5 years ago
- Pytorch implementation of Soft Actor-Critic☆18Updated 4 years ago
- V-MPO torch version with DMLab30 and GTrXL☆12Updated 3 years ago
- ☆95Updated last year
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆49Updated 3 years ago
- Hierarchical Self-Play☆21Updated 5 years ago
- on-policy optimization baselines for deep reinforcement learning☆28Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆65Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆52Updated 4 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆19Updated 4 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆18Updated 5 years ago