thanhnguyentang / mmdrl
Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354
☆25Updated 3 years ago
Related projects: ⓘ
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆39Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆27Updated 3 weeks ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆18Updated 5 years ago
- Codebase for the Graph-based Policy Learning algorithm, which is designed for learning policies to solve the open ad hoc teamwork problem…☆29Updated 3 years ago
- ☆17Updated 10 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆49Updated 8 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆24Updated last year
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 3 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆42Updated 2 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 5 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- ☆27Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting