zhougroup / IDACLinks
Implicit Distributional Actor Critic
☆11Updated 4 years ago
Alternatives and similar repositories for IDAC
Users that are interested in IDAC are comparing it to the libraries listed below
Sorting:
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 4 months ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Updated 3 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Updated 6 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Updated 4 years ago
- ☆11Updated 5 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago
- The implementation of Discriminator Soft Actor Critic☆15Updated 5 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated 2 years ago
- Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning☆19Updated 2 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Updated 5 years ago
- An unofficial implementation for online decision transformer☆40Updated 3 years ago
- Mirror Descent Policy Optimization☆41Updated 5 years ago
- ☆17Updated last year
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Updated 6 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Updated 3 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Updated 3 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 5 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13Updated 6 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 9 months ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆30Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆27Updated 4 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆27Updated 4 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆26Updated 4 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Updated 2 years ago