zhihanyang2022 / drqn
Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert
☆8Updated 3 years ago
Alternatives and similar repositories for drqn:
Users that are interested in drqn are comparing it to the libraries listed below
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆83Updated last year
- ☆21Updated 11 months ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆37Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆38Updated 4 years ago
- soft q learning and soft actor critic☆15Updated 6 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Codebase for Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads paper. Website: https://sites.google.com/view/met…☆30Updated 2 years ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆26Updated 8 months ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆17Updated 3 years ago
- Model-based Policy Gradients☆31Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 5 months ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆43Updated 5 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- DecentralizedLearning☆24Updated 2 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆19Updated 2 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆40Updated 2 years ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent …☆20Updated 4 months ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆25Updated last year
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆53Updated 4 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆45Updated last year
- Toolkit of Causal Model-based Reinforcement Learning.☆33Updated last year
- Graph convolutional memory☆16Updated 2 years ago
- Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)☆23Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆48Updated 2 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- Model Predictive Actor-Critic Reinforcement Learning☆60Updated 3 years ago
- Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations (ICLR 2020)☆25Updated 3 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆22Updated 3 years ago