zhihanyang2022 / drqn
Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert
☆8 · Updated 4 years ago
Alternatives and similar repositories for drqn
Users who are interested in drqn are comparing it to the repositories listed below
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆85 · Updated last year
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆38 · Updated 4 years ago
- DecentralizedLearning ☆24 · Updated 2 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023 ☆26 · Updated last year
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918… ☆44 · Updated 6 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO) ☆17 · Updated 3 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control". ☆26 · Updated 5 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆27 · Updated 5 years ago
- Experiments to train transformer network to master reinforcement learning environments. ☆32 · Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆103 · Updated 4 years ago
- Source code for Pathfinding in Stochastic Environments paper. ☆14 · Updated 2 years ago
- This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor… ☆26 · Updated last year
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020) ☆40 · Updated 2 years ago
- Model-based Policy Gradients ☆31 · Updated 5 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆38 · Updated 2 years ago
- ☆21 · Updated last year
- Model-based reinforcement learning using CEM, MPC and PETS ☆16 · Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 · Updated 6 months ago
- ☆10 · Updated 4 years ago
- Evolution-based Soft Actor-Critic (ESAC) ☆42 · Updated 9 months ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆54 · Updated 4 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆55 · Updated 2 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3… ☆14 · Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆49 · Updated this week
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 · Updated 4 years ago
- PyTorch implementation of Deep Reinforcement Algorithm ☆30 · Updated 2 years ago
- Soft Actor-Critic with advanced features ☆50 · Updated this week
- ☆18 · Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆71 · Updated 10 months ago
- Experiment code for testing effect of various action space transformations in reinforcement learning ☆30 · Updated 4 years ago