brett-daley / dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆24 Updated 6 months ago
Related projects
Alternatives and complementary repositories for dqn-lambda
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆66 Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 Updated 6 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆70 Updated 7 years ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning ☆21 Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆62 Updated 3 years ago
- A library of probabilistic model-based RL algorithms in PyTorch ☆107 Updated 3 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019 ☆48 Updated 5 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019) ☆66 Updated 5 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" ☆21 Updated 4 years ago
- TensorFlow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir… ☆126 Updated 4 years ago
- research and implementations of Deep RL agents and their applications ☆47 Updated 3 weeks ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆44 Updated 4 years ago
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆50 Updated 3 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner ☆27 Updated 4 years ago
- Deep Variational Reinforcement Learning ☆134 Updated 2 years ago
- Implementation of [Learning Continuous Control Policies by Stochastic Value Gradients](https://arxiv.org/abs/1510.09142) ☆25 Updated 2 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model ☆149 Updated 4 years ago
- A3C-style Option-Critic with deliberation cost ☆39 Updated 6 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 Updated 2 years ago
- Prioritized Sequence Experience Replay ☆10 Updated 3 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆25 Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL ☆86 Updated 5 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF… ☆29 Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning ☆27 Updated 5 years ago