brett-daley / dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆24Updated 9 months ago
Alternatives and similar repositories for dqn-lambda:
Users that are interested in dqn-lambda are comparing it to the libraries listed below
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- ☆97Updated last year
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆21Updated 5 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- A library of probabilistic model based RL algorithms in pytorch☆108Updated 3 years ago
- Implementation of the Option-Critic Architecture☆38Updated 6 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Updated last year
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆27Updated 5 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆38Updated 2 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Code to train RL agents along with Adversarial distrubance agents☆63Updated 7 years ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- ☆91Updated last year
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Updated 5 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Updated 6 years ago
- Deep Variational Reinforcement Learning☆134Updated 2 years ago
- FEN Code☆37Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 4 years ago
- PyTorch implementation of CommNet☆36Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆93Updated 6 years ago