brett-daley / dqn-lambdaLinks
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆23Updated last year
Alternatives and similar repositories for dqn-lambda
Users that are interested in dqn-lambda are comparing it to the libraries listed below
Sorting:
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 8 years ago
- ☆99Updated 2 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- ☆47Updated 4 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆27Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆28Updated 6 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 5 years ago
- FEN Code☆37Updated 5 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆61Updated 4 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆34Updated 6 years ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository☆45Updated 5 years ago
- ☆83Updated 4 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Updated 5 years ago
- ☆72Updated 2 years ago
- A practical step-by-step guide to applying RUDDER☆35Updated 5 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 9 months ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆46Updated 4 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago