brett-daley / dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆23Updated 11 months ago
Alternatives and similar repositories for dqn-lambda
Users that are interested in dqn-lambda are comparing it to the libraries listed below
Sorting:
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 4 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆27Updated 4 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆61Updated 4 years ago
- ☆71Updated 5 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆32Updated 4 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆38Updated 2 years ago
- ☆98Updated 2 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 3 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47Updated 5 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 2 months ago
- Safe Policy Improvement with Baseline Bootstrapping☆26Updated 5 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Updated last year
- PyTorch IMPALA implementation☆26Updated 5 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)☆33Updated 2 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Updated 7 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆87Updated 7 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 4 years ago
- Soft Actor-Critic with advanced features☆50Updated this week
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 5 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago