Breakend / MultiStepBootstrappingInRL
Here, we compare Q(σ) learning, presented by Sutton and Barto in [1], to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.
☆14 · Updated 7 years ago
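
For orientation, here is a minimal sketch of how the four compared methods relate. It is not taken from the repository. It assumes one common on-policy recursive form of the n-step Q(σ) return, in which σ = 1 at a step means full sampling (Sarsa) and σ = 0 means a full expectation (Tree-Backup). The function name `q_sigma_return` and all of its parameters are hypothetical and chosen only for illustration.

```python
import numpy as np

def q_sigma_return(rewards, q_next, pi_next, a_next, sigmas, gamma):
    """n-step Q(sigma) return, on-policy sketch (assumed form, not the repo's code).

    rewards[k]  : R_{t+k+1}
    q_next[k]   : action values Q(S_{t+k+1}, .)
    pi_next[k]  : target-policy probabilities pi(. | S_{t+k+1})
    a_next[k]   : action A_{t+k+1} actually taken
    sigmas[k]   : sigma_{t+k+1} in [0, 1]
    """
    n = len(rewards)
    # Bootstrap from the value of the last state-action pair.
    G = q_next[-1][a_next[-1]]
    # Work backwards from the last step of the n-step window.
    for k in reversed(range(n)):
        q = np.asarray(q_next[k], dtype=float)
        p = np.asarray(pi_next[k], dtype=float)
        a, s = a_next[k], sigmas[k]
        v_bar = float(np.dot(p, q))  # expected value under the policy
        G = (rewards[k]
             + gamma * (s + (1 - s) * p[a]) * (G - q[a])
             + gamma * (s * q[a] + (1 - s) * v_bar))
    return G

# Illustrative two-step trajectory (made-up numbers).
rewards = [1.0, 2.0]
q_next = [[1.0, 3.0], [2.0, 4.0]]
pi_next = [[0.5, 0.5], [0.25, 0.75]]
a_next = [0, 1]
gamma = 0.5

g_sarsa = q_sigma_return(rewards, q_next, pi_next, a_next, [1, 1], gamma)
g_tree = q_sigma_return(rewards, q_next, pi_next, a_next, [0, 0], gamma)
g_exp_sarsa = q_sigma_return(rewards, q_next, pi_next, a_next, [1, 0], gamma)
```

Under this form, setting every σ to 1 recovers the n-step Sarsa return, setting every σ to 0 recovers the n-step Tree-Backup return, and sampling at every step except the last recovers n-step Expected Sarsa. Intermediate σ values interpolate between sampling and expectation.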
Related projects
Alternatives and complementary repositories for MultiStepBootstrappingInRL
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc. ☆151 · Updated last year
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers ☆24 · Updated last year
- Simple grid-world environment compatible with OpenAI-gym ☆49 · Updated 4 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient… ☆75 · Updated 7 years ago
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym ☆40 · Updated 6 years ago
- Notes and comments about Deep Reinforcement Learning papers ☆76 · Updated 6 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018 ☆62 · Updated 6 years ago
- Hierarchical Deep RL Network ☆30 · Updated 7 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆70 · Updated 7 years ago
- Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite ☆25 · Updated 6 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 · Updated 2 years ago
- Pytorch implementation of Soft Actor-Critic ☆18 · Updated 4 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns. ☆24 · Updated 6 months ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆182 · Updated 5 years ago
- Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro… ☆152 · Updated 3 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow