mabirck / AttentionTRL
Attention mechanism incorporated into DeepMind's Asynchronous Advantage Actor-Critic (A3C/A2C)
☆10Updated 7 years ago
Alternatives and similar repositories for AttentionTRL:
Users who are interested in AttentionTRL are comparing it to the libraries listed below:
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆62Updated 3 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch.☆19Updated 6 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 8 years ago
- hierarchical Q-learning implementation☆11Updated 9 years ago
- The implementation of Discriminator Soft Actor Critic☆15Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago
- Using the PILCO algorithm to find a controller for a few robotics problems☆43Updated 9 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- Reinforcement Learning and Deep Learning Resources☆16Updated 6 years ago
- Hindsight Experience Replay: bit-flipping experiment in TensorFlow☆58Updated 6 years ago
- A simple Gridworld environment for OpenAI Gym☆25Updated 6 years ago
- TensorFlow implementation for "Noisy network for exploration"☆19Updated 7 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- Autoregressive policies for continuous control reinforcement learning☆29Updated 5 years ago
- State Space Models for Reinforcement Learning in TensorFlow☆19Updated 6 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 6 years ago
- Yet another prioritized experience replay buffer implementation.☆48Updated 2 years ago
- Code for the paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆35Updated 6 years ago
- PyTorch implementation of the ICML 2018 paper "Self-Imitation Learning".☆66Updated 6 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 6 years ago
- Exploration by Random Network Distillation☆15Updated 6 years ago
- Reimplementation of the DDPG algorithm using TensorFlow☆38Updated 8 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 6 years ago