l5shi / Multi-DDPG-with-parameter-noise
New reinforcement learning algorithm based on DDPG
☆17Updated 5 years ago
Related projects:
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆46Updated 3 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated last year
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env☆71Updated 7 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆53Updated last year
- Project 3 of Udacity's Deep Reinforcement Learning nanodegree program.☆45Updated 5 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆71Updated 3 years ago
- Implementation of Recurrent Deterministic Policy Gradient.☆35Updated 3 months ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆35Updated 5 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 4 years ago
- research and implementations of Deep RL agents and their applications☆46Updated 2 weeks ago
- Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite☆25Updated 5 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).☆16Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitfli…☆19Updated 6 years ago
- ☆91Updated 3 years ago
- Pytorch implementation of Soft Actor-Critic☆18Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 5 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- TensorFlow implementation of Generative Adversarial Imitation Learning (GAIL) with discrete actions☆111Updated 5 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆35Updated 6 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆62Updated 6 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆48Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆92Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆60Updated 3 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆47Updated 6 years ago
- My solution to assignments in UC Berkeley CS294-112: Deep Reinforcement Learning☆90Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆44Updated 4 years ago