l5shi / Multi-DDPG-with-parameter-noiseLinks
New reinforcement algorithm base on DDPG
☆18Updated 6 years ago
Alternatives and similar repositories for Multi-DDPG-with-parameter-noise
Users that are interested in Multi-DDPG-with-parameter-noise are comparing it to the libraries listed below
Sorting:
- Reinforcement Learning -- Imitation Learning, Behavior Cloning, DAgger (Data Aggregation)☆20Updated 7 years ago
- Recurrent Deterministic Policy Gradient actor-critic based Reinforcement Learning algorithm in Action☆37Updated 7 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆79Updated 4 years ago
- ☆80Updated 5 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Updated 4 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 2 years ago
- An implementation of the Deep Deterministic Policy Gradient (DDPG) algorithm using Keras/Tensorflow with the robot simulated using ROS/Ga…☆61Updated 8 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 7 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆30Updated 6 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 5 years ago
- research and implementations of Deep RL agents and their applications☆58Updated last month
- Reinforcement Learning in Vrep☆15Updated 5 years ago
- Python code for Deep Reinforcement Learning with Transfer Learning in a Simulated Environment☆74Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Public Source Seeking Repository☆54Updated 5 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Updated 5 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 4 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆38Updated 6 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning☆33Updated 8 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Updated 4 years ago
- ☆21Updated 2 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow☆17Updated 7 years ago
- Clean Python Implementation of the Deep Deterministic Policy Gradients Algorithm☆77Updated 8 years ago
- ☆10Updated 4 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 7 years ago
- Soft Actor-Critic with advanced features☆51Updated 2 weeks ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆63Updated 7 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆15Updated 7 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env☆71Updated 8 years ago