l5shi / Multi-DDPG-with-parameter-noiseLinks
New reinforcement algorithm base on DDPG
☆18Updated 6 years ago
Alternatives and similar repositories for Multi-DDPG-with-parameter-noise
Users that are interested in Multi-DDPG-with-parameter-noise are comparing it to the libraries listed below
Sorting:
- TF2 Implementation of the Soft Actor-Critic Algorithm☆43Updated 2 years ago
- Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite☆25Updated 6 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆85Updated 6 years ago
- MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitfli…☆20Updated 7 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expert☆8Updated 4 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Updated 4 years ago
- Recurrent Deterministic Policy Gradient actor-critic based Reinforcement Learning algorithm in Action☆37Updated 3 months ago
- Code for the ICRA2018 paper "Learning with training wheels: Speeding up training with a simple controller for Deep Reinforcement Learning…☆37Updated 6 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- Project 3 of Udacity's Deep Reinforcement Learning nanodegree program.☆45Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Updated 5 years ago
- Udacity's Deep Reinforcement Learning Nano-Degree☆17Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆28Updated 6 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 5 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆45Updated 7 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆61Updated 4 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆77Updated 4 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆13Updated 4 years ago
- ☆72Updated 2 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆37Updated 2 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆106Updated 6 years ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning☆26Updated 7 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆37Updated 6 years ago
- ☆92Updated 4 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆19Updated 6 years ago