nric / ProximalPolicyOptimizationKeras
This is a deterministic TensorFlow 2.0 (Keras) implementation of OpenAI's Proximal Policy Optimization (PPO) actor-critic algorithm.
☆11 Updated 4 years ago
Related projects
Alternatives and complementary repositories for ProximalPolicyOptimizationKeras
- Proximal Policy Optimization (PPO) with Keras Implementation ☆17 Updated 4 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow. ☆36 Updated last year
- My implementation of the Proximal Policy Optimisation algorithm using Keras as a backend ☆88 Updated 4 years ago
- Basic reinforcement learning implementation with TensorFlow version 2.0 ☆52 Updated 4 years ago
- Keras Implementation of PPO to solve OpenAI Gym Environments ☆16 Updated 6 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆45 Updated last year
- Experiments with reinforcement learning and recurrent neural networks ☆113 Updated last year
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆50 Updated 3 years ago
- TensorFlow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir… ☆126 Updated 4 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆42 Updated 4 years ago
- 🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2. ☆67 Updated 3 years ago
- Solving OpenAI's game 'BipedalWalker-v2' with Deep Reinforcement Learning ☆26 Updated 4 years ago
- ☆72 Updated last year
- Proximal Policy Optimization implementation with TensorFlow ☆104 Updated 6 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆48 Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆93 Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆43 Updated last year
- Rainbow, TensorFlow ☆49 Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 Updated 6 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr… ☆35 Updated 5 years ago
- Proximal Policy Optimization with TensorFlow and OpenAI Gym ☆17 Updated 6 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF… ☆28 Updated 4 years ago
- PyTorch implementation of distributed deep reinforcement learning ☆74 Updated 2 years ago
- Old and new Reinforcement Learning algorithms run on the GridUniverse ecosystem ☆22 Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari ☆82 Updated 6 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x ☆61 Updated 3 years ago
- Implementation of Recurrent Deterministic Policy Gradient. ☆35 Updated 5 months ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment ☆47 Updated 6 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e… ☆50 Updated 3 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper ☆38 Updated 5 years ago