nric / ProximalPolicyOptimizationKeras
This is a deterministic TensorFlow 2.0 (Keras) implementation of OpenAI's Proximal Policy Optimization (PPO) actor-critic algorithm.
☆11Updated 4 years ago
Related projects
Alternatives and complementary repositories for ProximalPolicyOptimizationKeras
- Keras Implementation of PPO to solve OpenAI Gym Environments☆16Updated 6 years ago
- Proximal Policy Optimization(PPO) with Keras Implementation☆17Updated 4 years ago
- My implementation of the Proximal Policy Optimisation algorithm using Keras as a backend☆88Updated 5 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆36Updated last year
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆133Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated last year
- Proximal Policy Optimization implementation with TensorFlow☆104Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Basic reinforcement learning implementation with tensorflow version 2.0☆52Updated 4 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions☆27Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- 🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.☆67Updated 3 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 6 years ago
- Experiments with reinforcement learning and recurrent neural networks☆113Updated last year
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆101Updated 4 years ago
- Proximal Policy Optimization with TensorFlow and OpenAI Gym☆17Updated 6 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆299Updated 9 months ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x☆62Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆43Updated last year
- PyTorch implementation of FQF, IQN and QR-DQN.☆161Updated 3 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆173Updated 2 months ago
- An implementation of GAIL in PyTorch☆14Updated 4 years ago
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch☆27Updated 4 years ago
- A well-documented A2C written in PyTorch☆49Updated 5 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆140Updated 3 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆42Updated 4 years ago
- ☆72Updated last year