magnusja / ppo
Proximal Policy Optimization with TensorFlow and OpenAI Gym
☆17Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for ppo
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Updated 6 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆26Updated last year
- Atari-DRQN (keras ver.)☆33Updated 6 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆134Updated last year
- A repository for code of reinforcement learning algorithms with PyTorch☆29Updated 3 years ago
- Tensorflow implementation of the asynchronous advantage actor-critic (a3c) reinforcement learning algorithm for continuous action space☆46Updated 7 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆53Updated last year
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆132Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated last year
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action☆112Updated 5 years ago
- ☆69Updated 5 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆101Updated 5 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆17Updated 5 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆51Updated 5 years ago
- ☆40Updated 4 years ago
- Proximal Policy Optimization implementation with TensorFlow☆104Updated 6 years ago
- Reinforcement Learning Methods with PyTorch☆38Updated 4 years ago
- ☆70Updated 5 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆21Updated 6 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆98Updated 4 years ago
- PyTorch implementation of CommNet☆36Updated 6 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- Soft Actor-Critic☆141Updated 6 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 5 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆53Updated 5 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆58Updated 4 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆93Updated 2 years ago
- Gym environments modified with adversarial agents☆35Updated 7 years ago