magnusja / ppo
Proximal Policy Optimization with TensorFlow and OpenAI Gym
☆17Updated 7 years ago
Alternatives and similar repositories for ppo:
Users that are interested in ppo are comparing it to the libraries listed below
- Tensorflow implementation of proximal policy optimization (PPO) algorithm☆13Updated 7 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆27Updated 2 months ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆61Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆137Updated 2 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆21Updated 6 years ago
- ☆83Updated 4 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆25Updated 6 years ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Updated 6 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Updated 2 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆17Updated 6 years ago
- Applying minimaxQ learning algorithm to 2 agents games☆33Updated 7 years ago
- Reinforcement Learning Methods with PyTorch☆39Updated 5 years ago
- A repository for code of reinforcement learning algorithms with PyTorch☆30Updated 3 years ago
- ☆25Updated 7 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆53Updated 6 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Atari-DRQN (keras ver.)☆33Updated 6 years ago
- ☆69Updated 6 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆105Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- ☆49Updated 6 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆66Updated 5 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.