magnusja / ppo
Proximal Policy Optimization with TensorFlow and OpenAI Gym
☆18 Updated 7 years ago
Alternatives and similar repositories for ppo
Users who are interested in ppo are comparing it to the repositories listed below
- Deep Reinforcement Learning Algorithms Implementation in PyTorch ☆27 Updated last year
- Reinforcement Learning Methods with PyTorch ☆38 Updated 6 years ago
- ☆69 Updated 7 years ago
- Atari-DRQN (Keras ver.) ☆33 Updated 7 years ago
- A repository for code of reinforcement learning algorithms with PyTorch ☆30 Updated 4 years ago
- Applying minimaxQ learning algorithm to 2 agents games ☆33 Updated 8 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆51 Updated 5 years ago
- TensorFlow Implementation for "Noisy network for exploration" ☆31 Updated 8 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic … ☆17 Updated 7 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆48 Updated 7 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆44 Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 4 years ago
- TensorFlow implementation of proximal policy optimization (PPO) algorithm ☆13 Updated 7 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆146 Updated 2 years ago
- Meta Reinforcement Learning Experiments ☆35 Updated 8 years ago
- A PyTorch tutorial for DRL (Deep Reinforcement Learning) ☆224 Updated 2 years ago
- ☆49 Updated 6 years ago
- ☆86 Updated 4 years ago
- Disagreement-Regularized Imitation Learning ☆30 Updated 4 years ago
- Basic reinforcement learning implementation with TensorFlow version 2.0 ☆52 Updated 5 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning ☆17 Updated 6 years ago
- TensorFlow implementation of the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm for continuous action space ☆46 Updated 8 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020) ☆106 Updated 3 years ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning ☆96 Updated 6 years ago
- TensorFlow implementation of SNAIL and RL2 ☆11 Updated 6 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆24 Updated 6 years ago
- MAGNet: Multi-agents control using Graph Neural Networks ☆132 Updated 6 years ago
- In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch. ☆19 Updated 7 years ago
- Distributed Prioritized Experience Replay ☆10 Updated 7 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO) ☆372 Updated 6 years ago