magnusja / ppo
Proximal Policy Optimization with TensorFlow and OpenAI Gym
☆17 Updated 6 years ago
Related projects
Alternatives and complementary repositories for ppo
- Tensorflow implementation of proximal policy optimization (PPO) algorithm ☆13 Updated 6 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch ☆26 Updated last year
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization ☆51 Updated 5 years ago
- Atari-DRQN (keras ver.) ☆33 Updated 6 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆134 Updated last year
- A repository for code of reinforcement learning algorithms with PyTorch ☆29 Updated 3 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆45 Updated last year
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow ☆58 Updated 6 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation ☆21 Updated 6 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆49 Updated 4 years ago
- ☆81 Updated 3 years ago
- This is about imitation learning using PPO and WGAN-GP loss. This is heavily influenced by GAIL-PPO repository in following link - https:… ☆9 Updated 6 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆93 Updated 2 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge ☆25 Updated 6 years ago
- Tensorflow implementation of the asynchronous advantage actor-critic (a3c) reinforcement learning algorithm for continuous action space ☆46 Updated 7 years ago
- World Models with A3C on Carracing-v0 in gym ☆32 Updated 4 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning ☆66 Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 3 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆104 Updated 6 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action ☆112 Updated 6 years ago
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX ☆68 Updated 3 years ago
- ☆49 Updated 5 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment ☆101 Updated 5 years ago
- ☆69 Updated 5 years ago
- RLOpensource / IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures ☆36 Updated 5 years ago
- Basic reinforcement learning implementation with tensorflow version 2.0 ☆52 Updated 4 years ago
- Applying minimaxQ learning algorithm to 2 agents games ☆32 Updated 6 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND) ☆41 Updated 5 years ago
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow ☆16 Updated 2 weeks ago
- Implementation of proximal policy optimization(PPO) with tensorflow ☆35 Updated 6 years ago