PyTorch implementation of Proximal Policy Optimization
☆53Dec 20, 2017Updated 8 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of Proximal Policy Optimization Algorithm☆20Mar 7, 2018Updated 8 years ago
- Implementation of PPO in Pytorch☆41Dec 6, 2017Updated 8 years ago
- ☆20Apr 10, 2018Updated 7 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Jun 9, 2018Updated 7 years ago
- Proximal Policy Optimization in PyTorch☆39Dec 10, 2017Updated 8 years ago
- [DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation☆52Feb 4, 2020Updated 6 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Electroplating simulation environment☆20Sep 26, 2024Updated last year
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Learning Continuous Control in Deep Reinforcement Learning☆14Nov 24, 2018Updated 7 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆719May 12, 2024Updated last year
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- Research Paper: "Graph Contrastive Learning as a Versatile Foundation for Advanced scRNA-seq Data Analysis"☆10Nov 20, 2024Updated last year
- ☆40Nov 23, 2021Updated 4 years ago
- An implementation of TRPO with GAE in PyTorch☆16Jul 22, 2023Updated 2 years ago
- Winning models for the N+1 Fish, N+2 Fish competition.☆20Sep 7, 2023Updated 2 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago
- A3C LSTM Atari with Pytorch plus A3G design☆570Apr 18, 2023Updated 2 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Oct 31, 2018Updated 7 years ago
- Reason8.ai PyTorch solution for NIPS RL 2017 challenge☆84Oct 15, 2019Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆51Feb 26, 2019Updated 7 years ago
- Train an RL agent to localize actively (PyTorch)☆211Apr 9, 2018Updated 7 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆54Nov 10, 2025Updated 3 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆54May 12, 2025Updated 9 months ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,876May 29, 2022Updated 3 years ago
- BabyAI++: Towards Grounded language Learning beyond Memorization, ICLR BeTR-RL 2020☆26Jul 28, 2020Updated 5 years ago
- Deep Reinforcement Learning with pytorch & visdom☆804Jul 16, 2020Updated 5 years ago
- A pytorch tutorial for DRL(Deep Reinforcement Learning)☆225Apr 24, 2023Updated 2 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Sep 17, 2018Updated 7 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆372Aug 1, 2019Updated 6 years ago
- Simple Cartpole example writed with pytorch.☆170Oct 29, 2019Updated 6 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Mar 18, 2019Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆81Jan 19, 2019Updated 7 years ago
- ☆30Oct 18, 2017Updated 8 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- ☆10Aug 5, 2021Updated 4 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,272Feb 9, 2021Updated 5 years ago
- Implementation of proximal policy optimization(PPO) with tensorflow☆35Feb 10, 2018Updated 8 years ago