Bick95 / PPO
Comprehensive Implementation of Proximal Policy Optimization
☆12Updated 4 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
- Pytorch implementation of Soft Actor-Critic☆20Updated 5 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆122Updated 4 years ago
- Code for the paper "Batch size invariance for policy optimization"☆53Updated 2 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆11Updated 5 months ago
- ☆132Updated last year
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆97Updated 5 months ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆75Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- ☆113Updated 6 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆65Updated 8 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks"☆14Updated 3 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020☆45Updated 5 years ago
- ☆18Updated 6 years ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- Learning to Incentivize Other Learning Agents☆34Updated 3 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆25Updated 5 months ago
- Theory of Reinforcement Learning☆17Updated 4 years ago
- Simple, readable, yet full-featured implementation of PPO in Pytorch☆49Updated 6 months ago
- ☆29Updated last year
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆46Updated 7 years ago
- The Arcade Learning Environment (ALE) -- a platform for AI research.☆24Updated last year
- Implementing REINFORCE algorithm on Pong, Lunar Lander and CartPole + Medium Article☆23Updated 4 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆26Updated 11 months ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Updated 6 years ago
- Code and links for over 25,000 trained Atari agents☆98Updated last year
- Keeping track of RL experiments☆165Updated 2 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆129Updated 4 years ago