Bick95 / PPO
Comprehensive Implementation of Proximal Policy Optimization
☆11Updated 4 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆121Updated 4 years ago
- Pytorch implementation of Soft Actor-Critic☆20Updated 5 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks"☆14Updated 2 years ago
- Code for the paper "Batch size invariance for policy optimization"☆53Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆72Updated 2 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆65Updated 8 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- ☆133Updated last year
- ☆30Updated last year
- Online Decision Transformer☆266Updated last year
- Reinforcement Learning with Convex Constraints☆14Updated 3 years ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- ☆65Updated last year
- PyTorch implementation of Advantage Actor-Critic (A2C)☆46Updated 7 years ago
- Mirror Descent Policy Optimization☆38Updated 4 years ago
- ☆18Updated 6 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆95Updated 2 months ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆39Updated 3 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆26Updated 9 months ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆47Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆149Updated 2 years ago
- A leaderboard of human and machine performance on the Arcade Learning Environment (ALE).☆20Updated 7 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated last year
- ☆32Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆114Updated last year
- Combining Evolutionary Algorithms and deep RL in various ways☆105Updated 4 years ago
- Awesome RL: Papers, Books, Codes, Benchmarks☆116Updated last year