Bick95 / PPO
Comprehensive Implementation of Proximal Policy Optimization
☆10 Updated 3 years ago
Alternatives and similar repositories for PPO
Users who are interested in PPO are comparing it to the libraries listed below.
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 4 years ago
- Mirror Descent Policy Optimization ☆38 Updated 4 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms. ☆68 Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 Updated 3 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen… ☆32 Updated 4 years ago
- ☆43 Updated 8 years ago
- Pytorch implementation of Soft Actor-Critic ☆19 Updated 5 years ago
- Theory of Reinforcement Learning ☆16 Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆43 Updated 3 years ago
- ☆17 Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆113 Updated 9 months ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆48 Updated 11 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆21 Updated 4 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks" ☆14 Updated 2 years ago
- ☆18 Updated 6 years ago
- Sample-Efficient Automated Deep Reinforcement Learning ☆34 Updated 4 years ago
- Accelerated replay buffers in JAX ☆41 Updated 2 years ago
- ☆28 Updated last year
- ☆47 Updated 4 years ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning. ☆25 Updated 6 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆53 Updated 3 weeks ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆94 Updated 4 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆23 Updated last year
- ☆65 Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 Updated 4 years ago
- ☆41 Updated 3 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆31 Updated 4 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 Updated 9 months ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆38 Updated 4 years ago