Bick95 / PPO
Comprehensive Implementation of Proximal Policy Optimization
☆12Updated 4 years ago
Alternatives and similar repositories for PPO
Users who are interested in PPO are comparing it to the libraries listed below.
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 7 years ago
- Code for a model-based version of Constrained Policy Optimization☆11Updated 4 years ago
- TorchingUp provides minimal implementations of common Reinforcement Learning algorithms written in PyTorch. It is meant to complement Ope…☆54Updated 3 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆79Updated 2 years ago
- Pytorch implementation of Soft Actor-Critic☆20Updated 5 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆122Updated 4 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆156Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 7 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- TD-Regularized Actor-Critic Methods☆36Updated 6 years ago
- The Arcade Learning Environment (ALE) -- a platform for AI research.☆24Updated last year
- ☆135Updated last year
- Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms☆49Updated last year
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- Code for the paper "Batch size invariance for policy optimization"☆56Updated 2 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks"☆14Updated 3 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- ☆29Updated last year
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- Minimal implementation of the network layers of the paper "Noisy Networks for Exploration" using Pytorch.☆13Updated 10 months ago
- Optim4RL is a Jax framework of learning to optimize for reinforcement learning.☆28Updated last year
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆56Updated 3 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Keeping track of RL experiments☆166Updated 3 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆195Updated 3 years ago
- Mirror Descent Policy Optimization☆42Updated 5 years ago
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Updated 3 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Updated 5 years ago