zplizzi / pytorch-ppo
Simple, readable, yet full-featured implementation of PPO in PyTorch
☆44 Updated 2 years ago
Related projects
Alternatives and complementary repositories for pytorch-ppo
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆132 Updated 3 months ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆134 Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆43 Updated last year
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆133 Updated 5 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆152 Updated 5 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 Updated 4 years ago
- Code for the paper "Phasic Policy Gradient" ☆252 Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT ☆125 Updated 6 months ago
- PyTorch implementation of distributed deep reinforcement learning ☆74 Updated 2 years ago
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆163 Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆98 Updated 4 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆155 Updated 2 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch ☆93 Updated 2 years ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/ ☆90 Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆117 Updated 3 months ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆186 Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆152 Updated last week
- Code for MOPO: Model-based Offline Policy Optimization ☆171 Updated 2 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies ☆56 Updated 3 years ago
- ☆190 Updated last year
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆49 Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆99 Updated 2 years ago
- ☆118 Updated 3 months ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆170 Updated last year
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ☆119 Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆161 Updated 3 months ago
- Adaptive Attention Span for Reinforcement Learning ☆132 Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆157 Updated 2 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments ☆143 Updated 3 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement learning Actor-Critic algorithms A2C and PPO ☆192 Updated 2 years ago