zplizzi / pytorch-ppo
Simple, readable, yet full-featured implementation of PPO in Pytorch
☆44Updated 2 years ago
Related projects: ⓘ
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆128Updated 5 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆131Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆92Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated last month
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆158Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆150Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆158Updated last month
- Baseline implementation of recurrent PPO using truncated BPTT☆118Updated 4 months ago
- ☆226Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆146Updated last year
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆119Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆137Updated 3 months ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆55Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆92Updated 2 years ago
- ☆187Updated last year
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆90Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆191Updated last year
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆92Updated 5 years ago
- Pytorch implementation of distributed deep reinforcement learning☆72Updated 2 years ago
- Adaptive Attention Span for Reinforcement Learning☆130Updated 4 years ago
- ☆115Updated last month
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆184Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆152Updated this week
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆179Updated last year
- Gridworld for MARL experiments☆137Updated 3 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆101Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆169Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆245Updated last year
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆77Updated last year