lucidrains / phasic-policy-gradient
An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Optimization, in Pytorch
☆46 · Updated this week
Related projects
Alternatives and complementary repositories for phasic-policy-gradient
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 · Updated 2 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 · Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆30 · Updated this week
- ☆65 · Updated 2 weeks ago
- Baselines for gymnax 🤖 ☆60 · Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆27 · Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆99 · Updated 2 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020 ☆50 · Updated 3 months ago
- Pytorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS ☆50 · Updated 2 years ago
- ☆17 · Updated 5 months ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆42 · Updated 4 years ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research. ☆45 · Updated 8 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆107 · Updated last year
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm) ☆163 · Updated 2 years ago
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d… ☆35 · Updated 3 years ago
- A collection of RL algorithms written in JAX. ☆95 · Updated 2 years ago
- ☆28 · Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆86 · Updated 3 weeks ago
- Repository for the QDgym code. A framework for Quality Diversity optimization benchmark tasks based on OpenAI Gym. ☆21 · Updated 3 years ago
- Accelerated replay buffers in JAX ☆40 · Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch ☆46 · Updated last year
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e… ☆50 · Updated 3 years ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite ☆30 · Updated last year
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆81 · Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆58 · Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆80 · Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆40 · Updated this week
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch. ☆50 · Updated 3 years ago
- Benchmarking RL generalization in an interpretable way. ☆132 · Updated 9 months ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆125 · Updated 6 months ago