lucidrains / phasic-policy-gradient

An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorch

☆51

Alternatives and similar repositories for phasic-policy-gradient:

Users that are interested in phasic-policy-gradient are comparing it to the libraries listed below

vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆109Updated 5 months ago
BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆43Updated 4 years ago
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆27Updated last year
zplizzi / pytorch-ppo
Simple, readable, yet full-featured implementation of PPO in Pytorch
☆44Updated 2 years ago
quantumiracle / MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
☆45Updated 10 months ago
google-research / reincarnating_rl
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆93Updated last year
openai / phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
☆259Updated last year
alirezakazemipour / Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
☆28Updated 3 years ago
wisnunugroho21 / reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…
☆50Updated 4 years ago
toshikwa / rljax
A collection of RL algorithms written in JAX.
☆95Updated 2 years ago
adityabingi / Dreamer
Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite
☆34Updated 2 years ago
google-deepmind / csuite
☆43Updated 4 months ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆167Updated 6 months ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆138Updated last year
Howuhh / faster-trajectory-transformer
Implementation of Trajectory Transformer with attention caching and batched beam search
☆109Updated last year
yusukeurakami / dreamer-pytorch
pytorch-implementation of Dreamer (Model-based Image RL Algorithm)
☆165Updated last week
facebookresearch / mtenv
MultiTask Environments for Reinforcement Learning.
☆74Updated 2 years ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆92Updated 3 months ago
evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆102Updated 2 years ago
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆93Updated 3 years ago
denisyarats / exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
☆107Updated 2 years ago
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆164Updated 7 months ago
facebookresearch / controllable_agent
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…
☆61Updated last year
mila-iqia / spr
Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"
☆158Updated 3 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated 6 months ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆121Updated 4 years ago
michaelnny / deep_rl_zoo
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…
☆107Updated 11 months ago
takuseno / d4rl-atari
Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)
☆111Updated 5 months ago
amiranas / minerl_imitation_learning
☆21Updated 4 years ago
openrlbenchmark / openrlbenchmark
☆211Updated 2 months ago