datvodinh / ppo-transformerLinks

A Reinforcement Learning Project using PPO + Transformer

☆62

Alternatives and similar repositories for ppo-transformer

Users that are interested in ppo-transformer are comparing it to the libraries listed below

Sorting:

MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183Updated last year
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆151Updated last year
williamyuanv0 / Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey
Transformer in RL for decision-making
☆98Updated 2 years ago
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129Updated 5 months ago
liuzuxin / OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆207Updated 10 months ago
Dragon-Zhuang / BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆87Updated last year
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆146Updated last year
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆367Updated 3 years ago
XinJingHao / PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
☆159Updated last year
kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆165Updated last year
liuzuxin / FSRL
🚀 A fast safe reinforcement learning library in PyTorch
☆204Updated 10 months ago
liuzuxin / DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
☆98Updated 10 months ago
BellmanTimeHut / DIPO
☆108Updated 2 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆171Updated 8 months ago
hcnoh / gail-pytorch
A simple implementation of Generative Adversarial Imitation Learning with PyTorch
☆164Updated 3 years ago
zbzhu99 / madiff
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
☆78Updated last month
datvodinh / recurrent-ppo
A Reinforcement Learning Project using PPO + LSTM
☆89Updated 2 years ago
MyRepositories-hub / Simple-Policy-Optimization
☆79Updated 3 weeks ago
toshikwa / gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
☆222Updated 4 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
yihaosun1124 / OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
☆353Updated 3 weeks ago
schroederdewitt / multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
☆358Updated 2 years ago
alirezakazemipour / SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
☆28Updated 2 months ago
polixir / OfflineRL
A collection of offline reinforcement learning algorithms.
☆191Updated 8 months ago
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆136Updated last year
chrisyrniu / Recent-Advances-in-Multi-Agent-Reinforcement-Learning
A collection of recent MARL papers
☆94Updated 8 months ago
jqueeney / robust-safe-rl
Robust and safe deep reinforcement learning algorithms
☆15Updated last year
Ericonaldo / ILSwiss
ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…
☆174Updated 2 years ago
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆146Updated 3 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆290Updated 4 years ago