datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆53Updated last year
Alternatives and similar repositories for ppo-transformer
Users who are interested in ppo-transformer are comparing it to the repositories listed below.
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆180Updated 11 months ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆162Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆144Updated last year
- Transformer in RL for decision-making☆97Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆205Updated 8 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆137Updated last year
- A Reinforcement Learning Project using PPO + LSTM☆80Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆146Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆52Updated 3 years ago
- Implementation of PPO Lagrangian in PyTorch☆46Updated 2 years ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆170Updated 3 weeks ago
- ☆40Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆127Updated 3 months ago
- Prioritized Experience Replay implementation with proportional prioritization☆78Updated last year
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆27Updated 3 weeks ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆85Updated last year
- A collection of offline reinforcement learning algorithms.☆185Updated 6 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆168Updated 6 months ago
- This is the official implementation of Multi-Agent PPO.☆106Updated 2 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆194Updated 8 months ago
- ☆102Updated 2 years ago
- A clean and robust PyTorch implementation of PPO for discrete action spaces☆69Updated 11 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 10 months ago
- ☆204Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆112Updated 2 years ago
- A collection of recent MARL papers☆93Updated 6 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆56Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year