datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆49Updated last year
Alternatives and similar repositories for ppo-transformer:
Users that are interested in ppo-transformer are comparing it to the libraries listed below
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆162Updated 10 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆176Updated 10 months ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- ☆111Updated 2 years ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆166Updated 2 weeks ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆83Updated last year
- ☆40Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆179Updated 7 months ago
- Deep recurrent Q learning on CartPole-v1 environment☆88Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆201Updated 7 months ago
- Transformer in RL for decision-making☆97Updated 2 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆142Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆181Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆176Updated 9 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆135Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆169Updated last year
- The official code releasement of publications in MARL field of TJU RL lab.☆75Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆127Updated 9 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆49Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆180Updated 5 months ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 2 months ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆61Updated last year
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆52Updated 4 years ago
- ☆201Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆77Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆102Updated 2 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆128Updated last year
- A Reinforcement Learning Project using PPO + LSTM☆73Updated last year