kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆165 Updated last year
Alternatives and similar repositories for DTQN
Users who are interested in DTQN are comparing it to the repositories listed below.
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆180 Updated 2 years ago
- ☆216 Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆141 Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆189 Updated last year
- Prioritized Experience Replay implementation with proportional prioritization ☆84 Updated 2 years ago
- Datasets with baselines for Offline MARL. ☆178 Updated 3 weeks ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆211 Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆53 Updated 3 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch ☆211 Updated 11 months ago
- DSAC; Distributional Soft Actor-Critic ☆131 Updated 7 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆187 Updated last year
- A collection of recent MARL papers ☆96 Updated 9 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆104 Updated 3 years ago
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning' ☆67 Updated 3 years ago
- A collection of offline reinforcement learning algorithms. ☆196 Updated 9 months ago
- PyTorch implementation of SAC-Discrete. ☆310 Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT ☆151 Updated last year
- Transformer in RL for decision-making ☆100 Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL ☆191 Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆88 Updated last year
- ☆40 Updated 3 years ago
- ☆113 Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆161 Updated last year
- This is the official implementation of Multi-Agent PPO. ☆116 Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆112 Updated 4 years ago
- ☆43 Updated 3 years ago
- ☆102 Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆40 Updated 5 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms ☆163 Updated 2 years ago
- ☆265 Updated last year