kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆152Updated 6 months ago
Alternatives and similar repositories for DTQN:
Users that are interested in DTQN are comparing it to the libraries listed below
- This is the official implementation of Multi-Agent PPO.☆102Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆157Updated 9 months ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆130Updated 8 months ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆160Updated last week
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆49Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆114Updated 9 months ago
- A collection of offline reinforcement learning algorithms.☆165Updated 2 months ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆186Updated 4 months ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆142Updated last year
- DSAC; Distributional Soft Actor-Critic☆121Updated 11 months ago
- Code for Weighted QMIX☆126Updated 4 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆68Updated 2 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆174Updated last year
- ☆93Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆124Updated last year
- There will be updates later☆83Updated 5 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆113Updated 2 years ago
- ☆191Updated last year
- ☆231Updated 11 months ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆199Updated 5 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆169Updated 4 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆340Updated 3 years ago
- ☆106Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆97Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆164Updated 7 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆81Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆160Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆134Updated 9 months ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆68Updated last year