kevslinger / DTQNLinks
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆171Updated last year
Alternatives and similar repositories for DTQN
Users that are interested in DTQN are comparing it to the libraries listed below
Sorting:
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆207Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆198Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- ☆220Updated 2 years ago
- Datasets with baselines for Offline MARL.☆191Updated last month
- Transformer in RL for decision-making☆103Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆143Updated last year
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆53Updated last year
- ☆106Updated 4 years ago
- ☆277Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆156Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆139Updated last year
- This is the official implementation of Multi-Agent PPO.☆125Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆107Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆55Updated 4 years ago
- ☆40Updated 4 years ago
- A collection of offline reinforcement learning algorithms.☆207Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆88Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆134Updated 9 months ago
- ☆100Updated 5 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆221Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆198Updated last year
- PyTorch implementation of SAC-Discrete.☆312Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆171Updated last year
- There will be updates later☆87Updated 6 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆294Updated 4 years ago
- ☆40Updated 3 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆166Updated 2 years ago
- Code for Weighted QMIX☆144Updated 5 years ago