kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆170 Updated last year
Alternatives and similar repositories for DTQN
Users that are interested in DTQN are comparing it to the libraries listed below
- ☆217 Updated 2 years ago
- Datasets with baselines for Offline MARL. ☆182 Updated this week
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆197 Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆181 Updated 2 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆51 Updated last year
- ☆270 Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL ☆196 Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆218 Updated last year
- A collection of offline reinforcement learning algorithms. ☆203 Updated 11 months ago
- 🚀 A fast safe reinforcement learning library in PyTorch ☆218 Updated last year
- Transformer in RL for decision-making ☆102 Updated 2 years ago
- ☆104 Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆112 Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆106 Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆141 Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆137 Updated last year
- A plotter for reinforcement learning (RL) ☆234 Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆87 Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆54 Updated 4 years ago
- This is the official implementation of Multi-Agent PPO. ☆120 Updated 2 years ago
- Prioritized Experience Replay implementation with proportional prioritization ☆84 Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆70 Updated 2 years ago
- Code for Weighted QMIX ☆142 Updated 4 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆80 Updated 2 years ago
- PyTorch implementation of SAC-Discrete. ☆311 Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆193 Updated last year
- ☆97 Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆155 Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments. ☆360 Updated 2 years ago
- There will be updates later ☆85 Updated 6 years ago