kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆161 Updated 9 months ago
Alternatives and similar repositories for DTQN:
Users who are interested in DTQN are comparing it to the repositories listed below.
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆175 Updated 10 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆169 Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆200 Updated 7 months ago
- Datasets with baselines for offline multi-agent reinforcement learning. ☆166 Updated last week
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆181 Updated 2 years ago
- This is the official implementation of Multi-Agent PPO. ☆105 Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic ☆125 Updated 2 months ago
- Code for Weighted QMIX ☆136 Updated 4 years ago
- ☆201 Updated last year
- PyTorch implementation of SAC-Discrete. ☆304 Updated 9 months ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆51 Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆135 Updated 11 months ago
- There will be updates later ☆84 Updated 5 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆167 Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆107 Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆126 Updated 9 months ago
- 🚀 A fast safe reinforcement learning library in PyTorch ☆185 Updated 7 months ago
- ☆96 Updated 3 years ago
- ☆40 Updated 3 years ago
- Collection of OpenAI parametrized action-space environments. ☆64 Updated last month
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's MuJoCo Gym environments. ☆349 Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL ☆179 Updated 7 months ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆75 Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆85 Updated 2 years ago
- ☆93 Updated 4 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021 ☆65 Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆359 Updated 3 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆75 Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆83 Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT ☆142 Updated last year