kevslinger / DTQNLinks
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆163Updated 11 months ago
Alternatives and similar repositories for DTQN
Users that are interested in DTQN are comparing it to the libraries listed below
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆180Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning.☆172Updated last month
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆180Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆177Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆205Updated 9 months ago
- This is the official implementation of Multi-Agent PPO.☆107Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆128Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆145Updated last year
- ☆209Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆189Updated 7 months ago
- Code for Weighted QMIX☆137Updated 4 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆361Updated 3 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆137Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆168Updated 7 months ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆182Updated 9 months ago
- DSAC; Distributional Soft Actor-Critic☆129Updated 4 months ago
- There will be updates later☆85Updated 6 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆148Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆353Updated 2 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆320Updated 10 months ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆139Updated last year
- ☆252Updated last year
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆158Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆81Updated last year
- ☆99Updated 3 years ago
- PyTorch implementation of SAC-Discrete.☆305Updated 11 months ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆155Updated 2 years ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆224Updated 4 years ago