kevslinger / DTQNLinks
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆169Updated last year
Alternatives and similar repositories for DTQN
Users that are interested in DTQN are comparing it to the libraries listed below
Sorting:
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆180Updated 2 years ago
- A collection of offline reinforcement learning algorithms.☆200Updated 10 months ago
- Datasets with baselines for Offline MARL.☆181Updated last month
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆213Updated last year
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆142Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆190Updated last year
- Transformer in RL for decision-making☆100Updated 2 years ago
- ☆217Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆192Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆194Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆153Updated last year
- An elegant PyTorch offline reinforcement learning library for researchers.☆359Updated 2 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆104Updated 3 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆216Updated last year
- ☆267Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆360Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆132Updated 7 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆178Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆117Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 4 years ago
- ☆102Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆134Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆168Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆84Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆374Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆89Updated last year
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆49Updated last year
- A plotter for reinforcement learning (RL)☆233Updated 3 years ago