williamyuanv0 / Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey
Transformer in RL for decision-making
☆87 Updated 2 years ago
Alternatives and similar repositories for Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey:
Users who are interested in Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey are comparing it to the repositories listed below
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022) ☆69 Updated last year
- DSAC; Distributional Soft Actor-Critic ☆123 Updated this week
- A clean and robust PyTorch implementation of SAC on continuous action space ☆66 Updated 8 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆159 Updated 9 months ago
- ☆70 Updated last year
- Implementation of PPO Lagrangian in PyTorch ☆35 Updated 2 years ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆81 Updated last year
- A collection of recent MARL papers ☆83 Updated 2 months ago
- PyTorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.] ☆68 Updated 5 years ago
- ☆28 Updated 10 months ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning" ☆55 Updated last year
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..… ☆116 Updated 9 months ago
- 🚀 A fast safe reinforcement learning library in PyTorch ☆170 Updated 4 months ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆68 Updated 2 years ago
- PyTorch implementation of Constrained Policy Optimization ☆51 Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆54 Updated 8 months ago
- ☆191 Updated last year
- This is the official implementation of Multi-Agent PPO. ☆102 Updated 2 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning ☆154 Updated 7 months ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆49 Updated 3 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆97 Updated 3 years ago
- ☆94 Updated 3 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆127 Updated last year
- Deep recurrent Q learning on CartPole-v1 environment ☆83 Updated last year
- A clean and robust PyTorch implementation of PPO on continuous action space. ☆130 Updated 8 months ago
- MATE: the Multi-Agent Tracking Environment. ☆44 Updated last year
- ☆41 Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆51 Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch ☆187 Updated 5 months ago
- Constrained Policy Optimization implementation on Safety Gym ☆23 Updated 3 years ago