TJU-DRL-LAB / self-supervised-rlLinks
☆40Updated 3 years ago
Alternatives and similar repositories for self-supervised-rl
Users that are interested in self-supervised-rl are comparing it to the libraries listed below
Sorting:
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆80Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Code accompanying the paper "Off-Policy Primal-Dual Safe Reinforcement Learning"☆20Updated last year
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 2 years ago
- ☆49Updated 4 years ago
- ☆50Updated 3 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆211Updated 11 months ago
- ☆43Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆186Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆104Updated 3 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆60Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆65Updated last year
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning☆36Updated 3 years ago
- ☆43Updated 3 years ago
- ☆75Updated last year
- Robust and safe deep reinforcement learning algorithms☆15Updated last year
- Generate expert demonstrations; GAIL(Generative Adversarial Imitation Learning); IRL(Inverse Reinforcement Learning)☆32Updated 4 years ago
- ☆39Updated 3 years ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆38Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆43Updated 10 months ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆88Updated last year
- Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning☆26Updated 2 years ago
- ☆17Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆172Updated 10 months ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆85Updated last year
- DSAC; Distributional Soft Actor-Critic☆130Updated 7 months ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 4 years ago
- Implementation of PPO Lagrangian in PyTorch☆50Updated 3 years ago
- A novel Hierarchical Imitation Learning algorithm based on AIRL.☆22Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆60Updated 2 years ago