baturaysaglam / LA3P
Actor Prioritized Experience Replay
☆15Updated last year
Alternatives and similar repositories for LA3P
Users that are interested in LA3P are comparing it to the libraries listed below
Sorting:
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆13Updated 3 years ago
- Robust and safe deep reinforcement learning algorithms☆14Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆62Updated 11 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆55Updated 2 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- DecentralizedLearning☆24Updated 2 years ago
- ☆49Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆54Updated last year
- Implementations of safe reinforcement learning algorithms☆27Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆107Updated 3 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆75Updated last year
- ☆72Updated last year
- ☆39Updated 2 years ago
- Meta RL codebase for Unstable Baselines☆21Updated 2 years ago
- ☆20Updated 2 years ago
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆91Updated 8 months ago
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆41Updated 6 months ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 3 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆20Updated 5 months ago
- Distributional Soft Actor Critic☆53Updated 4 years ago
- ☆46Updated 2 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆40Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- CORRO code☆35Updated 2 years ago
- ☆55Updated 2 years ago
- Constrained Policy Optimization implementation on Safety Gym☆27Updated 3 years ago
- ☆29Updated 2 years ago
- ☆28Updated 3 years ago
- ☆38Updated 3 years ago